BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Stephen F. Altschul,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

RID: RYTRETRZ01N

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           13,377,472 sequences; 4,581,520,208 total letters

Query= Cowczarzaki_tblastnselU
Length=183

                                                                  Score     E
Sequences producing significant alignments:                      (Bits)  Value

gb|EFW45143.1| predicted protein [Capsaspora owczarzaki ATCC ...  385      1e-105
ref|NP_030274.1| unknown protein [Arabidopsis thaliana] >sp|Q...  85.9     2e-15
ref|XP_002881499.1| hypothetical protein ARALYDRAFT_482716 [A...  84.7     5e-15
gb|ABK23985.1| unknown [Picea sitchensis]                         83.2     1e-14
gb|EFA74551.1| hypothetical protein PPL_00049 [Polysphondyliu...  82.4     2e-14
ref|XP_001752693.1| predicted protein [Physcomitrella patens ...  81.3     6e-14
ref|XP_639255.1| hypothetical protein DDB_G0283129 [Dictyoste...  79.0     3e-13
gb|ACO51726.1| C1orf93 homolog [Rana catesbeiana]                 77.8     5e-13
emb|CBN80880.1| Uncharacterized [Dicentrarchus labrax]            76.6     1e-12
ref|XP_002326159.1| predicted protein [Populus trichocarpa] >...  76.6     1e-12
gb|ACO13446.1| C1orf93 [Esox lucius]                              76.3     2e-12
ref|XP_002574167.1| PRX_like2 domain-containing protein [Schi...  76.3     2e-12
ref|XP_002587381.1| hypothetical protein BRAFLDRAFT_96268 [Br...  75.9     2e-12
gb|ADO27754.1| uncharacterized protein c1orf93-like protein [...  75.5     3e-12
gb|AAW27591.1| SJCHGC05103 protein [Schistosoma japonicum] >e...  75.1     4e-12
ref|NP_001017220.1| hypothetical protein LOC549974 [Xenopus (...  74.7     4e-12
ref|XP_001762195.1| predicted protein [Physcomitrella patens ...  74.7     5e-12
gb|ACI67539.1| C1orf93 homolog [Salmo salar]                      74.7     6e-12
emb|CAX73679.1| hypothetical protein [Schistosoma japonicum]      74.3     7e-12
gb|ACI68733.1| C1orf93 homolog [Salmo salar]                      73.9     9e-12
ref|XP_002965388.1| hypothetical protein SELMODRAFT_68006 [Se...  73.6     1e-11
ref|NP_001158627.1| UPF0308 protein C9orf21 homolog [Oncorhyn...  73.6     1e-11
gb|ACI70082.1| C1orf93 homolog [Salmo salar]                      73.6     1e-11
ref|XP_002273449.1| PREDICTED: hypothetical protein [Vitis vi...  73.2     1e-11
ref|XP_002525198.1| conserved hypothetical protein [Ricinus c...  72.8     2e-11
ref|NP_001087128.1| chromosome 1 open reading frame 93 [Xenop...  72.8     2e-11
ref|XP_002666723.1| PREDICTED: UPF0308 protein C9orf21 homolo...  72.4     2e-11
emb|CAG07032.1| unnamed protein product [Tetraodon nigroviridis]  72.4     2e-11
gb|ACU20039.1| unknown [Glycine max]                              72.0     3e-11
ref|NP_201385.2| unknown protein [Arabidopsis thaliana] >gb|A...  71.6     4e-11
ref|NP_998478.1| hypothetical protein LOC406605 [Danio rerio]...  71.6     4e-11
ref|XP_002664824.1| PREDICTED: hypothetical protein [Danio re...  71.6     5e-11
ref|XP_002977235.1| hypothetical protein SELMODRAFT_58056 [Se...  71.2     6e-11
ref|XP_001370596.1| PREDICTED: similar to SFLQ611 [Monodelphi...  70.5     8e-11
ref|XP_002334170.1| predicted protein [Populus trichocarpa] >...  70.5     8e-11
ref|XP_002866692.1| hypothetical protein ARALYDRAFT_496819 [A...  70.1     1e-10
ref|XP_795970.1| PREDICTED: hypothetical protein [Strongyloce...  69.3     2e-10
ref|XP_001926014.1| PREDICTED: UPF0765 protein C10orf58 isofo...  69.3     2e-10
ref|NP_001069904.1| hypothetical protein LOC616897 [Bos tauru...  69.3     2e-10
ref|XP_003130904.1| PREDICTED: UPF0308 protein C9orf21 homolo...  68.6     4e-10
gb|EEC75002.1| hypothetical protein OsI_11064 [Oryza sativa I...  68.2     4e-10
ref|XP_002468056.1| hypothetical protein SORBIDRAFT_01g038790...  68.2     4e-10
ref|NP_001049763.1| Os03g0284600 [Oryza sativa Japonica Group...  68.2     5e-10
gb|ACR37670.1| unknown [Zea mays]                                 67.8     6e-10
ref|XP_001493974.2| PREDICTED: similar to UPF0308 protein C9o...  67.8     6e-10
gb|EGC30304.1| hypothetical protein DICPUDRAFT_99572 [Dictyos...  67.4     7e-10
gb|EDL84435.1| similar to UPF0308 protein C9orf21, isoform CR...  67.4     7e-10
dbj|BAB24662.1| unnamed protein product [Mus musculus]            67.4     8e-10
ref|XP_001784902.1| predicted protein [Physcomitrella patens ...  67.4     9e-10
ref|XP_002928763.1| PREDICTED: UPF0308 protein C9orf21-like, ...  67.0     9e-10
gb|EFB20490.1| hypothetical protein PANDA_018799 [Ailuropoda ...  67.0     9e-10
ref|NP_079646.1| hypothetical protein LOC66129 [Mus musculus]...  67.0     1e-09
ref|XP_001106503.1| PREDICTED: UPF0308 protein C9orf21-like [...  67.0     1e-09
gb|ADO28366.1| upf0308 protein c9orf21-like protein [Ictaluru...  66.2     2e-09
ref|NP_001180447.1| selenoprotein U [Gallus gallus] >ref|NP_0...  66.2     2e-09
ref|ZP_01909605.1| hypothetical protein PPSIR1_24954 [Plesioc...  66.2     2e-09
dbj|BAE40544.1| unnamed protein product [Mus musculus]            66.2     2e-09
sp|Q5ZI34.2|CJ058_CHICK RecName: Full=UPF0765 protein C10orf5...  65.9     2e-09
gb|EAW92661.1| chromosome 9 open reading frame 21, isoform CR...  65.9     2e-09
ref|NP_001180474.1| selenoprotein U [Oryzias latipes]             65.9     2e-09
ref|XP_002820049.1| PREDICTED: UPF0308 protein C9orf21-like [...  65.5     3e-09
ref|XP_520707.2| PREDICTED: similar to TPA_exp: C9ORF21 isofo...  65.5     3e-09
gb|EDL90877.1| similar to RIKEN cDNA 5730469M10, isoform CRA_...  65.1     4e-09
ref|NP_001145525.1| hypothetical protein LOC100278941 [Zea ma...  65.1     4e-09
gb|EDL90876.1| similar to RIKEN cDNA 5730469M10, isoform CRA_...  65.1     4e-09
ref|NP_001014162.1| hypothetical protein LOC361118 precursor ...  65.1     4e-09
ref|XP_536403.1| PREDICTED: similar to R53.5 [Canis familiaris]   65.1     4e-09
gb|ACN25853.1| unknown [Zea mays]                                 64.7     5e-09
ref|XP_001368977.1| PREDICTED: similar to C9ORF21 [Monodelphi...  64.7     5e-09
gb|AAI14901.1| Chromosome 1 open reading frame 93 ortholog [B...  64.7     5e-09
ref|XP_002922464.1| PREDICTED: uncharacterized protein C10orf...  64.7     5e-09
ref|NP_001035688.1| hypothetical protein LOC617001 [Bos tauru...  64.7     5e-09
ref|NP_714542.1| hypothetical protein LOC195827 [Homo sapiens...  64.7     6e-09
ref|XP_002924432.1| PREDICTED: uncharacterized protein C1orf9...  64.3     7e-09
gb|ACN35248.1| unknown [Zea mays]                                 64.3     7e-09
ref|XP_001496590.1| PREDICTED: similar to SFLQ611 [Equus caba...  64.3     7e-09
ref|XP_002945792.1| hypothetical protein VOLCADRAFT_127337 [V...  63.9     8e-09
ref|XP_546736.2| PREDICTED: hypothetical protein XP_546736 [C...  63.9     8e-09
ref|XP_001915482.1| PREDICTED: hypothetical protein [Equus ca...  63.9     8e-09
sp|Q641F0.2|CJ058_XENLA RecName: Full=UPF0765 protein C10orf5...  63.5     1e-08
ref|NP_001087861.1| chromosome 10 open reading frame 58 [Xeno...  63.2     1e-08
gb|ACO12061.1| C10orf58 homolog precursor [Lepeophtheirus sal...  63.2     1e-08
ref|XP_002463942.1| hypothetical protein SORBIDRAFT_01g009350...  63.2     1e-08
ref|NP_001106912.1| prostamide/PG F synthase [Sus scrofa] >db...  63.2     2e-08
gb|ADE77692.1| unknown [Picea sitchensis]                         63.2     2e-08
ref|XP_002191279.1| PREDICTED: hypothetical protein [Taeniopy...  62.8     2e-08
ref|XP_002320012.1| predicted protein [Populus trichocarpa] >...  62.8     2e-08
ref|NP_001180455.1| selenoprotein U [Taeniopygia guttata] >re...  62.8     2e-08
ref|NP_001008167.1| chromosome 9 open reading frame 21 [Xenop...  62.8     2e-08
ref|NP_001029771.1| hypothetical protein LOC534049 precursor ...  62.4     2e-08
ref|XP_002756224.1| PREDICTED: uncharacterized protein C10orf...  62.4     3e-08
ref|NP_001092155.1| hypothetical protein LOC100049742 [Xenopu...  62.4     3e-08
ref|XP_002263959.1| PREDICTED: hypothetical protein [Vitis vi...  62.0     3e-08
emb|CAN81555.1| hypothetical protein VITISV_040397 [Vitis vin...  62.0     3e-08
ref|NP_001051154.1| Os03g0729300 [Oryza sativa Japonica Group...  62.0     4e-08
ref|NP_001051153.1| Os03g0729200 [Oryza sativa Japonica Group...  61.6     4e-08
gb|EAY91739.1| hypothetical protein OsI_13380 [Oryza sativa I...  61.6     4e-08
ref|XP_002269002.1| PREDICTED: hypothetical protein, partial ...  61.6     4e-08
emb|CBI33289.3| unnamed protein product [Vitis vinifera]          61.6     4e-08
ref|XP_848380.1| PREDICTED: similar to UPF0308 protein C9orf2...  61.6     5e-08

ALIGNMENTS

>gb|EFW45143.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length=297

 Score = 385 bits (989), Expect = 1e-105, Method: Compositional matrix adjust.
 Identities = 183/183 (100%), Positives = 183/183 (100%), Gaps = 0/183 (0%)

Query  1    ASNNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVK  60
            ASNNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVK
Sbjct  115  ASNNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVK  174

Query  61   IVLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISE  120
            IVLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISE
Sbjct  175  IVLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISE  234

Query  121  WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA  180
            WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA
Sbjct  235  WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA  294

Query  181  TQP  183
            TQP
Sbjct  295  TQP  297

>ref|NP_030274.1| unknown protein [Arabidopsis thaliana]
 sp|Q9ZUU2.2|U308_ARATH RecName: Full=UPF0308 protein At2g37240, chloroplastic; Flags: Precursor
 gb|AAK91362.1| At2g37240/F3G5.3 [Arabidopsis thaliana]
 gb|AAC98045.2| expressed protein [Arabidopsis thaliana]
 gb|AAM67197.1| unknown [Arabidopsis thaliana]
 gb|AAP21145.1| At2g37240/F3G5.3 [Arabidopsis thaliana]
Length=248

 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/194 (30%), Positives = 96/194 (50%), Gaps = 33/194 (17%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + +KVL+L E+ ++D+WKD++ ++ R FGC LC ++A+++ E K +DA+GV +V Sbjct 72 DTVKVLDLRGN-EIPISDLWKDRKAVVAFARHFGCVLCRKRAAYLAEKKDVMDASGVALV 130 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR 122 L+G G+ A F+E +F EVY DP +Y+A L F+S +++ Sbjct 131 LIGPGSIDQANTFVEQT-----KFKGEVYADPNHASYEA--------LEFVSGVSVTFTP 177 Query 123 KANKNHPNADLQG----------------DGLQTGGIYLVGPGADSAIHFAFNEYDHPVG 166 KA + ++G G Q GGI + GPG D + ++ D G Sbjct 178 KAAMKILESYMEGYRQDWKLSFMKDTVERGGWQQGGILVAGPGKD---NISYIRKDKEAG 234 Query 167 TLVDNDQILAAVKA 180 ++IL A A Sbjct 235 DDPPVEEILKACCA 248 >ref|XP_002881499.1| hypothetical protein ARALYDRAFT_482716 [Arabidopsis lyrata subsp. lyrata] gb|EFH57758.1| hypothetical protein ARALYDRAFT_482716 [Arabidopsis lyrata subsp. lyrata] Length=248 Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 57/194 (30%), Positives = 95/194 (49%), Gaps = 33/194 (17%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + +K+L+L E+ ++D+WKD++ ++ R FGC LC ++A+++ E K +DA+GV +V Sbjct 72 DTVKILDLRGN-EIPISDLWKDRKAVVAFARHFGCVLCRKRAAYLAEKKDVMDASGVTLV 130 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR 122 L+G G+ A F+E +F EVY DP +Y+A L F+S ++ Sbjct 131 LIGPGSIDQANTFMEQT-----KFKGEVYADPNHASYEA--------LEFVSGVTVTFTP 177 Query 123 KANKNHPNADLQG----------------DGLQTGGIYLVGPGADSAIHFAFNEYDHPVG 166 KA + ++G G Q GGI + GPG D + ++ D G Sbjct 178 KAAMKILESYMEGYRQDWKLSFMKDTVERGGWQQGGILVAGPGKD---NISYIRKDKEAG 234 Query 167 TLVDNDQILAAVKA 180 ++IL A A Sbjct 235 DDPPVEEILKACCA 248 >gb|ABK23985.1| unknown [Picea sitchensis] Length=261 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 51/149 (35%), Positives = 76/149 (52%), Gaps = 13/149 (8%) Query 16 LVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKF 75 L L D+WKD++ ++ R FGC LC ++A + K Q+DAAGV +VL+G GN A+ F Sbjct 97 LHLTDLWKDRKAVIGFARHFGCVLCRKRADVLASQKSQMDAAGVALVLIGPGNIEQAKAF 156 Query 76 IENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHF--LSWTAISE-WRKANKNHPNAD 132 + +FP E+Y DP T++ A F L+ T I E + + + Sbjct 157 ADQT-----KFPGEIYADPNHTSFNALKFVSGVFTTFTPLAATKIIELYVEGYRQDWGLS 211 Query 133 LQGD-----GLQTGGIYLVGPGADSAIHF 156 Q D G Q GGI + GPG D+ ++ Sbjct 212 FQKDTMNRGGWQQGGILVAGPGGDNILYL 240 >gb|EFA74551.1| hypothetical protein PPL_00049 [Polysphondylium pallidum PN500] Length=662 Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 55/172 (32%), Positives = 88/172 (52%), Gaps = 13/172 (7%) Query 16 LVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKF 75 L +W ++R ++ +LRRFGC +C Q + +KP+LD G+ ++ +G R E F Sbjct 136 LPFTSLWNNKRCVIAVLRRFGCLVCRLQCMDLSSLKPKLDRMGIALIAIGF-ERVGLEDF 194 Query 76 IENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHF---LSWTAISEWRK-ANKNHPNA 131 I G F E+YID ++ Y+A L+R+G L +S +RK A + + Sbjct 195 IA-----GGFFNGEIYIDRSRSVYRALSLKRMGFWDTTIGLMDPRLSVYRKEAKEKGLPS 249 Query 132 DLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQP 183 + +GDGLQ G +VGP A H+ F + + + D ++IL A K P Sbjct 250 NFRGDGLQLGATLVVGPKPQGA-HYDFRQKNFL--DVFDLNKILKACKQPYP 298 >ref|XP_001752693.1| predicted protein [Physcomitrella patens subsp. patens] gb|EDQ82564.1| predicted protein [Physcomitrella patens subsp. patens] Length=293 Score = 81.3 bits (199), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 53/178 (30%), Positives = 93/178 (53%), Gaps = 17/178 (9%) Query 6 KVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG 65 K+ D + + L+ W+DQ V+L +LRRFGC LC Q+ + ++ QL+A V++V +G Sbjct 121 KIRGPGDSQNVKLSSFWEDQPVVLHVLRRFGCQLCRGQSVEMAKMLSQLEANNVRVVGIG 180 Query 66 TGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVG----LLHFLSWTAISEW 121 ++ E+F EN + +E+YID E+ +KA L +VG + + ++ E Sbjct 181 L-EKFGLEEFEEN-----NYWKSELYIDNEKKIHKALALTKVGWVGTFMMLFANKSVKEA 234 Query 122 RKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEY-DHPVGTLVDNDQILAAV 178 + K+ P + QGDG Q G +++ G + + ++ D P N +IL A+ Sbjct 235 AQKTKDTP-GNFQGDGRQLGATFVMAKGGELLLDHRQKDFGDQPT-----NAEILTAL 286 >ref|XP_639255.1| hypothetical protein DDB_G0283129 [Dictyostelium discoideum AX4] gb|EAL65894.1| hypothetical protein DDB_G0283129 [Dictyostelium discoideum AX4] Length=883 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 46/158 (30%), Positives = 87/158 (56%), Gaps = 11/158 (6%) Query 5 IKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLV 64 I V ++ D KEL+L +++++R+++ + RRFGC +C QA + +KP+LD G+++V + Sbjct 449 ITVCDVTDGKELLLTSLYENKRIVVAIFRRFGCLICRLQALDLSALKPKLDKIGIELVGI 508 Query 65 GTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLH----FLSWTAISE 120 G F E+ +E + F ++Y+D ++ Y+A L+R L FL + Sbjct 509 G-----FDEEGLEEFQ-QLKFFAGKIYLDKTRSVYRALNLKRRSKLTTYELFLDPRVMVY 562 Query 121 WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF 158 +R+ + +++ + DG Q G ++GP A H+ F Sbjct 563 YRRIKEMGFSSNYRKDGFQLGATMVLGPKPQEA-HYDF 599 >gb|ACO51726.1| C1orf93 homolog [Rana catesbeiana] Length=201 Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 52/169 (31%), Positives = 84/169 (50%), Gaps = 13/169 (7%) Query 21 MWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIENVP 80 +WKD ++ LRRFGC +C A V ++K LDA ++++ +G E F++ Sbjct 28 LWKDNTSVIFFLRRFGCQICRWIAKDVSQLKESLDANQIRLIGIGPETVGLQE-FLD--- 83 Query 81 GNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADLQGD 136 G+ F E+Y+D + +YK G +R L + + R KAN + + GD Sbjct 84 --GKYFTGELYLDESKQSYKELGFKRYNALSIVPAALGKKVRDIVTKANADGVQGNFSGD 141 Query 137 GLQTGGIYLVGPGADSA-IHFAFNEYDH--PVGTLVDNDQILAAVKATQ 182 LQ+GG+ +V G + A +HF + P+ TLV I A V ++Q Sbjct 142 LLQSGGMLVVSKGGEKALLHFVQDSPGDFVPLDTLVTALGITADVTSSQ 190 >emb|CBN80880.1| Uncharacterized [Dicentrarchus labrax] Length=201 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 60/184 (33%), Positives = 91/184 (50%), Gaps = 14/184 (7%) Query 7 VLELADKKELV-LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG 65 +L+ A+ +E V L +W+DQ V+L LRRFGC +C AS + +++P L A+GV + VG Sbjct 13 LLKSAETEESVELQSLWQDQPVVLFFLRRFGCQVCRWMASEISKLEPDLRASGVALAGVG 72 Query 66 TGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR--- 122 AE F E G F +Y+D + YK G +R + + + R Sbjct 73 PEEFGLAE-FKE-----GGFFKGSLYVDETKKTYKDLGFKRYTAISVVPAALGKKVRDIA 126 Query 123 -KANKNHPNADLQGDGLQTGGIYLVGPGADSA-IHFAFNE-YDH-PVGTLVDNDQILAAV 178 KA + + GD LQ+GG+ +V G + +HF + DH P+ + I A V Sbjct 127 AKAKADGIQGNFSGDLLQSGGMLIVAKGGEKVLLHFIQDSPGDHLPLEDISKALGISATV 186 Query 179 KATQ 182 KA Q Sbjct 187 KAGQ 190 >ref|XP_002326159.1| predicted protein [Populus trichocarpa] gb|EEE71829.1| predicted protein [Populus trichocarpa] Length=200 Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 50/157 (32%), Positives = 81/157 (52%), Gaps = 18/157 (11%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + ++V +L + + +D+WKD++ ++ R FGC LC +A ++ K +DA+GV +V Sbjct 25 DTVEVFDL-NGNAIPFSDLWKDRKAVVAFARHFGCVLCRRRADYLAAKKDIMDASGVALV 83 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRV-GLLHFLSWTA---- 117 L+G G+ A+ F E +F EVY DP ++YKA LQ V G+ + A Sbjct 84 LIGPGSVDQAKTFSEQT-----KFKGEVYADPSHSSYKA--LQFVSGVSTTFTPKAGLKI 136 Query 118 ISEWRKANKNHPNADLQGD-----GLQTGGIYLVGPG 149 I + + + +GD G Q GGI + GPG Sbjct 137 IQSYMEGYRQDWKLSFEGDTVAKGGWQQGGIIVAGPG 173 >gb|ACO13446.1| C1orf93 [Esox lucius] Length=225 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 55/179 (31%), Positives = 87/179 (49%), Gaps = 26/179 (14%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 ++++D++ ++I +R F C C E + I + L AG+++V++G + + + F Sbjct 46 FKEVYQDRKSVIIFVRNFLCHTCKEYVDDLSRIPGEVLKEAGLRLVVIGQSSHHHIQSFC 105 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQR----VGLL----HFLSWTAI----SEWRKA 124 + R+P E+Y+DPE+ YK G+ R VGL H S + S WR Sbjct 106 -----SLTRYPHEMYVDPERCIYKKLGMNRGEISVGLAQPSPHVKSGMLVGHMKSIWRAM 160 Query 125 NKNHPNADLQGDGLQTGGIYLVGPGADSAI-HFAFNEYDH-PVGTLVDNDQILAAVKAT 181 P D QGD Q GG + GPG++ HF N DH P+ L+ LA V+ T Sbjct 161 TS--PIFDFQGDPRQQGGAIIAGPGSEVHFAHFDMNRLDHMPINWLLQ----LAGVRQT 213 >ref|XP_002574167.1| PRX_like2 domain-containing protein [Schistosoma mansoni] emb|CAZ30400.1| PRX_like2 domain-containing protein [Schistosoma mansoni] Length=203 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 50/168 (30%), Positives = 81/168 (49%), Gaps = 16/168 (9%) Query 17 VLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFI 76 L W+DQ ++ RR GC C +A ++ +KP LDA +K++ + T + ++F+ Sbjct 24 TLDSFWRDQTCIITFFRRLGCKFCRLEAKNLSYLKPVLDARNIKLMGI-TFDEGGVKEFL 82 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRV----GLLHFLSWTAISEWRKANKNHPNAD 132 + G F ++Y+D E+ YKA ++V G L+ S KA + + Sbjct 83 D-----GHYFDGDLYLDRERKTYKALEYKKVSACSGFCSLLTKAGRSLNSKAKAANIPGN 137 Query 133 LQGDGLQTGGIYLVGPGADSAIHFAFNE-YDHPVGTLVDNDQILAAVK 179 + GDG QTGG+ +V G HF E +HP D QI+ +K Sbjct 138 MSGDGWQTGGLLVVEKGGKVLYHFEQKEVVNHP-----DYKQIIDVLK 180 >ref|XP_002587381.1| hypothetical protein BRAFLDRAFT_96268 [Branchiostoma floridae] gb|EEN43392.1| hypothetical protein BRAFLDRAFT_96268 [Branchiostoma floridae] Length=185 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 44/141 (32%), Positives = 73/141 (52%), Gaps = 10/141 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +W+ + +L+ LRRFGC +C A+ + ++KPQLDAA V +V VG ++F++ Sbjct 25 LGSLWESRACVLLFLRRFGCQVCRWTATELSKLKPQLDAANVNLVGVGP-EEVGVDEFVQ 83 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 G+ F ++Y+D + YK G +R L+ + A + R KA + Sbjct 84 -----GKFFAGDLYVDETKQCYKDLGYRRYNALNVIPAAASKKSRDVINKAKAEGIPGNF 138 Query 134 QGDGLQTGGIYLVGPGADSAI 154 +GD LQ GG +V G + + Sbjct 139 KGDLLQAGGTLIVVAGGEKVL 159 >gb|ADO27754.1| uncharacterized protein c1orf93-like protein [Ictalurus furcatus] Length=201 Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 48/172 (28%), Positives = 81/172 (48%), Gaps = 16/172 (9%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG---TGNRYFAEK 74 L+ +WKD+ V++ LRRFGC +C A+ V +++ L GV ++ +G TG + F Sbjct 25 LSSLWKDKTVVMFFLRRFGCQICRWAAAEVSKLEKDLRENGVALIGIGPEETGLKEFE-- 82 Query 75 FIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPN 130 +G F E+YID ++ YK G +R ++ L + R KA+ Sbjct 83 -------DGGFFKGEIYIDEKKQCYKELGFKRYNAINVLPAALGKKVREIASKASNEGIQ 135 Query 131 ADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQ 182 + GD LQ+GG+ +V G + + E + L D ++L + Q Sbjct 136 GNFSGDLLQSGGMLIVAKGGEKVLLHFIQETPGDLVPLEDITKVLGISASVQ 187 >gb|AAW27591.1| SJCHGC05103 protein [Schistosoma japonicum] emb|CAX73680.1| hypothetical protein [Schistosoma japonicum] Length=203 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 45/156 (29%), Positives = 77/156 (50%), Gaps = 11/156 (7%) Query 14 KELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAE 73 + + L W+D+ ++ RR GC C +A ++ +KP LD +K++ + T + + Sbjct 21 QTVTLESFWRDRTCIVTFFRRMGCKFCRLEAKNLSYLKPALDTRNIKLIGI-TFDVGGVK 79 Query 74 KFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRV----GLLHFLSWTAISEWRKANKNHP 129 +F+ +G F ++Y+DPE+ YKA G ++V G + S A + KA Sbjct 80 EFL-----DGHYFDGDLYLDPERMTYKALGYKKVSPCSGAISLFSKAARALNSKAKAAKI 134 Query 130 NADLQGDGLQTGGIYLVGPGADSAIHFAFNE-YDHP 164 +L GDG QTGG+ +V G ++ E HP Sbjct 135 PGNLSGDGWQTGGLLVVEKGGKILYYYEQKEVVRHP 170 >ref|NP_001017220.1| hypothetical protein LOC549974 [Xenopus (Silurana) tropicalis] sp|Q28IJ3.1|CA093_XENTR RecName: Full=Uncharacterized protein C1orf93 homolog emb|CAJ83264.1| novel protein [Xenopus (Silurana) tropicalis] Length=201 Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 51/172 (30%), Positives = 83/172 (49%), Gaps = 13/172 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +WK++ +L+ LRRFGC +C A + ++K DA +++V +G E F+E Sbjct 25 LKSLWKEKTTVLLFLRRFGCQICRWIAKDIGKLKASCDAHQIRLVGIGPEEVGLKE-FLE 83 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 G F E+YID + +YK G +R L + + R KAN + + Sbjct 84 -----GNFFNGELYIDESKESYKTLGFKRYSALSVIPAALGKKVRDIVTKANADGVQGNF 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHFAFNEYDH--PVGTLVDNDQILAAVKATQ 182 GD LQ+GG+ +V G + +HF + P+ ++V I A V +Q Sbjct 139 SGDLLQSGGMLIVSKGGEKVLLHFIQDSPGDYVPLESIVQTLGITANVTESQ 190 >ref|XP_001762195.1| predicted protein [Physcomitrella patens subsp. patens] gb|EDQ72987.1| predicted protein [Physcomitrella patens subsp. patens] Length=232 Score = 74.7 bits (182), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 52/173 (31%), Positives = 82/173 (48%), Gaps = 20/173 (11%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLV------ 64 D + L+ W DQ VL+ +LRRFGC LC A + +I P L+A GV+I+ + Sbjct 27 GDLSSVPLSTFWNDQPVLIHVLRRFGCQLCRGGAVEMGKIFPDLEAHGVRIIGIVRWKSL 86 Query 65 --------GTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLH----F 112 R EK G + E+YID + +KA +Q+VG+L Sbjct 87 VKDVCDADVDARRLGIEKVGLEDFQKGGFWKGELYIDNGKKIHKALNIQKVGILSSVKMM 146 Query 113 LSWTAISEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEY-DHP 164 +S ++ + K K+ P D +GDG Q G +++ G ++ + F + DHP Sbjct 147 VSNKSVKDAIKKTKDTP-GDFKGDGRQLGATFVLAKGGETLLDFRQEHFGDHP 198 >gb|ACI67539.1| C1orf93 homolog [Salmo salar] Length=200 Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust. 
Identities = 46/147 (32%), Positives = 73/147 (50%), Gaps = 17/147 (11%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG---TGNRYFAEK 74 L +W+D+ V+L LRRFGC +C A+ + +++P L A G+ +V +G TG + F E Sbjct 24 LQSLWRDKPVVLFFLRRFGCQVCRWTAAEISKLEPDLTAHGIALVGIGPEETGLKEFKE- 82 Query 75 FIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPN 130 G F ++YID ++ YK G +R L + + R KA Sbjct 83 --------GGFFKGDLYIDEKKQCYKDLGFKRYTALSVVPAALGKKIREVTTKAKAQGIQ 134 Query 131 ADLQGDGLQTGGIYLVGPGADSA-IHF 156 + GD LQ+GG+ +V G + +HF Sbjct 135 GNFTGDLLQSGGMLIVAKGGEKVLLHF 161 >emb|CAX73679.1| hypothetical protein [Schistosoma japonicum] Length=203 Score = 74.3 bits (181), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 41/141 (30%), Positives = 72/141 (52%), Gaps = 10/141 (7%) Query 14 KELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAE 73 + + L W+D+ ++ RR GC C +A ++ +KP LD +K++ + T + + Sbjct 21 QTVTLESFWRDRTCIVTFFRRMGCKFCRLEAKNLSYLKPALDTRNIKLIGI-TFDVGGVK 79 Query 74 KFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRV----GLLHFLSWTAISEWRKANKNHP 129 +F++ G F ++Y+DPE+ YKA G ++V G++ S + KA Sbjct 80 EFLD-----GHYFDGDLYLDPERMTYKALGYKKVSPCSGVISLFSKAGRALNSKAKAAKI 134 Query 130 NADLQGDGLQTGGIYLVGPGA 150 +L GDG QTGG+ +V G Sbjct 135 PGNLSGDGWQTGGLLVVEKGG 155 >gb|ACI68733.1| C1orf93 homolog [Salmo salar] Length=224 Score = 73.9 bits (180), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 47/160 (30%), Positives = 79/160 (50%), Gaps = 21/160 (13%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 ++++D++ ++I +R F C C E + I + L AG+++V++G + + E F Sbjct 45 FKELYQDRKSVVIFVRNFLCHTCKEYVDDLSRIPAEVLKEAGLRLVVIGQSSHHHIESFC 104 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL--------HFLSWTAI----SEWRKA 124 ++ G +P ++Y+DPE+ YK G++R + H S + S WR Sbjct 105 -SLTG----YPHDIYVDPERCIYKRLGMRRGEMSVESTKPSPHVKSGMLVGHMKSMWRAM 159 Query 125 NKNHPNADLQGDGLQTGGIYLVGPGADSAI-HFAFNEYDH 163 P D QGD Q GG +VGPG++ HF N DH Sbjct 160 TS--PIFDFQGDPRQQGGAIIVGPGSEVHFAHFDMNRLDH 197 >ref|XP_002965388.1| hypothetical protein SELMODRAFT_68006 [Selaginella moellendorffii] gb|EFJ34226.1| hypothetical protein SELMODRAFT_68006 [Selaginella moellendorffii] Length=172 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 54/179 (31%), Positives = 76/179 (43%), Gaps = 32/179 (17%) Query 16 LVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKF 75 + L D+WKD+ ++ R FGC LC ++A + K DAAGV +VLVG G A+ F Sbjct 9 IALTDLWKDRTAVVAFARHFGCILCRKRADVLASKKEVFDAAGVSLVLVGPGTVDQAKAF 68 Query 76 IENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQG 135 +FP EVY DP ++ A F+S + KA A L+G Sbjct 69 ASQT-----QFPGEVYADPTHASFDA--------FQFVSGASTIFNPKAAMRVMGAHLEG 115 Query 136 ----------------DGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAV 178 G Q GGI + GPG D ++ D G D +++AA Sbjct 116 YRQDWGLSFEKDTVQRGGWQQGGIVIAGPGKDRLLYI---HKDKEAGDEPDIKEVIAAC 171 >ref|NP_001158627.1| UPF0308 protein C9orf21 homolog [Oncorhynchus mykiss] gb|ACO08544.1| UPF0308 protein C9orf21 homolog [Oncorhynchus mykiss] Length=224 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 47/160 (30%), Positives = 79/160 (50%), Gaps = 21/160 (13%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 ++++D++ ++I +R F C C E + I + L AG+++V++G + + E F Sbjct 45 FKELYQDRKSVVIFVRNFLCHTCKEYVDDLSRIPAEILKEAGLRLVVIGQSSHHHIESFC 104 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL--------HFLSWTAI----SEWRKA 124 ++ G +P ++Y+DPE+ YK G++R + H S + S WR Sbjct 105 -SLTG----YPHDIYVDPERCIYKRLGMRRGEMSVESAKPSPHVKSGMLVGHMKSMWRAM 159 Query 125 NKNHPNADLQGDGLQTGGIYLVGPGADSAI-HFAFNEYDH 163 P D QGD Q GG +VGPG++ HF N DH Sbjct 160 TS--PIFDFQGDPRQQGGAIIVGPGSEVHFAHFDMNRLDH 197 >gb|ACI70082.1| C1orf93 homolog [Salmo salar] Length=232 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 47/160 (30%), Positives = 79/160 (50%), Gaps = 21/160 (13%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 ++++D++ ++I +R F C C E + I + L AG+++V++G + + E F Sbjct 53 FKELYQDRKSVIIFVRNFLCHTCKEYVDDLSRIPAEVLKEAGLRLVVIGQSSHHHIESFC 112 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL--------HFLSWTAI----SEWRKA 124 ++ G +P ++Y+DPE+ YK G++R + H S + S WR Sbjct 113 -SLTG----YPHDMYVDPERCIYKRLGMRRGEMSVESTKPSPHVKSGMLVGHMKSMWRAM 167 Query 125 NKNHPNADLQGDGLQTGGIYLVGPGADSAI-HFAFNEYDH 163 P D QGD Q GG +VGPG++ HF N DH Sbjct 168 TS--PIFDFQGDPRQQGGAIIVGPGSEVHFAHFDMNRLDH 205 >ref|XP_002273449.1| PREDICTED: hypothetical protein [Vitis vinifera] emb|CBI32627.3| unnamed protein product [Vitis vinifera] Length=254 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 50/170 (30%), Positives = 80/170 (48%), Gaps = 18/170 (10%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 ++D+WKD++ ++ R FGC C ++A + K ++DA+GV +VL+G G+ A+ F E Sbjct 92 ISDLWKDRKAVVAFARHFGCVFCRKRADLLASQKDRMDASGVALVLIGPGSIDQAKAFSE 151 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----ISEWRKANKNHPNADL 133 F EVY DP ++Y+ G G+L + A I + + + Sbjct 152 QT-----NFKGEVYADPSHSSYEVLGFVS-GVLSTFTPQAGLKIIQLYMEGYRQDWGLSF 205 Query 134 QGD-----GLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAV 178 Q D G Q GGI + GPG + ++ D G D + IL A Sbjct 206 QRDTVTRGGWQQGGIIVAGPGKS---NISYIHKDKEAGDDPDMEDILTAC 252 >ref|XP_002525198.1| conserved hypothetical protein [Ricinus communis] gb|EEF37164.1| conserved hypothetical protein [Ricinus communis] Length=249 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 53/192 (28%), Positives = 89/192 (47%), Gaps = 33/192 (17%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + +KVL+L E+ ++D+WKD++ ++ R FGC LC ++A ++ K +DA+GV +V Sbjct 73 DTVKVLDLGGN-EIPISDLWKDRKAVVAFARHFGCVLCRKRADYLAAKKDIMDASGVALV 131 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR 122 L+G G+ A+ F E +F EVY D ++Y+A F+S + + Sbjct 132 LIGPGSVDQAKTFSEQT-----KFKGEVYADTSHSSYEA--------FQFVSGVSTTFTP 178 Query 123 KANKNHPNADLQG----------------DGLQTGGIYLVGPGADSAIHFAFNEYDHPVG 166 KA ++G G + GGI + GPG + ++ D G Sbjct 179 KAGLKIIELYMEGYRQDWKLSFEKDTVARGGWRQGGIIVAGPG---KTNISYIHKDKEAG 235 Query 167 TLVDNDQILAAV 178 D + IL A Sbjct 236 DDPDIEDILKAC 247 >ref|NP_001087128.1| chromosome 1 open reading frame 93 [Xenopus laevis] sp|Q6AZG8.1|CA093_XENLA RecName: Full=Uncharacterized protein C1orf93 homolog gb|AAH78028.1| MGC82733 protein [Xenopus laevis] Length=201 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 51/172 (30%), Positives = 82/172 (48%), Gaps = 13/172 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +WK+Q +L+ LRRFGC +C A + ++K D +++V +G E F++ Sbjct 25 LKSLWKEQTTVLLFLRRFGCQICRWIAKDMGKLKESCDVHQIRLVGIGPEEVGLKE-FLD 83 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 G F E+YID + +YK G +R L + + R KAN + + Sbjct 84 -----GNFFNGELYIDDSKQSYKDLGFKRYSALSVIPAALGKKVRDIVTKANADGVQGNF 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHFAFNEYDH--PVGTLVDNDQILAAVKATQ 182 GD LQ+GG+ +V G + +HF + P+ T+V I A V +Q Sbjct 139 SGDLLQSGGMLIVSKGGEKVLLHFIQDSPGDYVPLETIVQTLGITANVTESQ 190 >ref|XP_002666723.1| PREDICTED: UPF0308 protein C9orf21 homolog [Danio rerio] Length=223 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 61/194 (32%), Positives = 90/194 (47%), Gaps = 35/194 (18%) Query 1 ASNNIKVLEL---------ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIK 51 AS+NI + EL DKK + +++ + ++I +R F C C E + +I Sbjct 21 ASHNICLSELKNCFIFDRHGDKKSF--SSLFEHNKAIVIFVRHFLCYTCKEYVEDLGKI- 77 Query 52 PQ--LDAAGVKIVLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGL 109 PQ L + V++V++G + + F ++ G FP E+Y+DPE+ YK GL+R Sbjct 78 PQHVLQDSNVRLVVIGQSSYSHIQGFC-SLTG----FPHEIYVDPERQIYKRLGLRRGET 132 Query 110 L------------HFLSWTAISEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAI-HF 156 LS + S WR P D QGD Q GG +VGPG D HF Sbjct 133 YMETPSVSPHVKSSMLSGSLKSVWRAMTS--PVFDFQGDPQQQGGALIVGPGPDVHFAHF 190 Query 157 AFNEYDH-PVGTLV 169 N DH P+ L+ Sbjct 191 DMNRLDHMPINWLL 204 >emb|CAG07032.1| unnamed protein product [Tetraodon nigroviridis] Length=191 Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 45/144 (32%), Positives = 71/144 (50%), Gaps = 11/144 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +W+DQ V+L LRRFGC +C A+ + +++ +L A GV LVG G K + Sbjct 24 LQSLWRDQPVVLFFLRRFGCQICRWIAAEISKLEAELRAGGV--ALVGIGPEEVGLKEFK 81 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 +G F +YID ++ YK G +R + + + R KA + + Sbjct 82 ----DGGFFKGSIYIDEKKKTYKDLGFKRYTAISVVPAAMGKKVRDVAAKAKADGVEGNF 137 Query 134 QGDGLQTGGIYLVGPGADSA-IHF 156 GD LQ+GG+ +V G + +HF Sbjct 138 SGDLLQSGGMLIVAKGGEKVLLHF 161 >gb|ACU20039.1| unknown [Glycine max] Length=256 Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 45/159 (29%), Positives = 83/159 (53%), Gaps = 16/159 (10%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 +++KV +L + + ++D+WKD++ ++ R FGC LC ++A ++ K +DA+GV +V Sbjct 80 DSVKVFDL-NGNGIPISDLWKDRKAVVAFARHFGCVLCRKRADYLSSKKDIMDASGVALV 138 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----I 118 L+G G+ A+ F E +F E+Y DP ++Y+A G+L + A I Sbjct 139 LIGPGSIDQAKSFAEK-----SKFEGEIYADPTHSSYEALNFVS-GVLTTFTPNAGLKII 192 Query 119 SEWRKANKNHPNADLQGD-----GLQTGGIYLVGPGADS 152 + + + + D G + GGI + GPG ++ Sbjct 193 QLYMEGYRQDWKLSFEKDTVSRGGWKQGGIIVAGPGKNN 231 >ref|NP_201385.2| unknown protein [Arabidopsis thaliana] gb|AAM20692.1| unknown protein [Arabidopsis thaliana] gb|AAN15652.1| unknown protein [Arabidopsis thaliana] Length=275 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. 
Identities = 51/147 (35%), Positives = 74/147 (51%), Gaps = 17/147 (11%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + + +D+W KD ++LLR FGC C E A+ + E KP+ DAAGVK++ VG G Sbjct 100 ASGQRVQFSDLWDQKDGIAAVVLLRHFGCVCCWELATALKEAKPRFDAAGVKLIAVGVGT 159 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQ-RVGLLHF-----LSWTAISEW 121 A +P FP E +Y DPE+ AY GL +G F ++ SE Sbjct 160 PDKARILATRLP-----FPMECLYADPERKAYDVLGLYFGLGRTFFNPASTKVFSRFSEI 214 Query 122 RKANKNHP---NADLQGDGLQTGGIYL 145 R+A KN+ + + LQ GG ++ Sbjct 215 REATKNYTIEATPEDRSSVLQQGGTFV 241 >ref|NP_998478.1| hypothetical protein LOC406605 [Danio rerio] sp|Q6NV24.1|CA093_DANRE RecName: Full=Uncharacterized protein C1orf93 homolog gb|AAH68342.1| Zgc:85644 [Danio rerio] Length=201 Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 43/147 (30%), Positives = 73/147 (50%), Gaps = 17/147 (11%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG---TGNRYFAEK 74 + +W++Q V+L LRRFGC +C A+ V +++ L A G+ +V +G TG + F Sbjct 25 IGSLWREQAVVLFFLRRFGCQVCRWMAAEVSKLEKDLKAHGIALVGIGPEETGVKEFK-- 82 Query 75 FIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPN 130 +G F ++YID + YK G +R ++ + + R KA+ Sbjct 83 -------DGGFFKGDIYIDEMKQCYKDLGFKRYNAINVVPAAMGKKVREIASKASAEGIQ 135 Query 131 ADLQGDGLQTGGIYLVGPGADSA-IHF 156 + GD LQ+GG+ +V G + +HF Sbjct 136 GNFSGDLLQSGGMLIVAKGGEKVLLHF 162 >ref|XP_002664824.1| PREDICTED: hypothetical protein [Danio rerio] Length=201 Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust. 
Identities = 43/147 (30%), Positives = 73/147 (50%), Gaps = 17/147 (11%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVG---TGNRYFAEK 74 + +W++Q V+L LRRFGC +C A+ V +++ L A G+ +V +G TG + F Sbjct 25 IGSLWREQAVVLFFLRRFGCQVCRWMAAEVSKLEKDLKAHGIALVGIGPEETGVKEFK-- 82 Query 75 FIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPN 130 +G F ++YID + YK G +R ++ + + R KA+ Sbjct 83 -------DGGFFKGDIYIDEMKQCYKDLGFKRYNAINVVPAAMGKKVREIASKASAEGIQ 135 Query 131 ADLQGDGLQTGGIYLVGPGADSA-IHF 156 + GD LQ+GG+ +V G + +HF Sbjct 136 GNFSGDLLQSGGMLIVAKGGEKVLLHF 162 >ref|XP_002977235.1| hypothetical protein SELMODRAFT_58056 [Selaginella moellendorffii] gb|EFJ21844.1| hypothetical protein SELMODRAFT_58056 [Selaginella moellendorffii] Length=172 Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 48/156 (31%), Positives = 68/156 (44%), Gaps = 29/156 (18%) Query 16 LVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKF 75 + L D+WKD+ ++ R FGC LC ++A + K D AGV +VLVG G A+ F Sbjct 9 ISLTDLWKDRTAVVAFARHFGCILCRKRADVLASKKEVFDGAGVSLVLVGPGTVDQAKAF 68 Query 76 IENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQG 135 +FP EVY DP +++A F+S + KA A L+G Sbjct 69 ASQT-----QFPGEVYADPTHASFEA--------FQFVSGASTIFNPKAAMRVMGAHLEG 115 Query 136 ----------------DGLQTGGIYLVGPGADSAIH 155 G Q GGI + GPG D ++ Sbjct 116 YRQDWGLSFEKDTVQRGGWQQGGIVIAGPGKDRLLY 151 >ref|XP_001370596.1| PREDICTED: similar to SFLQ611 [Monodelphis domestica] Length=229 Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 46/183 (26%), Positives = 86/183 (47%), Gaps = 16/183 (8%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 +K L+ K ++W+ + +++ +RR GC LC E+A+ + +KPQLD GV + Sbjct 52 QLKTLDNESPKTFKARELWEHRGAVIMAVRRPGCFLCREEAADLSALKPQLDLLGVPLYA 111 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR- 122 V EK V F ++++D + Y G Q+ ++ F+ + + W+ Sbjct 112 V------VKEKIGSEVENFQPYFKGKIFLDERKKFY---GPQKRKMM-FMGFVRLGVWQN 161 Query 123 --KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 +A + +L+G+G GG+Y++GPG + + G V+ +L A K Sbjct 162 FFRARSKGFSGNLEGEGFVLGGVYVIGPGKQGIL---LEHREKEFGDKVNPASVLEAAKK 218 Query 181 TQP 183 +P Sbjct 219 IKP 221 >ref|XP_002334170.1| predicted protein [Populus trichocarpa] gb|EEF07066.1| predicted protein [Populus trichocarpa] Length=146 Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 34/99 (35%), Positives = 58/99 (59%), Gaps = 6/99 (6%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + ++V +L + + +D+WKD++ ++ R FGC LC +A ++ K +DA+GV +V Sbjct 22 DTVEVFDL-NGNAIPFSDLWKDRKAVVAFARHFGCVLCRRRADYLAAKKDIMDASGVALV 80 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKA 101 L+G G+ A+ F E +F EVY DP ++YKA Sbjct 81 LIGPGSVDQAKTFSEQT-----KFKGEVYADPSHSSYKA 114 >ref|XP_002866692.1| hypothetical protein ARALYDRAFT_496819 [Arabidopsis lyrata subsp. lyrata] gb|EFH42951.1| hypothetical protein ARALYDRAFT_496819 [Arabidopsis lyrata subsp. lyrata] Length=265 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 50/147 (35%), Positives = 74/147 (51%), Gaps = 17/147 (11%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + + +D+W KD ++LLR FGC C E A+ + E KP+ DAAGVK++ VG G Sbjct 90 ASGQRVQFSDLWDQKDGIAAVVLLRHFGCVCCWELATALKEAKPRFDAAGVKLIAVGVGT 149 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQR-VGLLHF-----LSWTAISEW 121 A +P FP E +Y DPE+ AY GL +G F ++ +E Sbjct 150 PDKARILATRLP-----FPMECLYADPERKAYDVLGLYYGLGRTFFNPASTKVFSRFNEI 204 Query 122 RKANKNHP---NADLQGDGLQTGGIYL 145 R+A KN+ + + LQ GG ++ Sbjct 205 REATKNYTIEATPEDRSSVLQQGGTFV 231 >ref|XP_795970.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] ref|XP_001176073.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length=191 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 44/159 (28%), Positives = 77/159 (49%), Gaps = 11/159 (6%) Query 2 SNNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKI 61 SNN+ V + + + L+ +W++ ++ LRRFGC +C A + +KP+LDAA V++ Sbjct 8 SNNL-VTNVQTGETITLSSIWEEGACVIQFLRRFGCPICRMGARDITHLKPRLDAANVRL 66 Query 62 VLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEW 121 V +G A++FIE+ G +++ID ++ Y +R L ++ Sbjct 67 VAIGQ-EETGAKEFIESGFWTG-----DLFIDQQKKTYGDLKYKRYNFLTIMANLMCKMT 120 Query 122 R----KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHF 156 R KA ++ GD LQ GG ++ G + F Sbjct 121 REAVSKATSEGITGNMTGDALQMGGTLVIDKGGKVLLDF 159 >ref|XP_001926014.1| PREDICTED: UPF0765 protein C10orf58 isoform 4 [Sus scrofa] Length=231 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 48/183 (27%), Positives = 87/183 (48%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + K +W+ +++ +RR GC LC E+A+ + +KP+LD GV + Sbjct 55 DLKTLE-KEPKTFKAKALWEKTGAVIMAVRRPGCFLCREEAADLSSLKPRLDELGVPLYA 113 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR- 122 V E+ V F E+++D E+ Y G QR ++ F+ + + W Sbjct 114 V------VKEQVKNEVKDFQPYFKGEIFLDEEKKFY---GPQRRKMM-FMGFVRLGVWYN 163 Query 123 --KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 +A + +L+G+G GG+++VGPG + + G V+ +L AV+ Sbjct 164 FFRARSGGFSGNLEGEGFVLGGVFVVGPGKQGIL---LEHREKEFGDKVNPVSVLEAVRK 220 Query 181 TQP 183 +P Sbjct 221 IKP 223 >ref|NP_001069904.1| hypothetical protein LOC616897 [Bos taurus] sp|Q148E0.1|CI021_BOVIN RecName: Full=UPF0308 protein C9orf21 homolog gb|AAI18426.1| Chromosome 9 open reading frame 21 ortholog [Bos taurus] gb|DAA26600.1| hypothetical protein LOC616897 [Bos taurus] Length=228 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 46/173 (27%), Positives = 85/173 (50%), Gaps = 21/173 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + ++ ++++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 44 ASGRPVLFGELFRERRAIVVFVRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSY 103 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 104 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGQSPHVKSNILSGSIR 158 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IH N DH P+ +++ Sbjct 159 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNNIHFIHHDRNRLDHKPINSVL 209 >ref|XP_003130904.1| PREDICTED: UPF0308 protein C9orf21 homolog isoform 1 [Sus scrofa] Length=228 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 47/175 (27%), Positives = 85/175 (49%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + ++ +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 44 ASGRRVLFGSLFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSY 103 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 104 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGKSPHIKSNILSGSIR 158 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 159 SLWRAVTG--PLFDFQGDPAQQGGTVILGPGNN--IHFIHRDRNRLDHKPINSVL 209 >gb|EEC75002.1| hypothetical protein OsI_11064 [Oryza sativa Indica Group] Length=239 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 50/185 (28%), Positives = 85/185 (46%), Gaps = 19/185 (10%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 ++V +L+ K V+ D+WKD++ ++ R FGC LC ++A + + ++AAGV +V Sbjct 63 QGVEVFDLSGKAVPVV-DLWKDRKAIVAFARHFGCVLCRKRADLLAAKQDAMEAAGVALV 121 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----I 118 L+G G A+ F + +F EVY DP ++Y A GL + +A I Sbjct 122 LIGPGTVEQAKAFYDQT-----KFKGEVYADPSHSSYNALEFA-FGLFSTFTPSAGLKII 175 Query 119 SEWRKANKNHPNADLQGD-----GLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQ 173 + + + + G GG+ + GPG D+ ++ D G D D Sbjct 176 QLYMEGYRQDWELSFEKTTRTKGGWYQGGLLVAGPGIDNILYI---HKDKEAGDDPDMDD 232 Query 174 ILAAV 178 +L A Sbjct 233 VLKAC 237 >ref|XP_002468056.1| hypothetical protein SORBIDRAFT_01g038790 [Sorghum bicolor] gb|EER95054.1| hypothetical protein SORBIDRAFT_01g038790 [Sorghum bicolor] Length=258 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 51/186 (28%), Positives = 87/186 (47%), Gaps = 21/186 (11%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 ++V +L + K + + D+WK+++ ++ R FGC LC ++A + + + AAGV +V Sbjct 82 QGVEVFDL-NGKAVSIVDLWKERKAVVAFARHFGCVLCRKRADLLAAKQDVMQAAGVALV 140 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----I 118 L+G G+ A+ F E +F EVY DP ++Y A GL + A I Sbjct 141 LIGPGSVEQAKAFCEQT-----KFKGEVYADPTHSSYDALEFA-FGLFSTFTPAAGLKII 194 Query 119 SEWRKANKN------HPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDND 172 +R+ + N +G G GG+ + GPG D+ ++ D G D + Sbjct 195 QLYREGYRQDWELSFEKNTRTKG-GWYQGGLIVAGPGIDNILYI---HKDKEAGDDPDME 250 Query 173 QILAAV 178 +L A Sbjct 251 DVLRAC 256 >ref|NP_001049763.1| Os03g0284600 [Oryza sativa Japonica Group] gb|ABF95344.1| UPF0308 protein, chloroplast precursor, putative, expressed [Oryza sativa Japonica Group] dbj|BAF11677.1| Os03g0284600 [Oryza sativa Japonica Group] gb|EEE58829.1| hypothetical protein OsJ_10400 [Oryza sativa Japonica Group] Length=251 Score = 68.2 bits (165), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 50/185 (28%), Positives = 85/185 (46%), Gaps = 19/185 (10%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 ++V +L+ K V+ D+WKD++ ++ R FGC LC ++A + + ++AAGV +V Sbjct 75 QGVEVFDLSGKAVPVV-DLWKDRKAIVAFARHFGCVLCRKRADLLAAKQDAMEAAGVALV 133 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----- 117 L+G G A+ F + +F EVY DP ++Y A GL + +A Sbjct 134 LIGPGTVEQAKAFYDQT-----KFKGEVYADPSHSSYNALEFA-FGLFSTFTPSAGLKII 187 Query 118 ---ISEWRKA-NKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQ 173 + +R+ + G GG+ + GPG D+ ++ D G D D Sbjct 188 QLYMEGYRQDWELSFEKTTRTKGGWYQGGLLVAGPGIDNILYI---HKDKEAGDDPDMDD 244 Query 174 ILAAV 178 +L A Sbjct 245 VLKAC 249 >gb|ACR37670.1| unknown [Zea mays] Length=259 Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 46/163 (29%), Positives = 79/163 (49%), Gaps = 18/163 (11%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + V +L+ K + + D+WK+++ ++ R FGC LC ++A + + + AAGV +V Sbjct 83 QGVDVFDLSGKT-VPIVDLWKERKAVVAFARHFGCVLCRKRADLLAAKQDDMQAAGVALV 141 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----I 118 L+G G+ A+ F E +F EVY DP ++Y A GL + A I Sbjct 142 LIGPGSVEQAKAFCEQT-----KFKGEVYADPTHSSYDALEFA-FGLFSTFTPAAGLKII 195 Query 119 SEWRKANKN------HPNADLQGDGLQTGGIYLVGPGADSAIH 155 +R+ + N +G G GG+ + GPG D+ ++ Sbjct 196 QLYREGYRQDWELSFEKNTRTKG-GWYQGGLIVAGPGIDNILY 237 >ref|XP_001493974.2| PREDICTED: similar to UPF0308 protein C9orf21 [Equus caballus] Length=232 Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 46/156 (30%), Positives = 77/156 (50%), Gaps = 20/156 (12%) Query 21 MWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNRYFAEKFIENV 79 +++++R +++ +R F C +C E + +I K L A V ++++G + + E F + + Sbjct 58 LFRERRAVVVFMRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSYHHIEPFCK-L 116 Query 80 PGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHF-----------LSWTAISEWRKANKNH 128 G + E+Y+DPE+ YK G++R + F LS + S WR Sbjct 117 TG----YSHEIYVDPEREIYKRLGMKRGEEIAFSGKSPHIKSNILSGSIRSLWRAMTG-- 170 Query 129 PNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH 163 P D QGD Q GG ++GPG + IH N DH Sbjct 171 PLFDFQGDPAQQGGTLILGPGNNIHFIHCDRNRLDH 206 >gb|EGC30304.1| hypothetical protein DICPUDRAFT_99572 [Dictyostelium purpureum] Length=808 Score = 67.4 bits (163), Expect = 7e-10, Method: Composition-based stats. 
Identities = 45/174 (26%), Positives = 87/174 (50%), Gaps = 15/174 (8%) Query 15 ELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEK 74 E+++ +++++R+++ + RRFGC +C QA + +KP+LD G+++V +G E Sbjct 402 EVLVTSLYENKRIVVAIFRRFGCLICRLQALDLSSLKPKLDRMGIELVGIGFDEEGIDE- 460 Query 75 FIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLH----FLSWTAISEWRKANKNHPN 130 FI+ + F ++YID + Y+A L+R L FL ++ +R+ + Sbjct 461 FIQY-----KFFAGKIYIDKNRQVYRALNLKRRSKLTTYELFLDPRVMTYYRRMKELGLP 515 Query 131 ADLQGDGLQTGGIYLVGPGADSAIH-FAFNEYDHPVGTLVDNDQILAAVKATQP 183 ++ + DG Q G ++GP ++ F Y + D +I AA + P Sbjct 516 SNYRKDGFQLGATLVLGPRPQETLYDFRPQRY----ADIFDLKEIWAACQTPYP 565 >gb|EDL84435.1| similar to UPF0308 protein C9orf21, isoform CRA_c [Rattus norvegicus] Length=226 Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 47/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 42 ASGRRVTFGALFRERRAVVVFVRHFLCYVCKEYVEDLAKIPKSVLQEADVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEISSSGQSPHIKSNLLSGSLQ 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFVHRDRNRLDHKPINSVL 207 >dbj|BAB24662.1| unnamed protein product [Mus musculus] Length=186 Score = 67.4 bits (163), Expect = 8e-10, Method: Compositional matrix adjust. 
Identities = 48/175 (28%), Positives = 86/175 (50%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 2 ASGRRVTFGALFRERRAVVVFVRHFLCYVCKEYVEDLAKIPKSVLREADVTLIVIGQSSY 61 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + G + E+Y+DPE+ YK G++R + + LS + Sbjct 62 HHIEPFCK-LTG----YSHEIYVDPEREIYKRLGMKRGEEISSSGQSPHIKSNLLSGSLQ 116 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 117 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFVHRDRNRLDHKPINSVL 167 >ref|XP_001784902.1| predicted protein [Physcomitrella patens subsp. patens] gb|EDQ50291.1| predicted protein [Physcomitrella patens subsp. patens] Length=187 Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 44/146 (31%), Positives = 76/146 (53%), Gaps = 17/146 (11%) Query 12 DKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNR 69 D + + +++W ++ + ++ LR FGC C E A+ + E KP+ DAAG K++ +G G Sbjct 13 DGQPVKFSELWDHRNGKAIVAFLRHFGCPFCWEFAAALREAKPKFDAAGFKLITIGVGPS 72 Query 70 YFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANK-- 126 A+ E +P FPA+ +Y DP++ AY A GL +L+ ++ + + +K Sbjct 73 SKAQVLSEKLP-----FPADCLYADPDRKAYDALGLYHGVARTWLNPASMQIFTRLDKVA 127 Query 127 ---NHPNADLQGDG----LQTGGIYL 145 N D+ D LQ GG+Y+ Sbjct 128 DAVKGWNRDVMPDNTAATLQQGGVYV 153 >ref|XP_002928763.1| PREDICTED: UPF0308 protein C9orf21-like, partial [Ailuropoda melanoleuca] Length=192 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 47/173 (28%), Positives = 86/173 (50%), Gaps = 21/173 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A +++ +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 8 ASGRQVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEADVTLIVIGQSSY 67 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + G + E+Y+DPE+ YK G++R + + LS + Sbjct 68 HHIEPFCK-LTG----YSHEIYVDPEREIYKKLGMKRGEEIASSGKSPHIKSNILSGSIR 122 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IH N DH P+ +++ Sbjct 123 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNNIHFIHRDRNRLDHKPINSVL 173 >gb|EFB20490.1| hypothetical protein PANDA_018799 [Ailuropoda melanoleuca] Length=186 Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 47/173 (28%), Positives = 86/173 (50%), Gaps = 21/173 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A +++ +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 2 ASGRQVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEADVTLIVIGQSSY 61 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + G + E+Y+DPE+ YK G++R + + LS + Sbjct 62 HHIEPFCK-LTG----YSHEIYVDPEREIYKKLGMKRGEEIASSGKSPHIKSNILSGSIR 116 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IH N DH P+ +++ Sbjct 117 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNNIHFIHRDRNRLDHKPINSVL 167 >ref|NP_079646.1| hypothetical protein LOC66129 [Mus musculus] sp|Q9D1A0.1|CI021_MOUSE RecName: Full=UPF0308 protein C9orf21 homolog dbj|BAB22993.1| unnamed protein product [Mus musculus] dbj|BAE28518.1| unnamed protein product [Mus musculus] gb|AAI40306.1| RIKEN cDNA 1110018J18 gene [synthetic construct] gb|EDL16238.1| RIKEN cDNA 1110018J18, isoform CRA_b [Mus musculus] gb|AAI56632.1| RIKEN cDNA 1110018J18 gene [synthetic construct] Length=226 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 47/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 42 ASGRRVTFGALFRERRAVVVFVRHFLCYVCKEYVEDLAKIPKSVLREADVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEISSSGQSPHIKSNLLSGSLQ 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFVHRDRNRLDHKPINSVL 207 >ref|XP_001106503.1| PREDICTED: UPF0308 protein C9orf21-like [Macaca mulatta] Length=226 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 47/173 (28%), Positives = 83/173 (48%), Gaps = 21/173 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 42 ARGQRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCRLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGQSPHVKSNLLSGSLQ 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLV 169 S WR P D QGD Q GGI ++GPG + IH N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGILILGPGNNIHFIHRDRNRLDHKPINSVL 207 >gb|ADO28366.1| upf0308 protein c9orf21-like protein [Ictalurus furcatus] Length=223 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 47/170 (28%), Positives = 80/170 (48%), Gaps = 24/170 (14%) Query 16 LVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ--LDAAGVKIVLVGTGNRYFAE 73 L +++ + ++I +R F C C E + +I PQ L A V+++++G E Sbjct 43 LTFKSLYQTHKAIIIFVRHFLCFTCQEYVEDLSQI-PQEILLDADVRLIVIGQSGFSHIE 101 Query 74 KFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLH------------FLSWTAISEW 121 F ++ G + E+Y+DPE+ Y+ G++R + L + S W Sbjct 102 AFC-SLTG----YQHEIYVDPERHIYEKLGMKRGEIYEETASQSPHVKSSMLVGSIKSMW 156 Query 122 RKANKNHPNADLQGDGLQTGGIYLVGPGADSAI-HFAFNEYDH-PVGTLV 169 R P D QGD LQ GG ++GPG + + HF N ++H P+ L+ Sbjct 157 RAMTS--PAFDFQGDPLQQGGALIIGPGPNIHVAHFDMNRFNHMPINGLL 204 >ref|NP_001180447.1| selenoprotein U [Gallus gallus] ref|NP_001180448.1| selenoprotein U [Gallus gallus] Length=224 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 43/165 (27%), Positives = 77/165 (47%), Gaps = 10/165 (6%) Query 19 ADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIEN 78 +++WK +++ +RR G LC E+AS + +KPQL GV + V EK Sbjct 67 SELWKKNGAVIMAVRRPGUFLCREEASELSSLKPQLSKLGVPLYAV------VKEKIGTE 120 Query 79 VPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQGDGL 138 V F E+++D +++ Y R +++ L F + +A KN + +L+G+G Sbjct 121 VEDFQHYFQGEIFLDEKRSFYGPRK-RKMMLSGFFRIGVWQNFFRAWKNGYSGNLEGEGF 179 Query 139 QTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQP 183 GG+Y++G G + + G V +L A + +P Sbjct 180 TLGGVYVIGAGRQGVL---LEHREKEFGDKVSLPSVLEAAEKIKP 221 >ref|ZP_01909605.1| hypothetical protein PPSIR1_24954 [Plesiocystis pacifica SIR-1] gb|EDM77493.1| hypothetical protein PPSIR1_24954 [Plesiocystis pacifica SIR-1] Length=198 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 49/169 (29%), Positives = 81/169 (48%), Gaps = 21/169 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRY 70 AD + L D+W ++ V+ I LR FGC LC A + + +AAG ++V VGTG R Sbjct 22 ADSEAHRLGDLWAERAVVFIHLRHFGCILCRHYAGALRDSFGDFEAAGAQLVAVGTGGRQ 81 Query 71 FAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQ----RVGLLH-FLSWTAISEW---R 122 + FIE ++ P V +D +Y+A ++ ++G LH + W A+ Sbjct 82 YTRDFIEE-----RKIPYLVLVDRHLASYEALHVRHDRSKMGWLHPKILWHALKALLAGE 136 Query 123 KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDN 171 + K+ PN + G +++GPG I +A+ D+ VD+ Sbjct 137 RQGKSGPNP------FKYGAAHVIGPG--GTIEYAWLNDDYHDNAPVDD 177 >dbj|BAE40544.1| unnamed protein product [Mus musculus] Length=194 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 48/175 (28%), Positives = 85/175 (49%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + + +++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 10 ASGRRVTFGALCRERRAVVVFVRHFLCYVCKEYVEDLAKIPKSVLREADVTLIVIGQSSY 69 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + G + E+Y+DPE+ YK G++R + + LS + Sbjct 70 HHIEPFCK-LTG----YSHEIYVDPEREIYKRLGMKRGEEISSSGQSPHIKSNLLSGSLQ 124 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 125 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFVHRDRNRLDHKPINSVL 175 >sp|Q5ZI34.2|CJ058_CHICK RecName: Full=UPF0765 protein C10orf58 homolog; Flags: Precursor Length=224 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 43/165 (27%), Positives = 77/165 (47%), Gaps = 10/165 (6%) Query 19 ADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIEN 78 +++WK +++ +RR G LC E+AS + +KPQL GV + V EK Sbjct 67 SELWKKNGAVIMAVRRPGUFLCREEASELSSLKPQLSKLGVPLYAV------VKEKIGTE 120 Query 79 VPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQGDGL 138 V F E+++D +++ Y R +++ L F + +A KN + +L+G+G Sbjct 121 VEDFQHYFQGEIFLDEKRSFYGPRK-RKMMLSGFFRIGVWQNFFRAWKNGYSGNLEGEGF 179 Query 139 QTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQP 183 GG+Y++G G + + G V +L A + +P Sbjct 180 TLGGVYVIGAGRQGIL---LEHREKEFGDKVSLPSVLEAAEKIKP 221 >gb|EAW92661.1| chromosome 9 open reading frame 21, isoform CRA_a [Homo sapiens] Length=214 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 46/164 (29%), Positives = 82/164 (50%), Gaps = 15/164 (9%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I + L A V ++++G + Sbjct 42 ARGQRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPRSFLQEANVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHP 129 + E F + + E+Y+DPE+ YK G++R G S + S WR P Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKR-GEEIASSGSLQSLWRAVTG--P 153 Query 130 NADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 154 LFDFQGDPAQQGGTLILGPGNN--IHFIHRDRNRLDHKPINSVL 195 >ref|NP_001180474.1| selenoprotein U [Oryzias latipes] Length=212 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 50/190 (27%), Positives = 85/190 (45%), Gaps = 17/190 (8%) Query 1 ASNNIKVLELAD-------KKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ 53 A+ +++ LE AD K + +W +++ +RR G LC E+AS + +KPQ Sbjct 31 ANASLEFLEEADLRCTLDHTKVIKAKSLWDKNGAVVMAVRRPGUFLCREEASELSSLKPQ 90 Query 54 LDAAGVKIVLVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFL 113 L+ GV +V V E + F ++YID E+ Y +R+G L F+ Sbjct 91 LEELGVPLVAV------VKENLGSEIQDFRPHFAGDIYIDEEKRFYGPL-QRRMGGLGFI 143 Query 114 SWTAISEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQ 173 + +A K+ ++ G+G GG+Y++G G I + G VD Sbjct 144 RIGVWQNFIRAWKSGYQGNMNGEGFILGGVYVIGAGEQGII---LEHREKQFGDKVDTAD 200 Query 174 ILAAVKATQP 183 +L A++ P Sbjct 201 VLKAIQKIVP 210 >ref|XP_002820049.1| PREDICTED: UPF0308 protein C9orf21-like [Pongo abelii] Length=226 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 47/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 42 ARGQRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGQSPHVKSNLLSGSLR 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFIHRDRNRLDHKPINSVL 207 >ref|XP_520707.2| PREDICTED: similar to TPA_exp: C9ORF21 isoform 2 [Pan troglodytes] Length=226 Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 47/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A V ++++G + Sbjct 42 ARGQRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSFLQEANVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGQSPHVKSNLLSGSLQ 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFIHRDRNRLDHKPINSVL 207 >gb|EDL90877.1| similar to RIKEN cDNA 5730469M10, isoform CRA_b [Rattus norvegicus] Length=225 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 44/183 (25%), Positives = 85/183 (47%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + ++W+ +++ +RR GC LC +A+ ++ +KP+LD GV + Sbjct 49 DLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCRAEAADLMSLKPKLDELGVPLYA 107 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHFLSW-TAISE 120 V EK V F E+++D ++ Y + R + +GL+ W + Sbjct 108 V------VKEKVKREVEDFQPYFKGEIFLDEKKKFYGPERRKMMLMGLVRLGVWYNSFRA 161 Query 121 WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 W K + + +G+G GG++++G G + + G V+ +L AVK Sbjct 162 W----KGGFSGNFEGEGFILGGVFVIGSGKQGVL---LEHREKEFGDRVNLLSVLEAVKK 214 Query 181 TQP 183 +P Sbjct 215 IKP 217 >ref|NP_001145525.1| hypothetical protein LOC100278941 [Zea mays] gb|ACG48189.1| hypothetical protein [Zea mays] Length=258 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 45/163 (28%), Positives = 79/163 (49%), Gaps = 19/163 (11%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + V +L+ K + + D+WK+++ ++ R FGC LC ++A + + + AAGV +V Sbjct 83 QGVDVFDLSGKT-VPIVDLWKERKAVVAFARHFGCVLCRKRADLLAAKQDDMQAAGVALV 141 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTA----I 118 L+G G+ A+ F + +F EVY DP ++Y A GL + A I Sbjct 142 LIGPGSVEQAKAFEQT------KFKGEVYADPTHSSYDALEFA-FGLFSTFTPAAGLKII 194 Query 119 SEWRKANKN------HPNADLQGDGLQTGGIYLVGPGADSAIH 155 +R+ + N +G G GG+ + GPG D+ ++ Sbjct 195 QLYREGYRQDWELSFEKNTRTKG-GWYQGGLIVAGPGIDNILY 236 >gb|EDL90876.1| similar to RIKEN cDNA 5730469M10, isoform CRA_a [Rattus norvegicus] Length=218 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 44/183 (25%), Positives = 85/183 (47%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + ++W+ +++ +RR GC LC +A+ ++ +KP+LD GV + Sbjct 42 DLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCRAEAADLMSLKPKLDELGVPLYA 100 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHFLSW-TAISE 120 V EK V F E+++D ++ Y + R + +GL+ W + Sbjct 101 V------VKEKVKREVEDFQPYFKGEIFLDEKKKFYGPERRKMMLMGLVRLGVWYNSFRA 154 Query 121 WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 W K + + +G+G GG++++G G + + G V+ +L AVK Sbjct 155 W----KGGFSGNFEGEGFILGGVFVIGSGKQGVL---LEHREKEFGDRVNLLSVLEAVKK 207 Query 181 TQP 183 +P Sbjct 208 IKP 210 >ref|NP_001014162.1| hypothetical protein LOC361118 precursor [Rattus norvegicus] sp|Q6AXX6.1|CJ058_RAT RecName: Full=UPF0765 protein C10orf58 homolog; AltName: Full=Sperm head protein 1; Flags: Precursor gb|AAH79275.1| Similar to RIKEN cDNA 5730469M10 [Rattus norvegicus] Length=229 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 44/183 (25%), Positives = 85/183 (47%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + ++W+ +++ +RR GC LC +A+ ++ +KP+LD GV + Sbjct 53 DLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCRAEAADLMSLKPKLDELGVPLYA 111 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHFLSW-TAISE 120 V EK V F E+++D ++ Y + R + +GL+ W + Sbjct 112 V------VKEKVKREVEDFQPYFKGEIFLDEKKKFYGPERRKMMLMGLVRLGVWYNSFRA 165 Query 121 WRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 W K + + +G+G GG++++G G + + G V+ +L AVK Sbjct 166 W----KGGFSGNFEGEGFILGGVFVIGSGKQGVL---LEHREKEFGDRVNLLSVLEAVKK 218 Query 181 TQP 183 +P Sbjct 219 IKP 221 >ref|XP_536403.1| PREDICTED: similar to R53.5 [Canis familiaris] Length=225 Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 45/182 (25%), Positives = 86/182 (48%), Gaps = 17/182 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + ++W+ +++ +RR GC LC E+A+ + +KP+LD GV + Sbjct 49 DLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKPKLDELGVPLYA 107 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR- 122 V E+ V F E+++D ++ Y G QR ++ F+ + + W Sbjct 108 V------VKEQIRTEVQDFQPYFKGEIFLDEKKKFY---GPQRRKMM-FMGFVRLGVWYN 157 Query 123 --KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 +A + +L+G+G GG+++VGPG + + G V+ +L A + Sbjct 158 FFRARNGGFSGNLEGEGFILGGVFVVGPGKQGIL---LEHREKEFGDKVNPVSVLEAARK 214 Query 181 TQ 182 Q Sbjct 215 IQ 216 >gb|ACN25853.1| unknown [Zea mays] Length=261 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 50/157 (32%), Positives = 80/157 (51%), Gaps = 17/157 (10%) Query 1 ASNNIKVLELADKKELVLADMW-KDQRVLLI-LLRRFGCSLCHEQASHVLEIKPQLDAAG 58 A ++++ A + ++ D+W +D+ V ++ LLR FGC C E AS + + K + D+AG Sbjct 76 ALGDVEIYSAATGEPVLFRDLWDQDEGVSVVALLRHFGCPCCWELASVLRDTKERFDSAG 135 Query 59 VKIVLVGTGNRYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQ-RVGLLHFLSWT 116 VK++ VG G A E +P FP E +Y DP++ AY GL VG F + Sbjct 136 VKLIAVGVGTPAKARILAERLP-----FPLEYLYADPDRKAYNLLGLYFGVGRTFFNPAS 190 Query 117 A-----ISEWRKANKNH---PNADLQGDGLQTGGIYL 145 A ++A KN+ D + LQ GG+++ Sbjct 191 AKVFSRFDSLKEAVKNYTIEATPDDRAGVLQQGGMFV 227 >ref|XP_001368977.1| PREDICTED: similar to C9ORF21 [Monodelphis domestica] Length=354 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 45/173 (27%), Positives = 81/173 (47%), Gaps = 21/173 (12%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A K + ++++++R +++ +R F C C E + +I K L A V ++++G + Sbjct 170 ASGKGIPFGELFRERRAIVVFVRHFLCYTCKEYVEDLAKIPKSFLQDANVTLIVIGQSSF 229 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 E F + R+ E+Y+D E+ Y+ G+ + + + LS + Sbjct 230 QHIEPFCKLT-----RYSHEIYVDTERKIYRKLGMNKGEGIASSEQSPHVKSNLLSGSIQ 284 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IH N DH P+ +++ Sbjct 285 SLWRAVTG--PAFDFQGDPAQQGGTLILGPGNNIHFIHLDKNRLDHKPINSIL 335 >gb|AAI14901.1| Chromosome 1 open reading frame 93 ortholog [Bos taurus] gb|DAA21137.1| hypothetical protein LOC617001 [Bos taurus] Length=201 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 52/174 (30%), Positives = 84/174 (49%), Gaps = 15/174 (8%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L ++W++Q ++ LRRFGC +C A + +K LD GV++V VG ++F+ Sbjct 25 LRNLWQEQACVVAGLRRFGCMVCRWIARDLSNLKGLLDQHGVRLVGVGP-EALGLQEFL- 82 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 +G F E+Y+D + YK G +R L L R KA +L Sbjct 83 ----DGGYFAGELYLDESKQFYKELGFKRYNSLSILPAALGKPVREVAAKAKAVGIQGNL 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHF---AFNEYDHPVGTLVDNDQILAAVKATQP 183 GD LQ+GG+ +V G D +HF + +Y P+ +++ I A V ++P Sbjct 139 SGDLLQSGGLLVVAKGGDKVLLHFVQKSPGDY-APLESILQALGISAEVGPSEP 191 >ref|XP_002922464.1| PREDICTED: uncharacterized protein C10orf58-like [Ailuropoda melanoleuca] gb|EFB26948.1| hypothetical protein PANDA_011444 [Ailuropoda melanoleuca] Length=229 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 45/183 (25%), Positives = 86/183 (47%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + ++W+ +++ +RR GC LC E+A+ + +KP+LD GV + Sbjct 53 DLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKPKLDELGVPLYA 111 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR- 122 V E+ V F E+++D ++ Y G QR ++ F+ + + W Sbjct 112 V------VKEQIRTEVQDFQPYFKGEIFLDEKKKFY---GPQRRKMM-FMGFVRLGVWYN 161 Query 123 --KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 +A + +L+G+G GG+++VG G + + G V+ +L A + Sbjct 162 FFRARNGGFSGNLEGEGFILGGVFVVGSGKQGIL---LEHREKEFGDKVNPVSVLEAARK 218 Query 181 TQP 183 QP Sbjct 219 IQP 221 >ref|NP_001035688.1| hypothetical protein LOC617001 [Bos taurus] sp|Q58CY6.1|CA093_BOVIN RecName: Full=Uncharacterized protein C1orf93 homolog gb|AAX46658.1| hypothetical protein MGC26818 [Bos taurus] Length=201 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 46/144 (32%), Positives = 70/144 (49%), Gaps = 11/144 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L ++W++Q ++ LRRFGC +C A + +K LD GV++V VG ++F+ Sbjct 25 LRNLWQEQACVVAGLRRFGCMVCRWIARDLSNLKGLLDQHGVRLVGVGP-EALGLQEFL- 82 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 +G F E+Y+D + YK G +R L L R KA +L Sbjct 83 ----DGGYFAGELYLDESKQFYKELGFKRYNSLSILPAALGKPVREVAAKAKAVGIQGNL 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHF 156 GD LQ+GG+ +V G D +HF Sbjct 139 SGDLLQSGGLLVVAKGGDKVLLHF 162 >ref|NP_714542.1| hypothetical protein LOC195827 [Homo sapiens] sp|Q7RTV5.1|CI021_HUMAN RecName: Full=UPF0308 protein C9orf21 tpg|DAA00065.1| TPA_exp: C9ORF21 [Homo sapiens] emb|CAI40534.1| novel protein [Homo sapiens] gb|EAW92662.1| chromosome 9 open reading frame 21, isoform CRA_b [Homo sapiens] gb|AAI36504.1| Chromosome 9 open reading frame 21 [Homo sapiens] Length=226 Score = 64.7 bits (156), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 46/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I + L A V ++++G + Sbjct 42 ARGQRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPRSFLQEANVTLIVIGQSSY 101 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R + + LS + Sbjct 102 HHIEPFCKLT-----GYSHEIYVDPEREIYKRLGMKRGEEIASSGQSPHIKSNLLSGSLQ 156 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S WR P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 157 SLWRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFIHRDRNRLDHKPINSVL 207 >ref|XP_002924432.1| PREDICTED: uncharacterized protein C1orf93-like [Ailuropoda melanoleuca] Length=217 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. 
Identities = 45/144 (32%), Positives = 70/144 (49%), Gaps = 11/144 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +W++Q ++ LRRFGCS+C A + +K LD GV++V VG ++F+ Sbjct 25 LRSLWREQACVVAGLRRFGCSVCRWIAQDLSSLKGLLDQHGVRLVGVGP-EALGLQEFL- 82 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 +G F E+Y+D + Y+ G +R L + R KA +L Sbjct 83 ----DGGYFAGELYLDESKQCYRELGFRRYNGLSIVPAALGKPVRDVALKAKAVGIQGNL 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHF 156 GD LQ+GG+ +V G D +HF Sbjct 139 SGDLLQSGGLLVVTKGGDRVLLHF 162 >gb|ACN35248.1| unknown [Zea mays] Length=224 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 32/99 (33%), Positives = 55/99 (56%), Gaps = 6/99 (6%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 + V +L+ K + + D+WK+++ ++ R FGC LC ++A + + + AAGV +V Sbjct 83 QGVDVFDLSGK-TVPIVDLWKERKAVVAFARHFGCVLCRKRADLLAAKQDDMQAAGVALV 141 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKA 101 L+G G+ A+ F E +F EVY DP ++Y A Sbjct 142 LIGPGSVEQAKAFCEQT-----KFKGEVYADPTHSSYDA 175 >ref|XP_001496590.1| PREDICTED: similar to SFLQ611 [Equus caballus] Length=244 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 86/184 (47%), Gaps = 17/184 (9%) Query 3 NNIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIV 62 ++K LE + + ++W+ +++ +RR GC LC E+A + +KP+LD GV + Sbjct 67 TDLKTLE-KEPRTFKAKELWEKNGAVIMAVRRPGCFLCREEAMDLSLLKPKLDELGVPLY 125 Query 63 LVGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR 122 V E+ V F E+++D ++ Y G QR ++ L + + WR Sbjct 126 AV------VKEQLSTEVEDFQPYFKGEIFLDEKKKFY---GPQRRKMM-LLGFVRLGVWR 175 Query 123 ---KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVK 179 +A + +L+G+G GG+++VG G + + G V+ D +L A + Sbjct 176 NFFRAWDRGISGNLEGEGFILGGVFVVGSGRQGIL---LEHREKEFGDKVNVDSVLEAAR 232 Query 180 ATQP 183 +P Sbjct 233 KIKP 236 >ref|XP_002945792.1| hypothetical protein VOLCADRAFT_127337 [Volvox carteri f. 
nagariensis] gb|EFJ52787.1| hypothetical protein VOLCADRAFT_127337 [Volvox carteri f. nagariensis] Length=234 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 45/150 (30%), Positives = 76/150 (51%), Gaps = 20/150 (13%) Query 7 VLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGT 66 VL D E+V + +W+ Q + +++LRR GC LC ++A + ++KP+ + GV +V V Sbjct 16 VLRSRDGAEVVASTLWQSQPLAVLILRRPGCVLCRDEAQRLWKLKPEFERMGVGLVCV-- 73 Query 67 GNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKA-------RGLQRVGLLHFLSWTAIS 119 + + + G +P +Y DP + Y A RG GLL W+A+ Sbjct 74 VHEWIPREVNAFTSGF---WPGPLYHDPSKAFYAALNGGNPLRG-SIWGLL--FPWSAV- 126 Query 120 EWRK---ANKNHPNADLQGDGLQTGGIYLV 146 WR+ A++N P ++ GDG GG ++ Sbjct 127 -WRRIRTASRNVPEHNIVGDGFTMGGAMVL 155 >ref|XP_546736.2| PREDICTED: hypothetical protein XP_546736 [Canis familiaris] Length=201 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 53/167 (32%), Positives = 79/167 (48%), Gaps = 17/167 (10%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFA-EKFI 76 L +W +Q ++ LRRFGCS+C A + ++ LD GV+ LVG G ++F+ Sbjct 25 LRTLWLEQACVVAGLRRFGCSVCRWIARDLSSLRGLLDQHGVR--LVGVGPEVLGVQEFL 82 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNAD 132 +G F E+Y+D + Y+ G +R L L R KA + + Sbjct 83 -----DGGYFAGELYLDESKQFYRELGFKRYNSLSILPAALGKPVRDVALKAKAVGIHGN 137 Query 133 LQGDGLQTGGIYLVGPGADSA-IHFAFNEYDHPVGTLVDNDQILAAV 178 L GD LQ+GG+ +V G D +HF N P G V + IL A+ Sbjct 138 LSGDLLQSGGLLVVTKGGDKVLLHFVQNS---P-GDYVPRESILQAL 180 >ref|XP_001915482.1| PREDICTED: hypothetical protein [Equus caballus] Length=214 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 46/144 (32%), Positives = 69/144 (48%), Gaps = 11/144 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L D+W++Q ++ LRRFGC +C A + +K LD GV++V VG ++F+ Sbjct 38 LRDLWQEQACVVAGLRRFGCMVCRWIARDLSSLKGLLDQHGVRLVGVGP-EALGLQEFL- 95 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWT----AISEWRKANKNHPNADL 133 +G F E+Y+D + YK G +R L L KA +L Sbjct 96 ----DGGYFAGELYLDESKQFYKELGFKRYTSLSILPAALGKPVCDVAAKAKAVGIQGNL 151 Query 134 QGDGLQTGGIYLVGPGADSA-IHF 156 GD LQ+GG+ +V G D +HF Sbjct 152 SGDLLQSGGLLVVTKGGDKVLLHF 175 >sp|Q641F0.2|CJ058_XENLA RecName: Full=UPF0765 protein C10orf58 homolog; Flags: Precursor Length=227 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 39/132 (30%), Positives = 67/132 (51%), Gaps = 11/132 (8%) Query 20 DMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIENV 79 D+W+ +++ +RR GC LC E+AS + +KPQLD GV + + E V Sbjct 67 DLWERDGAVIMAVRRPGCFLCREEASGLSTLKPQLDQLGVPLYAI------VKENIGNEV 120 Query 80 PGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQGDG 137 F +V++D + Y + R + +GL+ W +R+A K +L+G+G Sbjct 121 EHFQPYFNGKVFLDAKGQFYGPQKRKMMLLGLVRLGVW---QNFRRAWKGGFEGNLEGEG 177 Query 138 LQTGGIYLVGPG 149 L GG++++G G Sbjct 178 LILGGMFVIGSG 189 >ref|NP_001087861.1| chromosome 10 open reading frame 58 [Xenopus laevis] gb|AAH82387.1| MGC81827 protein [Xenopus laevis] Length=210 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 39/132 (30%), Positives = 67/132 (51%), Gaps = 11/132 (8%) Query 20 DMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIENV 79 D+W+ +++ +RR GC LC E+AS + +KPQLD GV + + E V Sbjct 57 DLWERDGAVIMAVRRPGCFLCREEASGLSTLKPQLDQLGVPLYAI------VKENIGNEV 110 Query 80 PGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHFLSWTAISEWRKANKNHPNADLQGDG 137 F +V++D + Y + R + +GL+ W +R+A K +L+G+G Sbjct 111 EHFQPYFNGKVFLDAKGQFYGPQKRKMMLLGLVRLGVW---QNFRRAWKGGFEGNLEGEG 167 Query 138 LQTGGIYLVGPG 149 L GG++++G G Sbjct 168 LILGGMFVIGSG 179 >gb|ACO12061.1| C10orf58 homolog precursor [Lepeophtheirus salmonis] Length=216 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 47/171 (28%), Positives = 82/171 (48%), Gaps = 26/171 (15%) Query 20 DMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGT-----GNRYFAEK 74 D+W + +++++RR GC LC E+A ++IK L A + I LVG G FA Sbjct 61 DLWAKKGAVIMVVRRPGCILCREEALEFMKIKSDLSA--LDIPLVGIVHEEEGAEEFASN 118 Query 75 FIENVPGNGQRFPAEVYIDPEQTAY--KARGLQRVGLLHF-LSWTAISEWRKANKNHPNA 131 F + ++VY D + + K R + GLL+F W+K + Sbjct 119 FFTS---------SDVYFDINKKFFGPKERRIMLTGLLNFRFILKTFGAWKKG----VSG 165 Query 132 DLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQ 182 +L+GDG GG +++GPG++ ++ Y G V+ ++L+ K+ + Sbjct 166 NLEGDGSLLGGTFVMGPGSEGVLYEHRETY---FGDHVNMTEVLSIAKSLK 213 >ref|XP_002463942.1| hypothetical protein SORBIDRAFT_01g009350 [Sorghum bicolor] gb|EER90940.1| hypothetical protein SORBIDRAFT_01g009350 [Sorghum bicolor] Length=260 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 48/157 (31%), Positives = 75/157 (48%), Gaps = 17/157 (10%) Query 1 ASNNIKVLELADKKELVLADMWKDQR--VLLILLRRFGCSLCHEQASHVLEIKPQLDAAG 58 A ++++ A + + D+W ++ LLR FGC C E AS + + K + D+AG Sbjct 75 ALGDVEIYSAASGEPVPFRDLWDQNEGVAVVALLRHFGCPCCWELASVLRDTKEKFDSAG 134 Query 59 VKIVLVGTGNRYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQ-RVGLLHFLSWT 116 VK++ VG G A E +P FP E +Y DP++ AY GL VG F + Sbjct 135 VKLIAVGVGTPAKARILAERLP-----FPLEYLYADPDRKAYNLLGLYFGVGRTFFNPAS 189 Query 117 A-----ISEWRKANKNHP---NADLQGDGLQTGGIYL 145 A ++A KN+ D + LQ GG+++ Sbjct 190 AKVFSRFDSLKEAVKNYTMEATPDDRAGVLQQGGMFV 226 >ref|NP_001106912.1| prostamide/PG F synthase [Sus scrofa] dbj|BAF96021.1| prostamide/PG F synthase [Sus scrofa] Length=202 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 45/144 (32%), Positives = 69/144 (48%), Gaps = 11/144 (7%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 L +W++Q ++ LRRFGC +C A + +K LD GV++V VG ++F+ Sbjct 25 LRSLWQEQACVVAGLRRFGCMVCRWIARDLSSLKGLLDQHGVRLVGVGP-EALGLQEFL- 82 Query 78 NVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR----KANKNHPNADL 133 +G F ++Y+D + YK G +R L L R KA +L Sbjct 83 ----DGGYFAGDLYLDESKQFYKELGFKRYSSLSILPAALGKPVRDVAAKAKAAGIQGNL 138 Query 134 QGDGLQTGGIYLVGPGADSA-IHF 156 GD LQ+GG+ +V G D +HF Sbjct 139 SGDLLQSGGLLVVAKGGDKVLLHF 162 >gb|ADE77692.1| unknown [Picea sitchensis] Length=276 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 46/138 (34%), Positives = 71/138 (52%), Gaps = 17/138 (12%) Query 20 DMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIE 77 D+W K+ ++ LLR FGC C E AS + ++ P+ D+AGVK++ +G G A E Sbjct 110 DLWDQKNGTAVVALLRHFGCPCCWEFASTLKDVMPKFDSAGVKLIAIGVGTPEKARILGE 169 Query 78 NVPGNGQRFPAE-VYIDPEQTAYKARGLQR-VGLLHFLSWTA-----ISEWRKANKNHPN 130 +P FP + +Y DP++ AY A GL +G F +A +KA KN+ Sbjct 170 RLP-----FPLDSLYADPDRKAYDALGLYYGLGRTFFNPASAKVLTRFDSLQKALKNYTI 224 Query 131 ADLQGDG---LQTGGIYL 145 + D LQ GG+++ Sbjct 225 SATPEDRSSVLQQGGMFV 242 >ref|XP_002191279.1| PREDICTED: hypothetical protein [Taeniopygia guttata] Length=222 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 50/185 (28%), Positives = 88/185 (48%), Gaps = 25/185 (13%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 AD + + ++ +Q+ +++ +R F C C E + ++ K L + V+++++G + Sbjct 38 ADGRSVPFQALFAEQKAIVLFVRNFLCYTCKEYVEDLAKVPKAFLQESNVRLIVIGQSSY 97 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQR-------VGLLHFLSWTAI---- 118 + + F ++ G + E+Y+DP + YK G++R V H S T + Sbjct 98 HHIKPFC-SLTG----YTHEMYVDPPREIYKILGMKRGEGNKASVRSPHVKSNTFLGSIR 152 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLVDNDQILA 176 S WR P D QGD Q GG ++GPG + +H N DH P+ T++ LA Sbjct 153 SIWRAMTG--PAFDFQGDPAQQGGALIIGPGNEVHFLHLDKNRLDHVPINTVLQ----LA 206 Query 177 AVKAT 181 VK Sbjct 207 GVKTV 211 >ref|XP_002320012.1| predicted protein [Populus trichocarpa] gb|EEE98327.1| predicted protein [Populus trichocarpa] Length=199 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 46/142 (33%), Positives = 69/142 (49%), Gaps = 17/142 (11%) Query 16 LVLADMWKDQR--VLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAE 73 ++ D+W ++ LLR FGC C E AS + E K + D++GVK++ +G G A Sbjct 29 VMFKDLWDQNEGIAVVALLRHFGCPCCWELASSLKESKEKFDSSGVKLIAIGVGTPNKAR 88 Query 74 KFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQR-VGLLHFLSWTA-----ISEWRKANK 126 E +P FP + +Y DPE+ AY GL +G F +A RKA K Sbjct 89 LLAERLP-----FPMDCLYADPERKAYDVLGLYYGLGRTFFNPASAKVFSRFDALRKAVK 143 Query 127 NHP---NADLQGDGLQTGGIYL 145 N+ D + LQ GG+++ Sbjct 144 NYTIEATPDDRSGVLQQGGMFV 165 >ref|NP_001180455.1| selenoprotein U [Taeniopygia guttata] ref|NP_001180456.1| selenoprotein U [Taeniopygia guttata] Length=224 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 41/173 (24%), Positives = 78/173 (46%), Gaps = 10/173 (5%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRY 70 ++K+ ++WK +++ +RR G LC E+AS + +KPQL GV + V Sbjct 59 SEKRTFKAGELWKQNGAVIMAVRRPGUFLCREEASELSSLKPQLSKLGVPLYAV------ 112 Query 71 FAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWRKANKNHPN 130 E V F E+++D ++ Y R +++ L F + +A ++ + Sbjct 113 VKENIGTEVEDFQHYFKGEIFLDEKKGFYGPR-RRKMMLSGFFRLGVWQNFVRAWRSGYS 171 Query 131 ADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQP 183 +L+G+G GG+Y++G G + + G V +L A + +P Sbjct 172 GNLEGEGFTLGGVYVIGAGRQGVL---LEHREKEFGDKVSLPSVLEAAEKIKP 221 >ref|NP_001008167.1| chromosome 9 open reading frame 21 [Xenopus (Silurana) tropicalis] gb|AAH82485.1| MGC88866 protein [Xenopus (Silurana) tropicalis] emb|CAJ81370.1| novel protein [Xenopus (Silurana) tropicalis] Length=227 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 43/168 (26%), Positives = 78/168 (47%), Gaps = 25/168 (14%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 D+++DQ+ +++L+R F C C E + +I L+ A V+++++G + + F Sbjct 50 FGDLYRDQKTIVVLVRNFLCYTCKEYVEDLAKIPSSALEDANVRLIVIGQSSYIHIKHFC 109 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGL-----------LHFLSWTAISEWRKAN 125 +P ++Y+D ++ Y G+ + + +S + S WR Sbjct 110 SLT-----SYPYDMYVDTDREIYCKLGMMKGETSTSSGKSTHVKSNIISGSIKSVWRAMT 164 Query 126 KNHPNADLQGDGLQTGGIYLVGPGADSAIHFA---FNEYDH-PVGTLV 169 P D QGD Q GG +VGPG + +HF N D P+G+L+ Sbjct 165 S--PAFDFQGDPAQQGGSLVVGPG--NRVHFLHRDMNRLDQAPIGSLL 208 >ref|NP_001029771.1| hypothetical protein LOC534049 precursor [Bos taurus] sp|Q3ZBK2.1|CJ058_BOVIN RecName: Full=UPF0765 protein C10orf58 homolog; Flags: Precursor gb|AAI03250.1| Chromosome 10 open reading frame 58 ortholog [Bos taurus] gb|DAA14262.1| hypothetical protein LOC534049 precursor [Bos taurus] Length=218 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 42/167 (26%), Positives = 82/167 (50%), Gaps = 18/167 (10%) Query 21 MWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGNRYFAEKFIENVP 80 +W+ +++ +RR GC LC E+A+ + +KP+LD GV + V ++ I+N Sbjct 58 LWEKNGAVIMAVRRPGCFLCREEATDLSSLKPKLDELGVPLYAV-------VKEHIKNEV 110 Query 81 GNGQ-RFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR---KANKNHPNADLQGD 136 + Q F E+++D + Y G QR ++ F+ + + W+ +A + +L G+ Sbjct 111 KDFQPYFKGEIFLDENKKFY---GPQRRKMM-FMGFVRLGVWQNFFRAWNGGFSGNLDGE 166 Query 137 GLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKATQP 183 G GG++++GPG + + G V+ +L A + +P Sbjct 167 GFILGGVFVMGPGKQGIL---LEHREKEFGDKVNLTSVLEAARKIRP 210 >ref|XP_002756224.1| PREDICTED: uncharacterized protein C10orf58-like [Callithrix jacchus] Length=229 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 47/183 (26%), Positives = 86/183 (47%), Gaps = 17/183 (9%) Query 4 NIKVLELADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVL 63 ++K LE + + L ++W+ +++ +RR GC LC E+A+ + +KP+LD GV + Sbjct 53 DLKTLE-NEPRTLKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKPKLDELGVPLYA 111 Query 64 VGTGNRYFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLLHFLSWTAISEWR- 122 V E V F EV++D ++ Y G QR ++ F+ + + W Sbjct 112 V------VKEHIKTEVKDFQPYFKGEVFLDEKKKFY---GPQRRKMM-FMGFIRLGVWYN 161 Query 123 --KANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAFNEYDHPVGTLVDNDQILAAVKA 180 +A + +L+G+G GG+++VG G + + G V+ +L A K Sbjct 162 FFRAWNGGFSGNLEGEGFVLGGVFVVGSGKQGIL---LEHREKEFGDKVNLLSVLEAAKM 218 Query 181 TQP 183 +P Sbjct 219 IKP 221 >ref|NP_001092155.1| hypothetical protein LOC100049742 [Xenopus laevis] gb|AAI41728.1| LOC100049742 protein [Xenopus laevis] Length=228 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 44/173 (26%), Positives = 82/173 (48%), Gaps = 21/173 (12%) Query 18 LADMWKDQRVLLILLRRFGCSLCHEQASHVLEIKPQ-LDAAGVKIVLVGTGNRYFAEKFI 76 D++++++ +++ +R F C C E + +I L+ A V+++++G + E F Sbjct 51 FGDLYRERKTIVVFVRNFLCYTCKEYVEDLAKIPSSALEDANVRLIVIGQSSYIHIEHFC 110 Query 77 ENVPGNGQRFPAEVYIDPEQTAYKARGLQRVGLL-----------HFLSWTAISEWRKAN 125 ++ G +P E+Y+D ++T Y G+++ + LS + S WR Sbjct 111 -SLTG----YPYEMYVDTDRTIYSKLGMKKGETSTSSGRSPHVKSNILSGSIKSIWRAMT 165 Query 126 KNHPNADLQGDGLQTGGIYLVGPGAD-SAIHFAFNEYDH-PVGTLVDNDQILA 176 P D QGD Q GG +VGPG +H N D P+ +L+ + + A Sbjct 166 S--PAFDFQGDPAQQGGSLIVGPGNRVQFLHRDMNRLDQTPINSLLQHAGVQA 216 >ref|XP_002263959.1| PREDICTED: hypothetical protein [Vitis vinifera] Length=255 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 49/148 (34%), Positives = 71/148 (48%), Gaps = 19/148 (12%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + ++ D+W K+ ++ LLR FGC C E AS + E K + D+AGVK++ VG G Sbjct 80 ASGESVLFKDLWDQKEGVAVVALLRHFGCFCCWELASALKESKARFDSAGVKLIAVGVGT 139 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFL----SWTAISEWRK 123 A E +P FP + +Y DP++ AY GL GL L S S + Sbjct 140 PNKACILAERLP-----FPMDCLYADPDRKAYDVLGLY-YGLSRTLFSPASAKVFSRFES 193 Query 124 ANKNHPNADLQGDG------LQTGGIYL 145 K N L+G LQ GG+++ Sbjct 194 LQKALKNYTLEGTPDDKSGVLQQGGMFV 221 >emb|CAN81555.1| hypothetical protein VITISV_040397 [Vitis vinifera] Length=201 Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 49/148 (34%), Positives = 71/148 (48%), Gaps = 19/148 (12%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + ++ D+W K+ ++ LLR FGC C E AS + E K + D+AGVK++ VG G Sbjct 26 ASGESVLFKDLWDQKEGVAVVALLRHFGCFCCWELASALKESKARFDSAGVKLIAVGVGT 85 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFL----SWTAISEWRK 123 A E +P FP + +Y DP++ AY GL GL L S S + Sbjct 86 PNKACILAERLP-----FPMDCLYADPDRKAYDVLGLY-YGLSRTLFSPASAKVFSRFES 139 Query 124 ANKNHPNADLQGDG------LQTGGIYL 145 K N L+G LQ GG+++ Sbjct 140 LQKALKNYTLEGTPDDKSGVLQQGGMFV 167 >ref|NP_001051154.1| Os03g0729300 [Oryza sativa Japonica Group] gb|AAO38466.1| unknown protein [Oryza sativa Japonica Group] gb|ABF98681.1| UPF0308 protein, chloroplast precursor, putative, expressed [Oryza sativa Japonica Group] dbj|BAF13068.1| Os03g0729300 [Oryza sativa Japonica Group] dbj|BAG98418.1| unnamed protein product [Oryza sativa Japonica Group] Length=259 Score = 62.0 bits (149), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 52/161 (33%), Positives = 73/161 (46%), Gaps = 27/161 (16%) Query 1 ASNNIKVLELADKKELVLADMWKDQR--VLLILLRRFGCSLCHEQASHVLEIKPQLDAAG 58 A + VL + + L D+W ++ LLR FGC C E AS + E + DAAG Sbjct 77 ALGGVSVLAAGTGEAVQLRDLWDPTEGVAVVALLRHFGCFCCWELASVLKESMAKFDAAG 136 Query 59 VKIVLVGTGNRYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFLSWTA 117 K++ +G G A + +P FP + +Y DPE+ AY +GL H L T Sbjct 137 AKLIAIGVGTPDKARILADGLP-----FPVDSLYADPERKAYDV-----LGLYHGLGRTL 186 Query 118 IS---------EWRKANKNH----PNADLQGDGLQTGGIYL 145 IS +K KN+ ADL G LQ GG+ + Sbjct 187 ISPAKMYSGLNSIKKVTKNYTLKGTPADLTGI-LQQGGMLV 226 >ref|NP_001051153.1| Os03g0729200 [Oryza sativa Japonica Group] gb|AAO38464.1| hypothetical protein [Oryza sativa Japonica Group] gb|ABF98679.1| expressed protein [Oryza sativa Japonica Group] dbj|BAF13067.1| Os03g0729200 [Oryza sativa Japonica Group] gb|EAY91738.1| hypothetical protein OsI_13379 [Oryza sativa Indica Group] dbj|BAG94118.1| unnamed protein product [Oryza sativa Japonica Group] gb|EEE59858.1| hypothetical protein OsJ_12440 [Oryza sativa Japonica Group] Length=258 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 47/157 (30%), Positives = 74/157 (48%), Gaps = 17/157 (10%) Query 1 ASNNIKVLELADKKELVLADMWKDQRVLLI--LLRRFGCSLCHEQASHVLEIKPQLDAAG 58 A + + A + ++ D+W + + LLR FGC C E AS + + K + D+AG Sbjct 73 ALGGVAIYSAATGEPVLFRDLWDQNEGMAVVALLRHFGCPCCWELASVLRDTKERFDSAG 132 Query 59 VKIVLVGTGNRYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQ-RVGLLHFLSWT 116 VK++ VG G A E +P FP + +Y DPE+ AY GL +G F + Sbjct 133 VKLIAVGVGTPDKARILAERLP-----FPLDYLYADPERKAYDLLGLYFGIGRTFFNPAS 187 Query 117 A-----ISEWRKANKNH---PNADLQGDGLQTGGIYL 145 A ++A KN+ D + LQ GG+++ Sbjct 188 ASVFSRFDSLKEAVKNYTIEATPDDRASVLQQGGMFV 224 >gb|EAY91739.1| hypothetical protein OsI_13380 [Oryza sativa Indica Group] Length=259 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 52/161 (33%), Positives = 73/161 (46%), Gaps = 27/161 (16%) Query 1 ASNNIKVLELADKKELVLADMWKDQR--VLLILLRRFGCSLCHEQASHVLEIKPQLDAAG 58 A + VL + + L D+W ++ LLR FGC C E AS + E + DAAG Sbjct 77 ALGGVSVLAAGTGEAVQLRDLWDPTEGVAVVALLRHFGCFCCWELASVLKESMAKFDAAG 136 Query 59 VKIVLVGTGNRYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFLSWTA 117 K++ +G G A + +P FP + +Y DPE+ AY +GL H L T Sbjct 137 AKLIAIGVGTPDKARILADGLP-----FPVDSLYADPERKAYDV-----LGLYHGLGRTL 186 Query 118 IS---------EWRKANKNH----PNADLQGDGLQTGGIYL 145 IS +K KN+ ADL G LQ GG+ + Sbjct 187 ISPAKMYSGLNSIKKVTKNYTLKGTPADLTGI-LQQGGMLV 226 >ref|XP_002269002.1| PREDICTED: hypothetical protein, partial [Vitis vinifera] Length=223 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 49/148 (34%), Positives = 70/148 (48%), Gaps = 19/148 (12%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + ++ D+W K+ ++ LLR FGC C E AS + E K D+AGVK++ VG G Sbjct 48 ASGESVLFKDLWDQKEGVAVVALLRHFGCFCCWELASALKESKATFDSAGVKLIAVGVGT 107 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFL----SWTAISEWRK 123 A E +P FP + +Y DP++ AY GL GL L S S + Sbjct 108 PNKACILAERLP-----FPMDCLYADPDRKAYDVLGLY-YGLSRTLFSPASAKVFSRFES 161 Query 124 ANKNHPNADLQGDG------LQTGGIYL 145 K N L+G LQ GG+++ Sbjct 162 LQKALKNYTLEGTPDDKSGVLQQGGMFV 189 >emb|CBI33289.3| unnamed protein product [Vitis vinifera] Length=255 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 49/148 (34%), Positives = 70/148 (48%), Gaps = 19/148 (12%) Query 11 ADKKELVLADMW--KDQRVLLILLRRFGCSLCHEQASHVLEIKPQLDAAGVKIVLVGTGN 68 A + ++ D+W K+ ++ LLR FGC C E AS + E K D+AGVK++ VG G Sbjct 80 ASGESVLFKDLWDQKEGVAVVALLRHFGCFCCWELASALKESKATFDSAGVKLIAVGVGT 139 Query 69 RYFAEKFIENVPGNGQRFPAE-VYIDPEQTAYKARGLQRVGLLHFL----SWTAISEWRK 123 A E +P FP + +Y DP++ AY GL GL L S S + Sbjct 140 PNKACILAERLP-----FPMDCLYADPDRKAYDVLGLY-YGLSRTLFSPASAKVFSRFES 193 Query 124 ANKNHPNADLQGDG------LQTGGIYL 145 K N L+G LQ GG+++ Sbjct 194 LQKALKNYTLEGTPDDKSGVLQQGGMFV 221 >ref|XP_848380.1| PREDICTED: similar to UPF0308 protein C9orf21 [Canis familiaris] Length=230 Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 46/175 (27%), Positives = 84/175 (48%), Gaps = 25/175 (14%) Query 11 ADKKELVLADMWKDQRVLLILLRRFGCSLCHEQASHVLEI-KPQLDAAGVKIVLVGTGNR 69 A + + +++++R +++ +R F C +C E + +I K L A + ++++G + Sbjct 46 ASGRRVPFGALFRERRAVVVFVRHFLCYICKEYVEDLAKIPKSVLQEADITLIVIGQSSY 105 Query 70 YFAEKFIENVPGNGQRFPAEVYIDPEQTAYKARGLQR-VGLL----------HFLSWTAI 118 + E F + + E+Y+DPE+ YK G++R G+ + LS + Sbjct 106 HHIEPFCKLT-----GYSHEIYVDPEREIYKKLGMKRGEGIASSGKSPHIKSNILSGSIR 160 Query 119 SEWRKANKNHPNADLQGDGLQTGGIYLVGPGADSAIHFAF---NEYDH-PVGTLV 169 S R P D QGD Q GG ++GPG + IHF N DH P+ +++ Sbjct 161 SLCRAVTG--PLFDFQGDPAQQGGTLILGPGNN--IHFIHRDRNRLDHKPINSVL 211 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Mar 14, 2011 11:46 AM Number of letters in database: 286,552,912 Number of sequences in database: 13,377,472 Lambda K H 0.319 0.137 0.414 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 13377472 Number of Hits to DB: 179902852 Number of extensions: 7692880 Number of successful extensions: 13521 Number of sequences better than 100: 42 Number of HSP's better than 100 without gapping: 0 Number of HSP's gapped: 13484 
Number of HSP's successfully gapped: 42
Length of query: 183
Length of database: 4581520208
Length adjustment: 130
Effective length of query: 53
Effective length of database: 2842448848
Effective search space: 150649788944
Effective search space used: 150649788944
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 68 (30.8 bits)
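
The footer values can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^(-S')) and assumes that the length adjustment is subtracted once from the query and once per database sequence. The function and constant names are illustrative. Because lambda and K are printed to only three significant digits, the recomputed values should match the report only approximately. The last check notes that the "Number of letters in database" figure printed in the footer (286,552,912) differs from the database length by exactly 2^32; reading that as a 32-bit wraparound in the report is an assumption.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 183
DB_LEN = 4_581_520_208
NUM_SEQS = 13_377_472
LENGTH_ADJUSTMENT = 130
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_len, num_seqs, adjustment):
    """Return (effective query length, effective db length, search space).

    Assumption: the adjustment is removed once from the query and once
    from every database sequence.
    """
    eff_query = query_len - adjustment
    eff_db = db_len - num_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space):
    """Approximate E-value for a raw score in the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LEN, NUM_SEQS, LENGTH_ADJUSTMENT
)

# Difference between the database length and the "Number of letters in
# database" figure printed in the footer (286,552,912).
letters_gap = DB_LEN - 286_552_912

if __name__ == "__main__":
    print("effective query length:", eff_query)
    print("effective database length:", eff_db)
    print("effective search space:", space)
    print("bits for raw score 157:", round(bit_score(157), 1))
    print("E for raw score 157: %.1e" % expect_value(157, space))
    print("letters gap equals 2**32:", letters_gap == 2 ** 32)
```

For the Canis familiaris hit reported as "65.1 bits (157), Expect = 4e-09", the recomputed bit score and E-value agree with the report to the printed precision.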