TBLASTN 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= SPP00000109_2.0 # Protein # Selenoprotein U1 (SelU1) # Human (Homo sapiens) (229 letters) Database: genome.fa 47,351 sequences; 2,188,353,730 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KN328551.1 Gavialis gangeticus isolate Ggan-Ray unplaced genomic... 93 9e-20 >KN328551.1 Gavialis gangeticus isolate Ggan-Ray unplaced genomic scaffold scaffold5538, whole genome shotgun sequence Length = 130713 Score = 92.8 bits (229), Expect = 9e-20, Method: Compositional matrix adjust. Identities = 42/56 (75%), Positives = 49/56 (87%) Frame = +1 Query: 137 KKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGSGKQ 192 +KKFYGP +RKM+FMG IR+GVW NF RAW GFSGNLEGEG ILGGV+V+G+GKQ Sbjct: 90067 QKKFYGPHKRKMLFMGLIRVGVWQNFLRAWKNGFSGNLEGEGVILGGVYVMGAGKQ 90234 Score = 76.3 bits (186), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 38/47 (80%), Positives = 42/47 (89%) Frame = +1 Query: 91 EAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQPYFKGEIFLDEK 137 EA++LSSLK LDQLGVPLYAVVKE I +EV+DFQ YFKGEIFLDEK Sbjct: 88798 EASELSSLKPQLDQLGVPLYAVVKEKIGSEVEDFQHYFKGEIFLDEK 88938 Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 30/52 (57%), Positives = 38/52 (73%) Frame = +2 Query: 178 GFILGGVFVVGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 229 GFI + + QG+LLEHREKEFGDKV+L SVLEAA+ IKPQ +E++ Sbjct: 91937 GFI---IHLSSYSNQGVLLEHREKEFGDKVSLPSVLEAAEKIKPQAS*AEEQ 92083 Score = 52.4 bits (124), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 23/31 (74%), Positives = 27/31 (87%) Frame = +2 Query: 60 EPRTFKAKELWEKNGAVIMAVRRPGCFLCRE 90 E TF+A +LW+KNG+VIMAVRRPG FLCRE Sbjct: 84791 EQETFRAGDLWKKNGSVIMAVRRPG*FLCRE 84883 Score = 52.0 bits (123), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 26/57 (45%), Positives = 34/57 (59%) Frame = +3 Query: 1 MSFLQDPSFFTMGMWSIXXXXXXXXXXXXXXXNTDVFLSKPQKAALEYLEDIDLKTL 57 MS + D F MG+WSI NTD+ LSKP+KA +E+LE+I+LKTL Sbjct: 84072 MSSVPDFGLFAMGIWSIGLGALGAAVTGIILANTDLLLSKPEKATVEFLEEIELKTL 84242 Database: genome.fa Posted date: Sep 15, 2015 12:21 AM Number of letters in database: 2,188,353,730 Number of sequences in database: 47,351 Lambda K H 0.322 0.140 0.420 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 47351 Number of Hits to DB: 554,606,311 Number of extensions: 7940496 Number of successful extensions: 26474 Number of sequences better than 10.0: 3 Number of HSP's gapped: 26472 Number of HSP's successfully gapped: 7 Length of query: 229 Length of database: 729,451,243 Length adjustment: 125 Effective length of query: 104 Effective length of database: 723,532,368 Effective search space: 75247366272 Effective search space used: 75247366272 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 50 (23.9 bits)