TBLASTN 2.2.19 [Nov-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= DDB0304589 |DDB_G0277015 |Protein| gene: msp on chromosome: 2 position 7225040 to 7225867 (243 letters) Database: genome.fa 799 sequences; 32,967,507 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value scaffold_17 297 5e-81 scaffold_168 28 4.9 >scaffold_17 Length = 153674 Score = 297 bits (760), Expect = 5e-81, Method: Compositional matrix adjust. Identities = 154/266 (57%), Positives = 178/266 (66%), Gaps = 24/266 (9%) Frame = -1 Query: 1 MSLF-NLPKVDLGEDCEGGVUARPXXXXXXINN-ASDESTPLISKTNDEEK-ANIGISST 57 MS F NLPKV++G+DCEGGV ARP IN DESTPLISKT+DEEK S + Sbjct: 39995 MSFFKNLPKVEVGDDCEGGV*ARPTSSASSINKTVGDESTPLISKTSDEEKDIACSSSPS 39816 Query: 58 SNSPQEEQTKKPXXXXXXXXXXXXPALVGSUCWPXXXXXXXXXXXXXXXXXXXHSLTFAI 117 S+ PQEEQ+KKP PALVGS CWP H++T+AI Sbjct: 39815 SDQPQEEQSKKPLIISILTVLISIPALVGS*CWPILLSSILGIAVSGASVTFAHAITYAI 39636 Query: 118 TLSILSNLAQYHFHKCKKRPSDRGHWIKFGPFYLTAIAVPLATFDILRHILVDNSIW--- 174 TL+ILSNLAQYH+HKC+ RP ++GHW K+GPFYLT +AVPLATFDILRHILVDN +W Sbjct: 39635 TLAILSNLAQYHYHKCQSRPREKGHWHKYGPFYLTLLAVPLATFDILRHILVDNQLWVNR 39456 Query: 175 ------------------TIHSFISPAAYRPGCENENITCLSVMGWFSAIVFTYTGYACL 216 T+HSF+SPAAYRPGC +ENI CLSVMGWFSAI+FTYTGYA L Sbjct: 39455 HKFKK*KKNFLLIFLCY*TVHSFVSPAAYRPGCTHENIRCLSVMGWFSAIIFTYTGYALL 39276 Query: 217 LVGTIWAADLIPKIKKVWTQLRPSKK 242 L GTIW A+L KIKK+W QLRP KK Sbjct: 39275 LTGTIWCAELHIKIKKLWQQLRPKKK 39198 >scaffold_168 Length = 64553 Score = 28.5 bits (62), Expect = 4.9, Method: Compositional matrix adjust. Identities = 13/30 (43%), Positives = 18/30 (60%) Frame = -1 Query: 190 ENENITCLSVMGWFSAIVFTYTGYACLLVG 219 E + L+ +GW S + TY+ Y CLLVG Sbjct: 26417 EGDTYFSLASLGWGSGV--TYSTYGCLLVG 26334 Score = 28.5 bits (62), Expect = 5.3, Method: Compositional matrix adjust. Identities = 12/23 (52%), Positives = 16/23 (69%) Frame = -2 Query: 197 LSVMGWFSAIVFTYTGYACLLVG 219 L+ +GW S + TY+ Y CLLVG Sbjct: 29128 LASLGWESGV--TYSAYGCLLVG 29066 Database: genome.fa Posted date: Jan 16, 2009 12:50 PM Number of letters in database: 32,967,507 Number of sequences in database: 799 Lambda K H 0.321 0.135 0.434 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 799 Number of Hits to DB: 8,224,664 Number of extensions: 118432 Number of successful extensions: 491 Number of sequences better than 10.0: 3 Number of HSP's gapped: 487 Number of HSP's successfully gapped: 4 Length of query: 243 Length of database: 10,989,169 Length adjustment: 98 Effective length of query: 145 Effective length of database: 10,910,867 Effective search space: 1582075715 Effective search space used: 1582075715 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 49 (23.5 bits)