TBLASTN 2.2.17 [Aug-26-2007] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= Nova_selenoproteina (244 letters) Database: transcripts.fa 5157 sequences; 11,152,142 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value Plasmodium_k_strain_H|PK4.chr03|PKH_031110|Annotation|Plasmodium... 437 e-124 Plasmodium_k_strain_H|PK4.chr05|PKH_051050|Annotation|Plasmodium... 31 0.27 Plasmodium_k_strain_H|PK4.chr09|PKH_094030|Annotation|Plasmodium... 28 1.8 Plasmodium_k_strain_H|PK4.chr09|PKH_092080|Annotation|Plasmodium... 28 2.4 Plasmodium_k_strain_H|PK4.chr06|PKH_061730|Annotation|Plasmodium... 28 2.5 Plasmodium_k_strain_H|chr04|PKH_040300|Annotation|Plasmodium_kno... 27 4.7 Plasmodium_k_strain_H|PK4.chr13|PKH_133600|Annotation|Plasmodium... 27 5.1 Plasmodium_k_strain_H|PK4.chr08|PKH_082780|Annotation|Plasmodium... 27 5.5 Plasmodium_k_strain_H|PK4.chr11.pseudo.embl|PKH_113370|Annotatio... 26 8.3 >Plasmodium_k_strain_H|PK4.chr03|PKH_031110|Annotation|Plasmodium_kn owlesi_Sanger|(processed_transcript)hypothetical protein Length = 672 Score = 437 bits (1125), Expect = e-124, Method: Composition-based stats. Identities = 223/223 (100%), Positives = 223/223 (100%) Frame = +1 Query: 1 MFCSATTLLQFYFLLLLHFYGNWSRRVVFCGNCHLGTCKMLCGQEGLPKKPFVGTNTNVR 60 MFCSATTLLQFYFLLLLHFYGNWSRRVVFCGNCHLGTCKMLCGQEGLPKKPFVGTNTNVR Sbjct: 1 MFCSATTLLQFYFLLLLHFYGNWSRRVVFCGNCHLGTCKMLCGQEGLPKKPFVGTNTNVR 180 Query: 61 MYKYQAGLRMREYMDAHKDDLQNEAKDPPLIGRFIRSGKKSTPDLLNTILFRAYIFGYEK 120 MYKYQAGLRMREYMDAHKDDLQNEAKDPPLIGRFIRSGKKSTPDLLNTILFRAYIFGYEK Sbjct: 181 MYKYQAGLRMREYMDAHKDDLQNEAKDPPLIGRFIRSGKKSTPDLLNTILFRAYIFGYEK 360 Query: 121 IRVGAFTEGSLVHVGILISSFCENVVVDTKSIYTCKQQGECATERVSISPKGACNNLRVI 180 IRVGAFTEGSLVHVGILISSFCENVVVDTKSIYTCKQQGECATERVSISPKGACNNLRVI Sbjct: 361 IRVGAFTEGSLVHVGILISSFCENVVVDTKSIYTCKQQGECATERVSISPKGACNNLRVI 540 Query: 181 LYVCSLWHAHLRGAWTHTHQLYAHLRIDICASARMCIRPRAHV 223 LYVCSLWHAHLRGAWTHTHQLYAHLRIDICASARMCIRPRAHV Sbjct: 541 LYVCSLWHAHLRGAWTHTHQLYAHLRIDICASARMCIRPRAHV 669 >Plasmodium_k_strain_H|PK4.chr05|PKH_051050|Annotation|Plasmodium_kn owlesi_Sanger|(processed_transcript)kir protein Length = 1806 Score = 30.8 bits (68), Expect = 0.27, Method: Composition-based stats. Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 1/61 (1%) Frame = -3 Query: 11 FYFLLLLHFYGNWSRRVVFCGNCHLGTCKMLCGQEGLPK-KPFVGTNTNVRMYKYQAGLR 69 FY L + FY + +FC NC LG + +P+ P++G ++ R Y G+R Sbjct: 493 FYILPYIFFYIRYLPHFLFCRNC-LGNQIIFSLDRNIPR*CPYIGGYSSARRAAYSVGVR 317 Query: 70 M 70 + Sbjct: 316 I 314 >Plasmodium_k_strain_H|PK4.chr09|PKH_094030|Annotation|Plasmodium_kn owlesi_Sanger|(processed_transcript)hypothetical protein, conserved in Plasmodium species Length = 903 Score = 28.1 bits (61), Expect = 1.8, Method: Composition-based stats. Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 7/51 (13%) Frame = +1 Query: 47 LPKKPFVGTNTNVRMYKYQAGLRMRE-------YMDAHKDDLQNEAKDPPL 90 +P PF+ +N N+ YKY + E Y + KD +N+ D P Sbjct: 220 IPINPFIDSNENLNKYKYGVEKKKAERYTGVQVYEEDDKDHKKNQPIDYPF 372 >Plasmodium_k_strain_H|PK4.chr09|PKH_092080|Annotation|Plasmodium_know lesi_Sanger|(processed_transcript)hypothetical protein, conserved in Plasmodium species Length = 1884 Score = 27.7 bits (60), Expect = 2.4, Method: Composition-based stats. Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 4/55 (7%) Frame = +1 Query: 124 GAFTEGSLVHVGILISSFCENVVVDTKSIYTCKQQGECATER----VSISPKGAC 174 G+F + SL H S C+N K C +QG C +++ S S G C Sbjct: 1099 GSFPDSSLCH-----SKHCDNFRKGVKCNSNCTKQGSCCSDKHPGGNSTSEGGTC 1248 >Plasmodium_k_strain_H|PK4.chr06|PKH_061730|Annotation|Plasmodium_know lesi_Sanger|(processed_transcript)hypothetical protein, conserved in Plasmodium species Length = 4560 Score = 27.7 bits (60), Expect = 2.5, Method: Composition-based stats. Identities = 18/53 (33%), Positives = 27/53 (50%) Frame = -3 Query: 166 VSISPKGACNNLRVILYVCSLWHAHLRGAWTHTHQLYAHLRIDICASARMCIR 218 VS+ P GA + R++L +CS H H+ + LYA+ + C R IR Sbjct: 1387 VSLHP-GAYSPGRILLKICSYIHQHVH*GVIYLTHLYAYRKNRCCTLQRERIR 1232 >Plasmodium_k_strain_H|chr04|PKH_040300|Annotation|Plasmodium_knowlesi _Sanger|(processed_transcript)ATP synthase F1, alpha subunit, putative Length = 1659 Score = 26.9 bits (58), Expect = 4.7, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 16/34 (47%) Frame = +3 Query: 18 HFYGNWSRRVVFCGNCHLGTCKMLCGQEGLPKKP 51 +F GN + V L LCG EGLP+ P Sbjct: 1386 NFNGNIKAKAVLPCEYQLSDLPHLCGHEGLPR*P 1487 >Plasmodium_k_strain_H|PK4.chr13|PKH_133600|Annotation|Plasmodium_kn owlesi_Sanger|(processed_transcript)GTP binding protein, putative Length = 1728 Score = 26.6 bits (57), Expect = 5.1, Method: Composition-based stats. Identities = 14/30 (46%), Positives = 18/30 (60%) Frame = -2 Query: 1 MFCSATTLLQFYFLLLLHFYGNWSRRVVFC 30 M CS T+ YFL LHF ++RV+FC Sbjct: 173 MQCSLFTVRLGYFLPSLHFMLLLTKRVLFC 84 >Plasmodium_k_strain_H|PK4.chr08|PKH_082780|Annotation|Plasmodium_know lesi_Sanger|(processed_transcript)hypothetical protein, conserved in Plasmodium species Length = 3282 Score = 26.6 bits (57), Expect = 5.5, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 31/64 (48%) Frame = -2 Query: 139 SSFCENVVVDTKSIYTCKQQGECATERVSISPKGACNNLRVILYVCSLWHAHLRGAWTHT 198 SS C N+ + KS+Y KQQ V P+G N+ ++ ++ S ++ W+ + Sbjct: 2909 SSPCSNL--NKKSVYLDKQQSR*MQALVKPPPRGDSNSDKIDFFIESKENSSRWTRWSFS 2736 Query: 199 HQLY 202 ++ Sbjct: 2735 FSIF 2724 >Plasmodium_k_strain_H|PK4.chr11.pseudo.embl|PKH_113370|Annotation|Pla smodium_knowlesi_Sanger|(processed_transcript)hypothetica l protein, conserved in Plasmodium species Length = 2019 Score = 26.2 bits (56), Expect = 8.3, Method: Composition-based stats. Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%) Frame = +1 Query: 6 TTLLQFYFL-LLLHFYGNWSRRVVFCGN 32 T L FY + LLL++YGN+S R++ N Sbjct: 1630 TMSLMFYLIHLLLNYYGNYSVRILCNAN 1713 Database: transcripts.fa Posted date: Mar 13, 2008 9:37 AM Number of letters in database: 11,152,142 Number of sequences in database: 5157 Lambda K H 0.329 0.140 0.461 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 5157 Number of Hits to DB: 4,572,490 Number of extensions: 90706 Number of successful extensions: 734 Number of sequences better than 10.0: 18 Number of HSP's gapped: 734 Number of HSP's successfully gapped: 18 Length of query: 244 Length of database: 3,717,380 Length adjustment: 89 Effective length of query: 155 Effective length of database: 3,258,407 Effective search space: 505053085 Effective search space used: 505053085 Neighboring words threshold: 13 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.8 bits) S2: 38 (19.2 bits)