Results
* = query obtained from UniProt instead of SelenoDB.
~ = selenoprotein that is not described in zebrafish.
This table shows, for each query, if it contains Sec, its tblastn output, the exonerate and GeneWise prediction for the protein, the T-Coffee alignment for the best prediction and the SECIS element prediction by Seblastian and SECISearch3.
Selenoproteins:
Protein ID | Specie | Sec | BLAST | Exonerate | Genwise | Contig | Gene location | T-Coffee | Seblastian | SECIS | SECIS Image |
---|---|---|---|---|---|---|---|---|---|---|---|
Sel15* | KN469434.1 | 1179-10273(-) | |||||||||
Fep15 | JACM01057682.1 | 15126-20590(-) | |||||||||
GPx1a | KN484102.1 | 99822-100807(+) | |||||||||
GPx1b | KN483187.1 | 32547-34354(-) | |||||||||
GPx2 | KN485792.1 | 6989-7944(-) | |||||||||
GPx3a | JACM01069729.1 | 5286-10682(+) | |||||||||
GPx3b | KN468990.1 | 11936-17165(-) | |||||||||
GPx4a* | JACM01009827.1 | 5919-8706(+) | |||||||||
GPx4b* | KN473080.1 | 50109-51118(-) | |||||||||
GPx7 | KN477059.1 | 299-3110(+) | |||||||||
GPx8* | KN469473.1 | 103479-105000(-) | |||||||||
DI1* | KN484083.1 | 33386-34693(+) | |||||||||
DI2 | - | - | |||||||||
DI3a | KN482120.1 | 698-1487(+) | |||||||||
DI3b | KN473016.1 | 4757-5251(+) | |||||||||
MsrA1(I) | KN474103.1 KN469776.1 JACM01079289.1 |
2382-6525 689-20313 1476-3921(-) |
|||||||||
MsrA1(II) | JACM01020992.1 | 10670-19581(-) | |||||||||
MsrA2 | JACM01013465.1 | 2360-7074(-) | |||||||||
SelH | KN473035.1 | 31436-84157(+) | |||||||||
SelI | KN475358.1 | 17239-28153(+) | |||||||||
SelJ1 | KN475703.1 | 21495-23513(+) | |||||||||
SelK | - | - | |||||||||
SelL | KN476964.1 | 67919-123684(+) | SelM | < | KN483744.1 | 13389-19197(-) | |||||
SelN | KN472865.1 | 23444-26439(-) | SelS | KN472370.1 | 23645-30103(-) | ||||||
SelO1 | KN485144.1 | 92086-97120(-) | |||||||||
SelO2* | KN481390.1 KN476330.1 |
45291-48067(+) 1432-6033(+) |
|||||||||
SelP1a | KN472076.1 | 27665-99153(-) | SelP1b | - | - | SelP~ | - | - | - | - | KN470089.1 | 1146-99228(-) | - |
SelR1a* | KN472457.1 | 3772-5119(-) | |||||||||
SelR1b | KN481551.1 | 2479-3411(+) | |||||||||
SelR2 | KN475030.1 | 22971-24766(+) | |||||||||
SelR3 | JACM01052317.1 | 809-3106(+) | |||||||||
SelT1 | < | KN472607.1 | 224964-226784(-) | ||||||||
SelT1b | - | - | |||||||||
SelT2 | KN472830.1 | 1757-4070(+) | |||||||||
SelU1a (I) |
KN484292.1 | 53686-52121(+) | |||||||||
SelU1a (II) |
KN471901.1 | 38923-42068(+) | SelU2 | KN483383.1 | 25145-35531(+) | ||||||
SelU3 | KN479866.1 | 23728-64520(-) | |||||||||
SelW1 | - | - | |||||||||
SelW2a | JACM01031559.1 | 123-2617(+) | |||||||||
SelW2b | - | - | |||||||||
TR2 | KN477299.1 | 10548-67353(-) 12313-143777(-) |
|||||||||
TR3 | KN483500.1 KN474115.1 |
57978-61466(+) 52548-55858(-) |
Machinery
This table shows the presence of the genes encoding the machinery proteins.
Protein ID | Specie | Sec | BLAST | Exonerate | Genwise | Contig | Gene location | T-Coffee | Seblastian | SECIS | SECIS Image |
---|---|---|---|---|---|---|---|---|---|---|---|
SecS | KN484638.1 | 1409-12679(+) | |||||||||
eEFsec | KN477770.1.1 | 77439-98118(+) | |||||||||
PSTK | KN474861.1 | 19572-21515(+) | |||||||||
SPS | KN475839.1 | 46190-48295(+) | |||||||||
SPS2 | KN484547.1 KN469560.1 |
11229-18200(+) 12772-20105(-) | |||||||||
SBP2 | KN470587.1 | 4404-75221(+) | |||||||||
SBP2L* | KN472259.1 | 13351-23647(-) | |||||||||
Trnau1apa(I) | KN468817.1 | 40105-44000(-) | C2H6orf52 | KN484243.1 | 6525-9576(-) |
Result Analysis of Selenoproteins and Machinery
Sel15
Sel15 protein from zebrafish was not well annotated in SelenoDB so, the homology-based approach was done by using the one obtained in UniProt. In P. schlosseri, Sel15 is located in the contig KN469434.1 and its gene location is between the position 1179 and the position 10273, in the negative strand. The protein structure is detailed below:
- Exon 1: from position 10226 to position 10273.
- Exon 2: from position 8212 to position 8379.
- Exon 3: from position 4610 to position 4673. This exon contains a selenocysteine residue.
- Exon 4: from position 2966 to position 3015.
- Exon 5: from position 1179 to position 1307.
A SECIS element and a selenoprotein were predicted by using Seblastian.
Fep15
Fep 15 protein is in the contig JACM01057682.1. The gene location is between the position 15126 and the position 20590, in the negative strand. The structure of this protein (predicted by exonerate) is described below:
- Exon 1: from position 20427 to position 20590.
- Exon 2: from position 20029 to position 20107. There is a selenocystein residue aligned with the one found in the zebrafish Fep15 protein.
- Exon 3: from position 15126 to position 15266.
This selenoprotein has a total number of 2 selenocystein residues. By using Seblastian a SECIS element was predicted and also Fep15 selenoprotein that in this case, had 4 exons instead of 3.
GPx1a
Gpx1a protein is codified by the contig KN484102.1 in P. schlosseri. The gene positions are between 99822 and 100807 in the positive strand. This protein has 2 exons:
- Exon 1: from position 99822 to position 100040. This exon contains the U residue.
- Exon 2: from position 100472 to position 100807.
A SECIS element and a selenoprotein were predicted by using Seblastian.
GPx1b
The contig KN483187.1 encodes for the Gpx1b protein in P. schlosseri. The positions are between 32547 and 34354 in the negative strand. Gpx1b has 2 exons:
- Exon 1: from position 34145 to position 34354. This exon has a selenocysteine residue.
- Exon 2: from position 32547 to position 32894.
A SECIS element and a selenoprotein were predicted by using Seblastian.
GPx2
The protein Gpx2 is encoded by the positions between 6989 and 7944 in the negative strand of the contig KN485792.1 from P. schlosseri. Gpx2 has two exons:
- Exon 1: from position 7723 to position 7944. This exon has a selenocysteine residue.
- Exon 2: from position 6989 to position 7333.
A SECIS element and a selenoprotein were predicted by using Seblastian.
GPx3a
The protein Gpx3a is encoded by the contig JACM01069729.1, between positions 5287 and 10682 in the positive strand. There are 4 exons in Gpx3a:
- Exon 1: from position 5287 to position 5464. This exon contains the selenocystein residue of the protein.
- Exon 2: from position 6551 to position 6668.
- Exon 3: from position 8365 to position 8464.
- Exon 4: from position 10470 to position 10682.
A SECIS element and a selenoprotein were predicted by using Seblastian.
GPx3b
This protein is codifed by the contig KN468990.1, between positions 11936 and 17165 in the negative strand. The predicted protein in P. schlosseri has 6 exons:
- Exon 1: from position 13401 to position 13511. This exon contains the selenocystein residue of the protein.
- Exon 2: from position 12542 to position 12635.
- Exon 3: from position 11300 to position 11430.
- Exon 4: from position 9220 to position 9351.
- Exon 5: from position 17116 to position 17165.
- Exon 6: from position 11936 to position 12083.
Seblastian could not predict any selenoprotein. Nevertheless, after using SECISearch3, a SECIS element was obtained.
GPx4a
This protein is encoded by the contig JACM01009827.1, between the positions 5919 and 8706 from the positive strand. Exonerate predicted seven exons for Gx4a:
- Exon 1: from position 5919 to position 5969.
- Exon 2: from position 6147 to position 6241.
- Exon 3: from position 6324 to position 6468. This exon contains a selenocysteine residue.
- Exon 4: from position 6637 to position 6788.
- Exon 5: from position 6880 to position 6904.
- Exon 6: from position 8456 to position 8515.
- Exon 7: from position 8677 to position 8706.
A SECIS element and a selenoprotein were predicted by using Seblastian. However, the selenoprotein obtained had only 6 exons.
GPx4b
The protein Gpx4b of P. schlosseri is in the contig KN473080.1, between the positions 50109 and 51118, in the reverse strand. Exonerate has predicted 6 exons for this:
- Exon 1: from position 51024 to position 51118.
- Exon 2: from position 50807 to position 50951. This exon contains the selenocysteine residue, aligned with the one in the Gpx4b from zebrafish.
- Exon 3: from position 50569 to position 50720.
- Exon 4: from position 50351 to position 50375.
- Exon 5: from position 50219 to position 50278.
- Exon 6: from position 50109 to position 50138.
A SECIS element and a selenoprotein were predicted by using Seblastian. The obtained selenoprotein had 7 exons and the U residue was in the number 3.
GPx7
To do the homology-based approach the zebrafish Gpx7 protein could not be used because it was not starting with methionine (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison. The protein Gpx7 of P. schlosseri is in the contig KN477059.1, between the positions 299 and 3110, in the possitive strand. The protein has the structure detailed below:
- Exon 1: from position 242 to position 376.
- Exon 2: from position 2248 to position 2509.
- Exon 3: from position 2950 to position 3110.
As the one in zebrafish, Gpx7 from P. schlosseri is not a selenoprotein because it does not contain any Sec residue. After using Seblastian and SECISearch3, no results were obtained.
GPx8
Gpx8 protein is encoded by the contig number KN469473.1, between the positions 103479 and 105000 from the negative strand. It has 3 exons:
- Exon 1: from position 104794 to position 105000.
- Exon 2: from position 104377 to position104638.
- Exon 3: from position 103479 to position 103639.
As the one in zebrafish, Gpx8 from P. schlosseri is not a selenoprotein because it does not contain any Sec residue. For this reason, Seblastian did not provide any prediction. However, by using SECISearch3, a SECIS element was found.
DI1
To do the homology-based approach, the sequence of this protein was obtained in UniProt because the one in the SelenoDB was not properly annotated. DI1 in P. schlosseri is located in the forward strand of the contig KN484083.1, between the positions 33386 and 34693. The structure of the protein is detailed below:
- Exon 1: from position 33386 to position 33716.
- Exon 2: from position 33789 to position 33932. This exon contains an aligned selenocystein with the one found in zebrafish DI1 protein.
- Exon 3: from position 34117 to position 34316.
- Exon 4: from position 34631 to position 34693.
By using Seblastian, two SECIS element were predicted and also a selenoprotein.
DI2
The methodology used in this study could not predict DI2 in P. schlosseri.
DI3a
The protein DI3a of P. schlosseri is in the contig KN482120.1, between the positions 698 and 1487, in the forward strand. This protein has two exons:
- Exon 1: from position 698 to position 1095.
- Exon 2: from position 1157 to position 1487. The selenocystein residue is located in this exon.
A SECIS element and a selenoprotein were predicted by using Seblastian.
DI3b
The homology between DI3b in P. schlosseri was searched for zebrafish even though it was not well annotated because this protein is not find in Homo sapiens. due to the fact that the first one was not well annotated. However, the results obtained after doing the blast were different in each case. This is why the further analysis was done with the zebrafish DI3b protein.
DI3b protein is found in the positive strand of the contig KN473016.1 from P. schlosseri. Its gene location is between the position 4865 and 5308, in the positive strand. DI3b has only one exon (from position 4586 to position 5317).
A SECIS element and a selenoprotein were predicted by using Seblastian.
MsrA1(I)
Protein MsrA1 is located in contig number KN469776.1 between positions 689 and 20313 of the positive strand, in contig JACM01079289.1 between positions 1476 and 3921 and in contig KN474103.1 between positions 2382 and 6525 both in negative strand. The gene has 9 exons:
- Exon 1: from position 689 to position 764
- Exon 2: from position 3806 to position 3874
- Exon 3: from position 6862 to position 6981
- Exon 4: from position 12254 to position 12277
- Exon 5: from position 20240 to position 20313
- Exon 6: from position 3814 to position 3921
- Exon 7: from position 1476 to position 1502
- Exon 8: from position 6457 to position 6525
- Exon 9: from position 2382 to position 2540
This protein was predicted blasting P. schlosseri genome against the human MsrA1 protein found in selenoDB, since the zebrafish MsrA1 protein did not start with a methionine residue meaning it was not correctly annotated. As its homologue in both human and zebrafish MsrA1 does not include any selenocystein. For this reason, any Seblastian prediction or SECIS element were obtained.
MsrA1(II)
Protein MsrA1 (II) is codified by a gene located in contig number JACM01020992.1 between positions 10670 and 19581 in the negative strand. The gene contains 2 exons.
- Exon 1: from position 19392 to position 19581
- Exon 2: from position 10670 to position 10674
The zebrafish MsrA1 protein could not be taken as a reference to predict MsrA1 (II) in P. sclosseri, since it was not correctly annotated. For this reason, the homology-based approach was done with the Human MsrA1 protein obtained from selenoDB. MsrA1 (II) has not selenocysteine residues. Thus, neither selenoprotein prediction nor SECIS element were found.
MsrA2
Protein MsrA2 is located in contig number JACM01013465.1 between positions 2360 and 7074 of the negative strand. The gene contains 6 exons:
- Exon 1: from position 6942 to position 7074
- Exon 2: from position 5121 to position 5189
- Exon 3: from position 4842 to position 4961
- Exon 4: from position 4199 to position 4303
- Exon 5: from position 3057 to position 3163
- Exon 6: from position 2360 to position 2515
As its homologue in zebrafish this protein does not include any selenocystein. For this reason, any Seblastian prediction or SECIS element were obtained.
SelH
SelH protein, in P. schlosseri, is codified by the contig KN473035.1, between the positions 31436 and 84157 of the positive strand. This protein has 4 exons:
- Exon 1: from position 31436 to position 31528.
- Exon 2: from position 82369 to position 82400.
- Exon 3: from position 83716 to position 83858. This exon contains an aligned selenocystein with the one found in zebrafish SelH protein.
- Exon 4: from position 84054 to position 84157.
A SECIS element and a selenoprotein were predicted by using Seblastian. The obtained selenoprotein had 3 exons and the U residue was in the number 3.
SelI
To do the homology-based approach the zebrafish SelI protein could not be used because it was not well annotated (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison. Nevertheless, in this case the Sec residue was not aligned with the one in the sequence of Human Selenoprotein.
SelJ1
SelJ1 protein, in P. schlosseri, is codified by the contig KN475703.1, between the positions 21495 and 23513 of the positive strand. This protein has 9 exons:
- Exon 1: from position 21495 to position 21561.
- Exon 2: from position 21667 to position 21843.
- Exon 3: from position 21933 to position 22031.
- Exon 4: from position 22325 to position 22417.
- Exon 5: from position 22500 to position 22656.
- Exon 6: from position 22725 to position 22840.
- Exon 7: from position 23003 to position 23080. This exon contains an aligned selenocystein with the one found in zebrafish SelJ1 protein.
- Exon 8: from position 23161 to position 23300.
- Exon 9: from position 23418 to position 23513.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelK
In this case, the Sec residue was not aligned with the one in the sequence of Zebrafish Selenoprotein.Therefore, no SelK homolog could be found in P. schlosseri'sgenome.
SelL
SelL protein, in P. schlosseri, is codified by the contig KN476964.1, between the positions 67919 and 12368 of the positive strand. This protein has 12 exons:
- Exon 1: from position 67919 to position 67965.
- Exon 2: from position 104478 to position 104488.
- Exon 3: from position 104953 to position 104979.
- Exon 4: from position 112656 to position 112662.
- Exon 5: from position 116865 to position 116915.
- Exon 6: from position 116985 to position 117038.
- Exon 7: from position 117159 to position 117204.
- Exon 8: from position 119561to position 119688.
- Exon 9: from position 119795 to position 119909. This exon contains two aligned selenocysteins with the two found in zebrafish SelL protein.
- Exon 10: from position 122855 to position 122977.
- Exon 11: from position 123058 to position 123195.
- Exon 12: from position 123589 to position 123684.
Seblastian could not predict any selenocysteine but a SECIS element was obtained by SECISearch3.
SelM
To do the homology-based approach the zebrafish SelM protein could not be used because it was not starting with methionine (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison. The results point that, the gene location of this protein in P. schlosseri is between the position 13389 and the position 19197, in the negative strand from the contig KN483744.1. SelM protein in this organism has five exons:
- Exon 1: from position 19102 to position 19197.
- Exon 2: from position 16922 to position 16957. This exon contains the selenocysteine residue.
- Exon 3: from position 16415 to position 16449.
- Exon 4: from position 15563 to position 15641.
- Exon 5: from position 13389 to position 13511.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelN
SelN protein, in P. schlosseri, is codified by the contig KN472865.1, between the positions 23444 and 26439 of the negative strand. This protein has 12 exons:
- Exon 1: from position 26245 to position 26439.
- Exon 2: from position 25659 to position 25776.
- Exon 3: from position 25448 to position 25578.
- Exon 4: from position 25129 to position 25341.
- Exon 5: from position 24939 to position 25063.
- Exon 6: from position 24708 to position 24845.
- Exon 7: from position 24538 to position 24619.
- Exon 8: from position 24270 to position 24458.
- Exon 9: from position 24084 to position 24189. This exon contains an aligned selenocystein with the one found in zebrafish SelN protein.
- Exon 10: from position 23857 to position 23969.
- Exon 11: from position 23682 to position 23783.
- Exon 12: from position 23444 to position 23602.
Seblastian could not predict any selenocysteine but a SECIS element was obtained by SECISearch3.
SelO1
SelO1 protein, in P. schlosseri, is codified by the contig KN485144.1, between the positions 92086 and 97120 of the negative strand. This protein has 9 exons:
- Exon 1: from position 96570 to position 97120.
- Exon 2: from position 95146 to position 95349.
- Exon 3: from position 94898 to position 95075.
- Exon 4: from position 94575 to position 94705.
- Exon 5: from position 94015 to position 94292.
- Exon 6: from position 93768 to position 93930.
- Exon 7: from position 93226 to position 93411.
- Exon 8: from position 92917 to position 93076.
- Exon 9: from position 92086 to position 92301. This exon contains an aligned selenocystein with the one found in zebrafish SelO1 protein.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelO2
SelO2 protein in P. schlosseri is encoded by two different contigs: KN481390.1 (between the positions 45291 and 48067, in the positive strand) and KN476730.1 (between positions 1432 and 6033, in the positive strand).
The contig KN481390.1 encodes 3 exons of SelO2:
- Exon 1: from position 45291 to position 45551.
- Exon 2: from position 46374 to position 46545.
- Exon 3: from position 47838 to position 48067.
The contig KN476730.1 encodes 8 exons of TR3:
- Exon 4: from position 1432 to position 1614.
- Exon 5: from position 2204 to position 2334.
- Exon 6: from position 2885 to position 3159.
- Exon 7: from position 3830 to position 3886.
- Exon 8: from position 3926 to position 4031.
- Exon 9: from position 4391 to position 4576.
- Exon 10: from position 5118 to position 5274.
- Exon 11: from position 5974 to position 6033.
SelO2 doesn’t have any selenocystein to align with the one in the sequence of Zebrafish Selenoprotein. After using Seblastian and SECISearch3, no results were obtained for KN476730.1 contig. As for KN481390.1 contig, Seblastian could not predict any selenocysteine but a SECIS element was obtained by SECISearch3.
SelP1a
SelP1A protein, in P. schlosseri, is codified by the contig KN472076.1, between the positions 27665 and 99153 of the negative strand. This protein has 10 exons:
- Exon 1: from position 99011 to position 99153. This exon contains an aligned selenocystein with the one found in zebrafish SelR1b protein.
- Exon 2: from position 97721 to position 97933.
- Exon 3: from position 97434 to position 97585.
- Exon 4: from position 96986 to position 97133.
- Exon 5: from position 95945 to position 96014.
- Exon 6: from position 88555 to position 88562.
- Exon 7: from position 81947 to position 81971.
- Exon 8: from position 29479 to position 29563.
- Exon 9: from position 29088 to position 29106.
- Exon 10: from position 27665 to position 27704.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelP1b
None of the hits met the needed requirements to be considered valid.
SelP~
SelP~ protein, in P. schlosseri, is codified by the contig KN470089.1, between the positions 61666 and 62548 of the negative strand. This protein has 3 exons:
- Exon 1: from position 61666 to position 61850.
- Exon 2: from position 62142 to position 62354.
- Exon 3: from position 62431 to position 62548.
Two SECIS elements and the selenoprotein were predicted by using Seblastian.
SelR1a
SelR1A protein, in P. schlosseri, is codified by the contig KN472457.1, between the positions 3772 and 5119 of the negative strand. This protein has 3 exons:
- Exon 1: from position 5065 to position 5119.
- Exon 2: from position 3976 to position 4118.
- Exon 3: from position 3772 to position 3894. This exon contains an aligned selenocystein with the one found in zebrafish Sel1A protein.
SelR1b
SelR1b protein, in P. schlosseri, is codified by the contig KN481551.1, between the positions 2479 and 3411 of the positive strand. This protein has 6 exons:
- Exon 1: from position 2479 to position 2533.
- Exon 2: from position 3018 to position 3160.
- Exon 3: from position 3298 to position 3411. This exon contains an aligned selenocystein with the one found in zebrafish SelR1b protein.
- Exon 4: from position 4267 to position 4321.
- Exon 5: from position 4403 to position 4545.
- Exon 6: from position 4836 to position 4949.
A SECIS element and a selenoprotein were predicted by using Seblastian. The obtained selenoprotein had 3 exons and the U residue was in the number 3.
SelR2
SelR2 protein, in P. schlosseri, is codified by the contig KN475030.1, between the positions 22971 and 24766 of the positive strand. This protein has 5 exons:
- Exon 1: from position 22971 to position 23082.
- Exon 2: from position 23658 to position 23755.
- Exon 3: from position 24220 to position 24296.
- Exon 4: from position 24443 to position 24590.
- Exon 5: from position 24668 to position 24766.
SelR2 does not have any selenocysteine residue. However, this protein has 6 cysteine residues that are aligned with the ones found in the zebrafish SelR2 protein. After using Seblastian and SECISearch3, no results were obtained.
SelR3
SelR3 protein, in P. schlosseri, is codified by the contig JACM01052317.1, between the positions 809 and 3106 of the positive strand. This protein has 5 exons:
- Exon 1: from position 809 to position 921.
- Exon 2: from position 1030 to position 1107.
- Exon 3: from position 1182 to position 1210.
- Exon 4: from position 2482 to position 2579.
- Exon 5: from position 2990 to position 3106.
SelR3 does not have any selenocysteine residue. However, this protein has 5 cysteine residues that are aligned with the ones found in the zebrafish SelR3 protein. After using Seblastian and SECISearch3, no results were obtained.
SelS
P schlosseri SelS protein is codified by the contig KN472370.1, between positions 23645 and 30103 in the negative strand. It has 6 exons:
- Exon 1: from position 30070 to position 30103. This exon has a selenocysteine residue.
- Exon 2: from position 25593 to position 25727.
- Exon 3: from position 25408 to position 25514.
- Exon 4: from position 24486 to position 24581.
- Exon 5: from position 24327 to position 24393.
- Exon 6: from position 23645 to position 23721.
However, the sequence of zebrafish SelS protein does not contain any Sec residue but has aligned cysteine residues with P. schlosseri SelS. Seblastian did not provide any prediction. Nevertheless, by using SECISearch3, a SECIS element was found.
SelT1
This protein is encoded in the contig KN472607.1, between positions 224964 and 226784 in the negative strand. P. schlosseri SelT1 protein has 5 exons:
- Exon 1: from position 226654 to position 226784.
- Exon 2: from position 226043 to position 226153. This exon contains the selenocysteine residue of SelT1 protein.
- Exon 3: from position 225613 to position 225739.
- Exon 4: from position 225129 to position 225216.
- Exon 5: from position 224964 to position 225031.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelT1b
By using the methodology that has been described, no SelW1 could be predicted in P. schlosseri.
SelT2
This protein is codified in the contig KN472830.1, between positions 1757 and 4070 in the positive strand. In P. schlosseri, SelT2 has 5 exons:
- Exon 1: from position 1757 to position 1899.
- Exon 2: from position 1945 to position 1986.
- Exon 3: from position 2593 to position 2703. This exon contains the selenocysteine residue aligned with the one in zebrafish SelT2 protein.
- Exon 4: from position 3111 to position 3249.
- Exon 5: from position 3984 to position 4070.
After using Seblastian and SECISearch3, no results were obtained.
SelU1a(I)
To do the homology-based approach the zebrafish SelU1A protein could not be used because it was not starting with methionine (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison. In this case, after using the methodology already described, two possible hits were obtained:SelU1A(I)
This protein of P. schlosseri is encoded by the contig KN484292.1, between the positions 53686 and 52121 in the positive strand. The structure of the protein is detailed below:
- Exon 1: from position 55965 to position 56121.
- Exon 2: from position 54947 to position 55038. There is a selenocysteine residue in this exon.
- Exon 3: from position 54738 to position 54878.
- Exon 4: from position 54079 to position 54243.
- Exon 5: from position 53686 to position 53766.
- Exon 6: from position 63059 to position 63233.
- Exon 7: from position 62533 to position 62624.
- Exon 8: from position 62320 to position 62460.
- Exon 9: from position 62072 to position 62236.
- Exon 10: from position 61919 to position 61999.
A SECIS element and a selenoprotein were predicted by using Seblastian. However, the obtained selenoprotein had 5 exons.
SelU1a(II)
SelU1A (II) protein is codified by the contig KN471901.1, between the positions 38923 and 42068 in the positive strand. It has 5 exons:
- Exon 1: from position 38923 to position 39067.
- Exon 2: from position 40010 to position 40101. This exon contains a Sec residue.
- Exon 3: from position 40660 to position 40800.
- Exon 4: from position 41751 to position 41915.
- Exon 5: from position 41988 to position 42068.
A SECIS element and a selenoprotein were predicted by using Seblastian.
SelU2
To do the homology-based approach the zebrafish SelU2 protein could not be used because it was not well annotated (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison.
In P. schlosseri, the SelU2 protein is codified by the contig KN483383.1, between the positions 25145 and 35531 in the positive strand. It has 7 exons:
- Exon 1: from position 25145 to position 25193.
- Exon 2: from position 30913 to position 31031.
- Exon 3: from position 31195 to position 31263.
- Exon 4: from position 33175 to position 33228.
- Exon 5: from position 33320 to position 33425.
- Exon 6: from position 35186 to position 35320.
- Exon 7: from position 35404 to position 35531.
SelU2 protein does not contain any selenocysteine residue but it has 3 cysteine residues aligned with the same amino acid in the zebrafish protein. After using Seblastian and SECISearch3, no results were obtained.
SelU3
To do the homology-based approach the zebrafish SelU3 protein could not be used because it was not well annotated (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison. SelU3 protein is codified by the contig KN479866.1, between positions 23728 and 64520 in the negative strand. It has 8 exons:
- Exon 1: from position 64439 to position 64520.
- Exon 2: from position 29869 to position 29933.
- Exon 3: from position 28911 to position 29115.
- Exon 4: from position 28434 to position 28485.
- Exon 5: from position 28289 to position 28352.
- Exon 6: from position 28068 to position 28143.
- Exon 7: from position 26946 to position 27064.
- Exon 8: from position 23728 to position 23742.
This protein has 3 cysteine residues aligned with the same amino acid in the zebrafish SelU3 protein. In addition, the P. schlosseri SelU3 protein has four Sec insertions among its sequence that are not found in the zebrafish one.
Seblastian did not provide any prediction. However, by using SECISearch3, a SECIS element was found.
SelW1
By using the methodology that has been described, no SelW1 could be predicted in P. schlosseri.
SelW2a
In P. schlosseri, SelW2a protein is encoded by the contig JACM01031559.1, between positions 123 and 2617 in the positive strand. This protein has 4 exons:
- Exon 1: from position 123 to position 151.
- Exon 2: from position 414 to position 511.
- Exon 3: from position 589 to position 665.
- Exon 4: from position 2537 to position 2617.
Seblastian did not provide any selenoprotein prediction. However, by using SECISearch3, a SECIS element was found.
SelW2b
SelW2b protein could not be predicted for P. schlosseri by following the detailed steps of this study.
TR2
To do the homology-based approach the zebrafish TR2 protein could not be used because it was not starting with methionine (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison.
TR2 protein is codified by the contig KN477299.1 in P. schlosseri. However, it is not encoded by one gene only but two. The first gene is located between the positions 10548 and 67353, in the negative strand. It encodes 18 exons:
- Exon 1: from position 67338 to position 67353.
- Exon 2: from position 23009 to position 23074.
- Exon 3: from position 22876 to position 22932.
- Exon 4: from position 22644 to position 22788.
- Exon 5: from position 22412 to position 22486.
- Exon 6: from position 22207 to position 22285.
- Exon 7: from position 21994 to position 22056.
- Exon 8: from position 21796 to position 21866.
- Exon 9: from position 19465 to position 19484.
- Exon 10: from position 19258 to position 19349.
- Exon 11: from position 19002 to position 19176.
- Exon 12: from position 14078 to position 14214.
- Exon 13: from position 12202 to position 12297.
- Exon 14: from position 11940 to position 12032.
- Exon 15: from position 11040 to position 11111.
- Exon 16: from position 10748 to position 10845.
- Exon 17: from position 10548 to position 10674.
- Exon 18: from position 143715 to position 143777.
The second gene is located between the positions 12313 and 143777, also in the negative strand. It encodes for the las six exons of the TR2 protein:
- Exon 19: from position 135349 to position 135404.
- Exon 20: from position 67003 to position 67029.
- Exon 21: from position 64534 to position 64551.
- Exon 22: from position 60825 to position 60865 .
- Exon 23: from position 49221 to position 49228.
- Exon 24: from position 12313 to position 12357. This exon contains the selenocysteine residue.
Two SECIS elements were predicted by using SECISearch3. Nevertheless, Seblastian could not predict a selenoprotein because TR2 is encoded by two different contigs.
TR3
TR3 protein in P. schlosseri is encoded by two different contigs: KN483500.1 (between the positions 57978 and 61466, in the positive strand) and KN474115.1 (between positions 52548 and 55858, in the negative strand). The contig KN483500.1 encodes 10 exons of TR3:
- Exon 1: from position 57978 to position 58094.
- Exon 2: from position 58304 to position 58364.
- Exon 3: from position 58977 to position 59086.
- Exon 4: from position 59275 to position 59382.
- Exon 5: from position 59493 to position 59565.
- Exon 6: from position 59634 to position 59753.
- Exon 7: from position 59844 to position 59992.
- Exon 8: from position 60070 to position 60185.
- Exon 9: from position 60350 to position 60575.
- Exon 10: from position 61350 to position 61466.
The contig KN474115.1 encodes 6 exons of TR3:
- Exon 11: from position 55858 to position 55955.
- Exon 12: from position 54782 to position 54938.
- Exon 13: from position 54315 to position 54422.
- Exon 14: from position 53975 to position 54070.
- Exon 15: from position 53753 to position 53887.
- Exon 16: from position 52548 to position 52613. This exon contains the selenocysteine residue.
A SECIS element was predicted by using SECISearch3. Nevertheless, Seblastian could not predict a selenoprotein.
SecS
SecS protein is encoded in the contig KN484638.1, between positions 1409 and 12679, in the positive strand. The structure of the protein is detailed below:
- Exon 1: from position 1409 to position 1522.
- Exon 2: from position 1595 to position 1749.
- Exon 3: from position 3465 to position 3583.
- Exon 4: from position 3692 to position 3850.
- Exon 5: from position 4943 to position 5096.
- Exon 6: from position 5849 to position 5951.
- Exon 7: from position 6062 to position 6191.
- Exon 8: from position 7120 to position 7211.
- Exon 9: from position 9239 to position 9332.
- Exon 10: from position 10020 to position 10110.
- Exon 11: from position 12460 to position 12679.
It does not have any selenocysteine residue. However, this protein has 12 cysteine residues that are aligned with the ones found in the zebrafish SecS protein. After using Seblastian and SECISearch3, no results were obtained.
eEFsec
The protein eEFsec is located in the contig KN477770.1 between positions 77439 and 98118, in the positive strand. The gene codifies 7 exons:
- Exon 1: from position 77439 to position 77712.
- Exon 2: from position 80059 to position 80266.
- Exon 3: from position 81459 to position 81555.
- Exon 4: from position 82659 to position 82823.
- Exon 5: from position 84887 to position 85510.
- Exon 6: from position 91040 to position 91196.
- Exon 7: from position 97940 to position 98118.
eEFsec is not a selenoprotein: it does not contain any selenocysteine. For this reason, Seblastian could not predict a selenoprotein. However, SECISearch3 predicted a SECIS element.
PSTK
The protein PSTK of P. schlosseri is in the contig KN474861.1, between the positions 19572 and 21515, in the negative strand. It has 3 exons:
- Exon 1: from position 21369 to position 21515.
- Exon 2: from position 20490 to position 20733.
- Exon 3: from position 19572 to position 19708.
It does not contain any Sec residue and no SECIS element or Seblastian prediction could be obtained for PSTK. However, it has 5 cysteine residues that are aligned with the same amino acid in the zebrafish protein.
SBP2
This protein is codified by the contig KN472259.1, between positions 13351 and 23647 in the negative strand. SBP2 in P. schlosseri has 10 exons:
- Exon 1: from position 23404 to position 23647.
- Exon 2: from position 22966 to position 23001.
- Exon 3: from position 22551 to position 22703.
- Exon 4: from position 19525 to position 19776.
- Exon 5: from position 18333 to position 18448.
- Exon 6: from position 17203 to position 17313.
- Exon 7: from position 16195 to position16284.
- Exon 8: from position 15476 to position 15643.
- Exon 9: from position 14024 to position 14108.
- Exon 10: from position 13351 to position 13421.
SBP2 does not have any selenocysteine residue. However, this protein has 5 cysteine residues that are aligned with the ones found in the zebrafish SBP2 protein. Seblastian could not predict any selenocysteine but a SECIS element was obtained by SECISearch3.
SBP2L
To do the homology-based approach the zebrafish SBP2L protein could not be used because it was not well annotated (neither in SelenoDB or UniProt). This is the reason why the sequence from Homo sapiens, obtained from SelenoDB, was used to do the comparison.
SBP2L protein, in P. schlosseri, is codified by the contig KN470587.1, between the positions 4404 and 75221 of the positive strand. This protein has 12 exons:
- Exon 1: from position 4404 to position 4415.
- Exon 2: from position 7435 to position 7461.
- Exon 3: from position 52542 to position 52670.
- Exon 4: from position 52745 to position 52834.
- Exon 5: from position 53137 to position 53266.
- Exon 6: from position 53339 to position 53499.
- Exon 7: from position 53944 to position 54058.
- Exon 8: from position 54480 to position 54648.
- Exon 9: from position 54735 to position 54955.
- Exon 10: from position 55064 to position 55218.
- Exon 11: from position 55293 to position 55379.
- Exon 12: from position 75186 to position 75221.
SBP2L does not have any selenocysteine residue. However, this protein has 4 cysteine residues that are aligned with the ones found in the zebrafish SBP2L protein. After using Seblastian and SECISearch3, no results were obtained.
SPS
In P. schlosseri, SPS is aligned in the contig KN475839.1, between the positions 46190 and 48295, in the positive strand. This protein has 9 exons:
- Exon 1: from position 46190 to position 46285.
- Exon 2: from position 46363 to position 46459.
- Exon 3: from position 46811 to position 46914.
- Exon 4: from position 46981 to position 47088.
- Exon 5: from position 47163 to position 47317.
- Exon 6: from position 47411 to position 47501.
- Exon 7: from position 47591 to position 47690.
- Exon 8: from position 47766 to position 47978.
- Exon 9: from position 48084 to position 48295.
This protein does not include any selenocysteine residue but it has 9 cysteine residues well-aligned with the ones that are found in zebrafish SPS protein. A SECIS element was predicted by using SECISearch3. Nevertheless, Seblastian could not predict a selenoprotein.
SPS2
The protein SPS2 is codified by 2 different contigs: KN484547.1 (between the positions 11229 and 18200, in the positive strand) and KN469560.1 (between the positions 1277 and 20105, in the negative strand). The contig KN484547.1 encodes the first part of the protein:
- Exon 1: from position 11229 to position 11490, that contains the selenocysteine residue.
- Exon 2: from position 11841 to position 11864.
- Exon 3: from position 14667 to position 14776.
- Exon 4: from position 15689 to position 15796.
- Exon 5: from position 16369 to position 16523.
- Exon 6: from position 17005 to position 17095.
- Exon 7: from position 17315 to position 17414.
- Exon 8: from position 17986 to position 18200.
The contig KN469560.1 encodes the last part of the protein:
- Exon 9: from position 19928 to position 20105.
- Exon 10: from position 12772 to position 12800.
A SECIS element was predicted by using SECISearch3. However, Seblastian could not predict a selenoprotein.
Trnau1apa
Trnau1apa protein is encoded by the contig number KN468817.1, between the positions 40105 and 44000 from the negative strand. The predicted one has 6 exons:
- Exon 1: from position 43977 to position 44000.
- Exon 2: from position 43090 to position 43187.
- Exon 3: from position 42617 to position 42716.
- Exon 4: from position 41938 to position 41987.
- Exon 5: from position 40774 to position 40905.
- Exon 6: from position 40105 to position 40222.
It does not have any selenocysteine residue. However, this protein has 3 cysteine residues that are aligned with the ones found in the zebrafish Tranu1apa protein. After using Seblastian and SECISearch3, no results were obtained.
C2H6orf52
This protein is encoded by the contig number KN484243.1, between the positions 6525 and 9576 from the negative strand in P. schlosseri. The structure of is detailed below:
- Exon 1: from position 9541 to position 9576.
- Exon 2: from position 9286 to position 9383.
- Exon 3: from position 8731 to position 8830.
- Exon 4: from position 8042 to position 8094.
- Exon 5: from position 7848 to position 7976.
- Exon 6: from position 7368 to position 7484.
- Exon 7: from position 7163 to position 7294.
- Exon 8: from position 6976 to position 7084.
- Exon 9: from position 6842 to position 6884.
- Exon 10: from position 6525 to position 6640.
It does not contain any Sec residue and no SECIS element or Seblastian prediction could be obtained for this protein. However, it has 4 cysteine residues that are aligned with the same amino acid in the zebrafish protein.