SEQUENCES ALIGNMENT PROGRAM
Created by Ariadna Escalona and Anna Travesa
This program uses a dynamic programming algorism to align any two sequences in fasta format. It allows to choose among four different substitution matrices(PAM30, PAM70, BLOSUM62 and BLOSUM80) and it returns the optimum alignment with its corresponding score in fasta format.
We download the two sequences and the program transfers them into two vectors in which each position contains an aminoacid. Simultaneously, a hash is constructed based on the values from the substitution matrix chosen. The choice of the matrix should be made according to the phylogenetical relativeness between the two sequences that are to be aligned. So, the user of the program should choose either PAM30 or BLOSUM80 if dealing with phylogenetically close sequences, and BLOSUM62 or PAM70 if dealing with not so close sequences.
Afterwards, an alignment matrix is constructed, in which the first cell will be filled in with a zero and the rest of the matrix will contain the partial alignment scores calculated according to the following formula:
Si,j = maximum between
- Si-1, j-1 + s(i,j)
- Si-1, j + g
- Si, j-1 + g
Program
Click here
to download the program to run it at the Linux terminal.
If you wish to see it in html format, click
here.
For any questions or want further information send an email to the authors:
anna.travesa01@campus.upf.edu
ariadna.escalona01@campus.upf.edu