We first used our getGenbank.pl script from last time to get the FASTA sequence of the human STK24 gene. This gene has a known yeast homolog (the yeast STE20 gene). We saved the amino acid sequence of the STK24 protein in the stk24.fa file.
We wrote a new script called blast.pl to find homologous sequences for a set of supplied protein sequences given in FASTA format. This script helps solve part II of lab 6. To call the program, we send the FASTA sequence(s) to blast.pl's standard input using the following UNIX command:
perl blast.pl < stk24.fa
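To make the standard-input step concrete, here is a minimal sketch of how a Perl script might read one or more FASTA records from STDIN. It is not the actual blast.pl code: the read_fasta subroutine and the summary it prints are hypothetical, introduced only for illustration, and the sketch assumes each record starts with a ">" header line.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not taken from blast.pl): read FASTA records from a
# filehandle and return a list of [header, sequence] pairs.
sub read_fasta {
    my ($fh) = @_;
    my @records;
    while (my $line = <$fh>) {
        $line =~ s/\r?\n\z//;          # strip the line ending (UNIX or DOS)
        next if $line =~ /^\s*$/;      # skip blank lines
        if ($line =~ /^>(.*)/) {
            push @records, [$1, ''];   # a header line starts a new record
        }
        elsif (@records) {
            $line =~ s/\s+//g;         # drop whitespace inside sequence lines
            $records[-1][1] .= $line;  # append to the current record
        }
        # sequence lines before the first header are ignored
    }
    return @records;
}

# With "perl script.pl < stk24.fa", the shell connects the file to STDIN,
# so the script itself never needs to know the file name.
for my $rec (read_fasta(\*STDIN)) {
    my ($header, $seq) = @$rec;
    printf "%s\t%d residues\n", $header, length $seq;
}
```

Because the script reads from STDIN rather than opening a named file, the same code would handle a single sequence or several concatenated FASTA files without change.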
Note that the code does not work at this stage because NCBI has recently changed its output format. In the future, we will instead use the XML-based BioPerl parser so that the script does not break when NCBI updates its format.