You know which protein sequence corresponds to which cDNA sequence, so I think you could also: - take the start and end coordinates of the domain in the protein (from the HMMer output) - convert those into the start and end coordinates of the domain in the cDNA - use the subseq() method to extract the subsequence of the cDNA Dave