[Bioperl-l] Protein families

Jason Stajich jason at bioperl.org
Wed Feb 11 21:16:52 UTC 2009

> The simplest way that I can think of for doing this is to first
> build a profile for the family, based on a multiple sequence
> alignment; then to align each random sequence against the profile and
> calculate an e-value. But since I don't know much about this things, I
> really can't judge whether it makes sense or is completely wrong.
> Using Bio::Tools::HMM sounded fine, but unfortunately it doesn't offer
> a method for calculating the probability of an observation sequence,
> given the profile.
I'm not entirely sure about the whole problem you describe but if you  
are using HMMER for this - "hmmsearch" does give you e-value of  
similarity of sequence to the profile - you need to do the  
hmmcalibrate beforehand though - this is covered in HMMER user manual.

> What would you suggest? Thanks in advance!
> PS: If there is a more appropriate mailing list for this sort of
> questions, please don't hesitate to educate me.
> Bruno.
>      Yahoo! Cocina
> Recetas prácticas y comida saludable
> http://ar.mujer.yahoo.com/cocina/
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l

Jason Stajich
jason at bioperl.org

More information about the Bioperl-l mailing list