[EMBOSS] How to find protein sequences in a given genome using CDS information

Guy Bottu gbottu at vub.ac.be
Wed Feb 4 08:49:01 UTC 2009


	Dear Nermin,

You can do that with coderet. At the command line, run:
coderet AAA -cdsoutseq=XXX -translationoutseq=YYY -outfile=/dev/null \
    -mrnaoutseq=/dev/null -restoutseq=/dev/null
where AAA contains one or more sequences in EMBL or GenBank format. You will get 
the coding sequences in XXX and the protein sequences in YYY; the outputs you do 
not need are discarded by sending them to /dev/null. In a GUI or dataflow system 
it should be straightforward to set the same options.
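
If you have several annotated files, the same call can be generated in a loop. 
The following is only a sketch under assumptions of mine: the function name 
coderet_cmd, the .embl extension and the output file names are hypothetical, 
and the function only prints the command line (a dry run) so that you can 
check it before running anything.

```shell
# Print the coderet command line for one input file (dry run, nothing is executed).
# Arguments: input file, CDS output file, protein output file.
# Note: file names containing spaces are not quoted in the printed command.
coderet_cmd() {
    printf 'coderet %s -cdsoutseq=%s -translationoutseq=%s -outfile=/dev/null -mrnaoutseq=/dev/null -restoutseq=/dev/null\n' "$1" "$2" "$3"
}

# Hypothetical usage: print one command per EMBL file in the current directory.
for f in *.embl; do
    [ -e "$f" ] || continue   # skip the literal pattern if no file matched
    coderet_cmd "$f" "${f%.embl}.cds.fasta" "${f%.embl}.pep.fasta"
done
```

Once the printed commands look right, you can pipe the output of the loop 
to sh to actually run them.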

	Regards,
	Guy Bottu,
	Belgian EMBnet Node



