Extractfeat options
Marc Logghe
Marc.Logghe at devgen.com
Wed Apr 2 08:27:23 UTC 2003
Hi Burke,
> I am having a bit of trouble extracting just genes from a GenBank file.
> I have tried the obvious options to no avail. I want to get JUST the gene
> information but I always get gene and CDS as below. How do I do that?
You should set the -type argument to gene, like this:
extractfeat -filter -type gene test.gb | less
>
> Additionally, can I get the gene name instead of the stuff below?
I don't know how to do this with EMBOSS; I'd use BioPerl for that:
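#!/usr/bin/perl
use strict;
use warnings;
use Bio::SeqIO;

# Usage: script.pl test.gb
my $file = shift or die "Usage: $0 file.gb\n";
my $io = Bio::SeqIO->new(-format => 'genbank', -file => $file);
while (my $seq = $io->next_seq) {
    foreach my $feat ($seq->get_SeqFeatures) {
        # Keep only 'gene' features that carry a /gene qualifier;
        # get_tag_values throws if the tag is missing.
        next unless $feat->primary_tag eq 'gene';
        next unless $feat->has_tag('gene');
        # Print each gene name on its own line.
        print "$_\n" for $feat->get_tag_values('gene');
    }
}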
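If BioPerl is not installed, something along these lines might do as a rough fallback. It is only a sketch, not the BioPerl approach above: it reads the GenBank flat file as plain text and assumes the standard layout (feature keys starting in column 6, qualifiers indented below them). The name gene_names is made up for illustration, and a real parser such as Bio::SeqIO will be more robust.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Return the /gene= values of all "gene" features found in GenBank
# flat-file text. Plain-text parsing only; assumes the standard layout
# (feature key at column 6, qualifiers indented underneath).
sub gene_names {
    my ($text) = @_;
    my (@names, $in_gene);
    for my $line (split /\n/, $text) {
        if ($line =~ /^ {5}(\S+)\s+\S/) {
            # A new feature starts; remember whether it is a gene.
            $in_gene = ($1 eq 'gene');
        }
        elsif ($line =~ /^\S/) {
            # A top-level line (LOCUS, ORIGIN, //) ends any open feature.
            $in_gene = 0;
        }
        elsif ($in_gene && $line =~ m{^\s+/gene="?([^"]+)"?}) {
            # Qualifier of the current gene feature: keep its value.
            push @names, $1;
        }
    }
    return @names;
}

# When a file name is given, print one gene name per line.
if (@ARGV) {
    local $/;          # slurp the whole file at once
    my $text = <>;
    print "$_\n" for gene_names($text);
}
```

Run it with the GenBank file as the only argument. CDS features are skipped even when they carry a /gene qualifier, so each gene name is printed once per gene feature.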
HTH,
Marc
***********************************************************
Marc Logghe, Ph.D.
Senior Scientist
Scientific Computing Group
deVGen
Technologiepark 9
9052 Zwijnaarde
Belgium
tel: +32 (0) 9 324 24 88
fax: +32 (0) 9 324 24 25
***********************************************************