[Bioperl-l] Converting Genbank to fasta via SeqIO
Wes Barris
wes.barris at csiro.au
Mon Jul 21 16:36:00 EDT 2003
Hi,
I am using the following code to convert a genbank file into a fasta
file:
my $seq_in = Bio::SeqIO->new('-file' => "<$infile", '-format' => 'genbank');
my $seq_out = Bio::SeqIO->new('-file' => ">$outfile", '-format' => 'fasta');
while ( my $inseq = $seq_in->next_seq ) {
if ($seq->accession =~ '_') {
$seq_out->write_seq($inseq);
}
}
The genbank entry (NM_174198) results in the following defline
in the fasta file:
>TLR4 Bos taurus toll-like receptor 4 (TLR4), mRNA.
However, I prefer to have a defline containing the accession number similar to
what is shown at the NCBI site:
>gi|31342611|ref|NM_174198.2| Bos taurus toll-like receptor 4 (TLR4), mRNA
Is there a way to have the SeqIO routines do this?
--
Wes Barris
E-Mail: Wes.Barris at csiro.au
More information about the Bioperl-l
mailing list