[Bioperl-l] Bio::SeqIO::genbank seems to munge SOURCE and ORGANISM lines

Hilmar Lapp hlapp at gmx.net
Fri Feb 21 10:52:59 EST 2003


Can you submit this as bug report to http://bugzilla.bioperl.org/?

Thanks,

	-hilmar

On Friday, February 21, 2003, at 04:53  AM, Geoff Purdy wrote:

> I've noticed some more strange behavior with reading
> and printing GenBank files using Bio::SeqIO.
>
> It appears that the SOURCE and ORGANISM lines are
> being corrupted by BioPerl in some records.  Below is
> an example using GenBank accession AE016800 (for ease
> of reading, I've removed the uneffected surounding
> lines):
>
>> From the original genbank file:
> SOURCE      Vibrio vulnificus CMCP6
>   ORGANISM  Vibrio vulnificus CMCP6
>
>
>> From the output after reading in an printing out with
> Bio::SeqIO:
> SOURCE      Vibrio vulnificus CMCP6 CMCP6.
>   ORGANISM  Vibrio vulnificus
>
> You can see that CMCP6 was dropped from the ORGANISM
> and appended (along with a mystery period) to the
> SOURCE.
>
> Is this a known issue, or should I submit it to
> bugzilla?
>
> Here is the code snippet that I used for reading the
> file and writing it back out:
>
> my $in  = Bio::SeqIO->newFh(-file => "AE016800.gb" ,
>                             '-format' => 'Genbank');
>
> my $out = Bio::SeqIO->newFh('-format' => 'Genbank');
>
> print $out $_ while <$in>;
>
>
> __________________________________________________
> Do you Yahoo!?
> Yahoo! Tax Center - forms, calculators, tips, more
> http://taxes.yahoo.com/
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at bioperl.org
> http://bioperl.org/mailman/listinfo/bioperl-l
>
>
-- 
-------------------------------------------------------------
Hilmar Lapp                            email: lapp at gnf.org
GNF, San Diego, Ca. 92121              phone: +1-858-812-1757
-------------------------------------------------------------



More information about the Bioperl-l mailing list