[Bioperl-l] Problem in parsing GenBank flatfile

Brian Osborne brian_osborne at cognia.com
Fri Jun 10 12:56:53 EDT 2005


Yoshida,

Whatever the problem is it appears to be fixed in bioperl-live, in CVS. I
downloaded the most recent NT_015926 and it's parsed correctly.

Brian O.



On 6/9/05 5:39 AM, "Yoshida Yuichi" <yuichiyoshi at gmail.com> wrote:

> Dear all,
> 
> I am trying to parse GenBank flatfile (accession num is NT_015926)
> by calling Bio::SeqIO modules, but I can not.
> 
> - - - - - - - - - - - - - Perl program code - - - - - - - - - - - - -
> #!/usr/bin/perl
> use Bio::SeqIO;
> 
> $gbk_filename = shift @ARGV;
> $seqin = Bio::SeqIO->new(-file=>$gbk_filename, -format=>'Genbank');
> 
> while ($seqobj = $seqin->next_seq) {
>     $accession = $seqobj->accession_number,"\n";
>     foreach my $feat ($seqobj->get_SeqFeatures()){
>         if ($feat->primary_tag eq 'mRNA'){
>             $db_gene_name = join(' ',$feat->get_tag_values('gene'));
>             $db_transcript_id = join('
> ',$feat->get_tag_values('transcript_id'));
>             $start = $feat->start; $end = $feat->end;
>             print $db_transcript_id,"\t",$db_gene_name,"\t",$accession,"\t";
>             print $start,"\t",$end,"\n";
>         }
>     }
> }
> - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> 
> The following error message is shown.
> 
> - - - - - - - - - - - - - error message - - - - - - - - - - - - -
> -------------------- WARNING ---------------------
> MSG: cannot see new qualifier in feature CDS: aa:OTHER)
> ---------------------------------------------------
> 
> -------------------- WARNING ---------------------
> MSG: cannot see new qualifier in feature CDS: aa:OTHER)
> ---------------------------------------------------
> 
> -------------------- WARNING ---------------------
> MSG: cannot see new qualifier in feature CDS: aa:OTHER)
> ---------------------------------------------------
> out of memory
> - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> 
> The parts which cause the error (I guess) is shown as the followings.
> 
> - - - - - GenBank partial flatfile (NT_015926) - - - - -
>      CDS             complement(join(4528741..4528932,4543408..4543490,
>                      4581809..4582043,4616648..4616817,4632093..4632236,
>                      4643148..4643301))
>                      /gene="FLJ21820"
>                      /note="go_function: catalytic activity [goid 0003824]
>                      [evidence IEA];
>                      go_process: lipid metabolism [goid 0006629] [evidence
>                      IEA]"
>                      /codon_start=1
>                      /product="hypothetical protein FLJ21820"
>                      /protein_id="NP_068744.1"
>                      /db_xref="GI:11345458"
>                      /db_xref="GeneID:60526"
>                      /db_xref="LocusID:60526"
> - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
> 
> Would you please tell me the way to solve this problem?
> 
> --
> Yuichi Yoshida <yuichiyoshi at gmail.com>
> 
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l




More information about the Bioperl-l mailing list