[Biojava-l] Genbank SeqIO question

Neill White nk_white@yahoo.com
Tue, 30 Jul 2002 12:11:32 -0700 (PDT)


Thanks Robin - 

I was using the latest (1.21) binary.  Updating to the 
current code was a marked improvement.  The input and
output genbank files nearly match.  There is a slight issue
with an accumulation of '/JoinType="join"' feature table
qualifiers on CDS and mRNA records (i.e., if you read in
and print out the same genbank record n times, you'll get
n '/JoinType="join"' qualifiers for each feature that has
a join.  This probably won't break anyone's parser though
and is more cosmetic than vital.

neill

--- "Emig, Robin" <Robin.Emig@maxygen.com> wrote:
> Are you using the latest source from the CVS? We should have recently cleaned up a lot of the
> genbank and msf parsing code.
> -Robin
> 
> -----Original Message-----
> From: Neill White [mailto:nk_white@yahoo.com]
> Sent: Monday, July 29, 2002 6:30 PM
> To: biojava-l@biojava.org
> Subject: [Biojava-l] Genbank SeqIO question
> 
> 
> Hello-
> 
> I'm a new user of biojava and have a small problem that
> I'm hoping someone can help me with - that is, I'm trying 
> to read in and write out genbank files such 
> that what I write out is somewhat close to what I read in
> (it's acceptable if some info is lost - not all parsers
> are perfect).  The problem is that some information
> (the annotation) is lost after the first iteration.  Not
> a huge problem, but if I now modify the program to read in
> this freshly created genbank file ('file.dat'), and write
> it out again - it's now missing the feature table.  Any
> advice would be greatly appreciated.
> 
> Thanks-
> neill
> 
> my program:
> 
> import java.io.FileOutputStream;
> import java.io.PrintStream;
> import java.io.BufferedReader;
> import java.io.FileReader;
> 
> import org.biojava.bio.*;
> import org.biojava.bio.seq.*;
> import org.biojava.bio.seq.impl.*;
> import org.biojava.bio.seq.io.*;
> import org.biojava.bio.symbol.*;
> 
> public class SimpleBioJava{
>     public static void main(String args[]) throws Exception{
> 
>         // First, make, or read in a simple DNA sequence
>         //Sequence sequence = nwSequence( "gattaca", "DNA" );
>         // This line reads in the first sequence from a fasta file
>         //Sequence sequence = nwReadFastaSequence( "test.fa" );
>         // This line reads in the first sequence from a genbank file
>         Sequence sequence = nwReadGenbankSequence( "gi20537285.gbk" );
> 
>         // Phew, now that that's done, let's write it to a file
>         SequenceFormat seqFormat = new GenbankFormat();
>         PrintStream stream = new PrintStream( new FileOutputStream( "test.dat" ) );
>         seqFormat.writeSequence( sequence, stream );
>     }
>  
>     public static Sequence nwReadGenbankSequence( String fileName ) throws Exception{
> 
>         Sequence sequence = null;
> 
>         BufferedReader br = new BufferedReader( new FileReader( fileName ) );
>         SequenceIterator stream = SeqIOTools.readGenbank( br );
> 
>         if ( stream.hasNext() ){
>             sequence = stream.nextSequence();
>         }
> 
>         return sequence;
>     }
> }
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Yahoo! Health - Feel better, live better
> http://health.yahoo.com
> _______________________________________________
> Biojava-l mailing list  -  Biojava-l@biojava.org
> http://biojava.org/mailman/listinfo/biojava-l


__________________________________________________
Do You Yahoo!?
Yahoo! Health - Feel better, live better
http://health.yahoo.com