[Biojava-l] Parsing INSDseq Sequences (1.3 & 1.4)
Richard Holland
richard.holland at ebi.ac.uk
Thu Jun 8 09:14:22 UTC 2006
Yesterday I think I said I was going to add other-seqids but I forgot to
do it, so I did it just now. Try it and see. Use the new
INSDseqFormat.Terms.getOtherSeqIdTerm() term to find them.
cheers,
Richard
On Wed, 2006-06-07 at 19:48 -0400, Seth Johnson wrote:
> Hi Richard,
>
> I still cannot locate the GI number for the main sequence. After I
> parse it with readINSDseqDNA, I then use:
>
> Note [] myAccs = ((RichAnnotation)rs.getAnnotation
> ()).getProperties(Terms.getAdditionalAccessionTerm ());
>
> However, the 'myAccs' appears to be empty. Am I on the wrong track to
> get to other-seqids???
>
> On 6/6/06, Richard Holland <richard.holland at ebi.ac.uk> wrote:
> GenBank has a separate line for GI number, so it can be parsed
> out
> nicely. INSDseq does not, so you have to rely on the other-
> seqids tag
> and hope that one of them is the GI number. However it seems I
> have not
> included that tag in the parser, so I will include it. This
> will make
> the other-seqids values available through the notes with the
> term
> Terms.getAdditionalAccessionTerm(), but getIdentifier() will
> remain
> null.
>
> For your second question, the tutorial makes the mistake in
> several
> places of saying getNoteSet(Terms.blahblah()). This was
> shorthand for:
>
> rs.getAnnotation().getProperty(Terms.blahblah())
> (for single values)
>
> or
>
> ((RichAnnotation)rs.getAnnotation()).getProperties
> ( Terms.blahblah())
> (for multiple values)
>
> but never got expanded. Maybe someone can fix that one
> day... :)ded...
>
> I'm just updating INSDseq to 1.4 now. The guys next door gave
> me the
> details of the changes, and told me that 1.3 is actually no
> longer
> supported by them after Friday this week! So I'll make it 1.4
> only.
>
> cheers,
> Richard
>
--
Richard Holland (BioMart Team)
EMBL-EBI
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
UNITED KINGDOM
Tel: +44-(0)1223-494416
More information about the Biojava-l
mailing list