[Biojava-l] Writing EMBL files
Vasa Curcin
vc100 at doc.ic.ac.uk
Tue Nov 23 11:44:25 EST 2004
Hello,
We are loading an EMBL file into a SequenceDB and then writing it out
again and getting the following error:
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.EmblFileFormer.addSequ
enceProperty(EmblFileFormer.java:246)
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.SeqIOEventEmitter.getS
eqIOEvents(SeqIOEventEmitter.java:92)
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.EmblLikeFormat.writeSe
quence(EmblLikeFormat.java:289)
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.EmblLikeFormat.writeSe
quence(EmblLikeFormat.java:253)
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.StreamWriter.writeStre
am(StreamWriter.java:63)
16:42:19,032 INFO [STDOUT] at
org.biojava.bio.seq.io.SeqIOTools.writeEmbl(S
eqIOTools.java:289)
16:42:19,032 INFO [STDOUT] at
SequenceDBToText.process(SequenceDBToText.jav
a:134)
This is the file we are using:
ID AB126240 standard; genomic DNA; PRO; 1350 BP.
XX
AC AB126240;
XX
SV AB126240.1
XX
DT 03-SEP-2004 (Rel. 81, Created)
DT 03-SEP-2004 (Rel. 81, Last updated, Version 1)
XX
DE Thermococcus kodakaraensis Tko1062 gene for phosphosugar mutase,
complete
DE cds.
XX
KW .
XX
OS Thermococcus kodakaraensis
OC Archaea; Euryarchaeota; Thermococci; Thermococcales; Thermococcaceae;
OC Thermococcus.
XX
RN [1]
RP 1-1350
RA Imanaka T., Atomi H., Rashid N.;
RT ;
RL Submitted (15-NOV-2003) to the EMBL/GenBank/DDBJ databases.
RL Tadayuki Imanaka, Kyoto University, Synthetic Chemistry & Biological
RL Chemistry, Graduate School of Engineering; Katsura, Nishikyo-ku, Kyoto
RL 615-8510, Japan (E-mail:imanaka at sbchem.kyoto-u.ac.jp,
Tel:81-75-383-2777,
RL Fax:81-75-383-2778)
XX
RN [2]
RA Rashid N., Kanai T., Atomi H., Imanaka T.;
RT "Among Multiple Phosphomannomutase Gene Orthologues, Only One Gene
Encodes
RT a Protein with Phosphoglucomutase and Phosphomannomutase Activities in
RT Thermococcus kodakaraensis";
RL J. Bacteriol. 186:6070-6076(2004).
XX
FH Key Location/Qualifiers
FH
FT source 1..1350
FT /db_xref="taxon:69014"
FT /mol_type="genomic DNA"
FT /organism="Thermococcus kodakaraensis"
FT /strain="KOD1"
FT CDS 1..1350
FT /codon_start=1
FT /transl_table=11
FT /gene="Tko1062"
FT /product="phosphosugar mutase"
FT /protein_id="BAD42439.1"
FT
/translation="MGKYFGTSGIREVFNEKLTPELALKVGKALGTYLGGGKVVIGKDT
FT
RTSGDVIKSAVISGLLSTGVDVIDIGLAPTPLTGFAIKLYGADAGVTITASHNPPEYNG
FT
IKVWQANGMAYTSEMERELESIMDSGNFKKAPWNEIGTLRRADPSEEYINAALKFVKLE
FT
NSYTVVLDSGNGAGSVVSPYLQRELGNRVISLNSHPSGFFVRELEPNAKSLSALAKTVR
FT
VMKADVGIAHDGDADRIGVVDDQGNFVEYEVMLSLIAGYMLRKFGKGKIVTTVDAGFAL
FT
DDYLRPLGGEVIRTRVGDVAVADELAKHGGVFGGEPSGTWIIPQWNLTPDGIFAGALVL
FT
EMIDRLGPISELAKEVPRYVTLRAKIPCPNEKKAKAMEIIAREALKTFDYEGLIDIDGI
FT RIENGDWWILFRPSGTEPIMRITLEAHEEEKAKELMGKAERLVKKAISEA"
XX
SQ Sequence 1350 BP; 339 A; 341 C; 417 G; 253 T; 0 other;
atggggaagt acttcggaac cagcggaatc agggaagtct ttaatgagaa
gctgacacct 60
gagctggctc taaaggtcgg caaagccctt ggaacgtacc tcggcggcgg
aaaggttgtt 120
atcgggaagg ataccaggac tagcggcgac gttataaaat cagcagtcat
aagcggactt 180
ctctcaactg gtgttgatgt gattgacata ggtttagcgc caacgccgct
cacgggcttt 240
gcgataaagc tctacggtgc cgatgctggc gttaccatca cagcttctca
caacccgccg 300
gagtacaacg gcataaaggt gtggcaggcc aacggaatgg catacacctc
tgagatggag 360
cgtgaactcg agtccataat ggactcaggg aacttcaaaa aagctccctg
gaatgagatc 420
gggacgctta gaagggccga ccccagtgag gagtacataa acgcggcgct
aaaattcgtc 480
aaacttgaga actcctacac ggtcgtcctc gattctggaa acggtgcggg
ctcggtggtc 540
tccccctacc tccagcggga gctgggcaat agggttatct cgctcaactc
ccacccgagc 600
ggcttcttcg tcagggaact tgagccgaac gcgaagagcc tctccgccct
agcgaagacc 660
gttagagtga tgaaagccga cgtcggcata gcccacgacg gcgacgcaga
taggatcggc 720
gtcgttgatg atcagggcaa cttcgttgag tacgaggtca tgctctcgct
catagcgggc 780
tacatgctga ggaagttcgg gaaggggaaa atagttacca ccgttgatgc
gggctttgct 840
ttggacgact acctcagacc ccttggcgga gaagtcataa ggacgcgcgt
tggtgatgtg 900
gccgttgccg acgagctcgc aaaacacggc ggcgtcttcg gcggcgagcc
gagtggcacg 960
tggataatcc cgcagtggaa cctcaccccc gacggaatct ttgctggggc
ccttgttctg 1020
gagatgattg acagactcgg tccgataagc gagctggcca aggaagtccc
gcgctacgtg 1080
acgctccgcg ccaaaatccc ctgtccgaac gagaagaagg cgaaagccat
ggagataata 1140
gcgcgcgagg cactaaagac gttcgactac gaggggctga tagacataga
tggaattagg 1200
atagaaaacg gtgactggtg gatcctcttc cgcccgagcg gaaccgagcc
gataatgcgc 1260
ataactttgg aggcccacga ggaagagaag gcgaaggagc tgatggggaa
ggcggagagg 1320
ctggttaaga aagccatctc
ggaggcctga 1350
//
Any ideas?
Regards,
Vasa
More information about the Biojava-l
mailing list