EMBL and Ensembl

Stefanie Lager stefanielager at fastmail.ca
Tue Mar 18 06:12:05 UTC 2003


Hi,

I have some problems with the EMBL format output from Ensembl. If I
retrieve a LARGE piece of DNA from Ensembl in EMBL format, the SQ line
gets so long that it is split into two SQ lines, and this is NOT
handled correctly by EMBOSS programs! Some EMBOSS programs give a
warning about illegal characters, others just incorporate the second
SQ line into the sequence. It's easy to fix the problem by manual
editing, but it would be nice to know if this IS standard EMBL format
or if it's Ensembl that has made a mistake?

ID   1.77242832-92443803    ENSEMBL; DNA; PLN; 15200972 BP.
XX
.....
.....
.....
FT   misc_feature    14757170..15200972
FT                   /note="contig 1.92000001-93000000 1..443803(1)"
XX
SQ   Sequence 15200972 BP; 4106479 A; 3111667 C; 3123445 G; 4136833 T; 722548
SQ   other;
     TAGAACTTGC AAATGAGAAA ACAGAGTTCT GTCAAGCTGT GTTAGTGTTT GCCCAACACA        60
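
The manual edit described above could be scripted roughly as follows. This is only a sketch, assuming the split always shows up as a second line starting with "SQ" directly after the first SQ line, as in the excerpt; the function name merge_split_sq_lines is made up for illustration and is not part of EMBOSS or Ensembl.

```python
def merge_split_sq_lines(lines):
    """Join consecutive SQ header lines of an EMBL entry into one SQ line.

    Assumption: a continuation is any line starting with "SQ" that
    immediately follows another line starting with "SQ".
    """
    fixed = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("SQ") and fixed and fixed[-1].startswith("SQ"):
            # Continuation: drop the "SQ" line code and append the rest
            # to the previous SQ line.
            fixed[-1] = fixed[-1].rstrip() + " " + line[2:].strip()
        else:
            fixed.append(line)
    return fixed


if __name__ == "__main__":
    import sys

    # Usage: python fix_sq.py broken.embl > fixed.embl
    with open(sys.argv[1]) as handle:
        for out_line in merge_split_sq_lines(handle):
            print(out_line)
```

Sequence lines begin with spaces, so only the SQ header lines are touched. The merged SQ line becomes longer than 80 columns; whether a given EMBOSS program accepts that has not been checked here.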

