EMBL and Ensembl
Stefanie Lager
stefanielager at fastmail.ca
Tue Mar 18 06:12:05 UTC 2003
Hi,
I have some problems with the EMBL format output from Ensembl. If I
retrieve a LARGE piece of DNA from Ensembl in EMBL format, the SQ line
gets so long so it's divided into two SQ lines, this is NOT handled
correctly by EMBOSS programs! Some EMBOSS programs gives a warning
about illegal characters, others just incorporates the second SQ line
in the sequence. It's easy to fix the problem by manual editing, but
it would be nice to know it this IS standard EMBL format or if it's
Ensembl that's made a mistake?
ID 1.77242832-92443803 ENSEMBL; DNA; PLN; 15200972 BP.
XX
.....
.....
.....
FT misc_feature 14757170..15200972
FT /note="contig 1.92000001-93000000 1..443803(1)"
XX
SQ Sequence 15200972 BP; 4106479 A; 3111667 C; 3123445 G; 4136833 T;
722548
SQ other;
TAGAACTTGC AAATGAGAAA ACAGAGTTCT GTCAAGCTGT GTTAGTGTTT GCCCAACACA
60
_________________________________________________________________
http://fastmail.ca/ - Fast Secure Web Email for Canadians
More information about the EMBOSS
mailing list