[Bioperl-l] Re: bad entries in interpro again
Hilmar Lapp
hlapp at gnf.org
Thu Dec 2 16:28:16 EST 2004
This sounds more like an expat or XML::Parser problem. Have you tried
to upgrade either? Maybe check with the authors of those modules?
-hilmar
On Dec 2, 2004, at 2:11 AM, Mikko Arvas wrote:
>
> Hi,
>
> At 13:49 1.12.2004 -0800, Hilmar Lapp wrote:
>
>> On Wednesday, December 1, 2004, at 08:16 AM, Mikko Arvas wrote:
>>
>>> Is the &apos the source of the problem?
>> Did you try to take it out and see what happens? I.e., you can answer
>> this yourself easily.
>> I would have thought that it's not the problem, but it'd be great if
>> you or somebody else helps out by testing what was suggested.
>
> Sorry about that I should have tested it before mailing. The problem
> is not non-ascii characters it seems to be specifically the
> combination of two & inside individual <>. I tried various
> combinations and other non-ascii characters (even in abundance) don't
> break it and a single & does neither.
>
> Here is again the problematic line:
> <interpro id="IPR002073" name="3'5'-cyclic nucleotide
> phosphodiesterase" type="Domain" parent_id="IPR003607">
>
> And its error:
> not well-formed (invalid token) at line 2, column 54, byte 132 at
> /usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi/XML/Parser.pm
> line 187
>
> So which way to proceed?
>
>>> Is it really a problem in BioPerl or in expat?
>>
>> If the problem is outside of Interpro, it's Expat, not Bioperl. It's
>> the XML parser library that threw up.
>>
>>> Is somebody trying to solve the problem for Bioperl now
>>> and is there any sensible thing that the interpro team could do to
>>> help?
>>
>> Depends on where the problem is. It appears that the Interpro team
>> already eliminated the double quotes in names. The is some hard-coded
>> stuff in interpro.pm that needs to be removed, and I heard Allen say
>> he'll work on that.
>>
>> -hilmar
>
> Cheers,
> mikko
>
>
>
> Mikko Arvas
> VTT Biotechnology
>
> e-mail: mikko.arvas at vtt.fi
> tel: +358-(0)9-456 5827
> mobile: +358-(0)44-381 0502
> fax: +358-(0)9-455 2103
> mail: Tietotie 2, Espoo
> P.O. Box 1500
> FIN-02044 VTT, Finland
>
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l
>
--
-------------------------------------------------------------
Hilmar Lapp email: lapp at gnf.org
GNF, San Diego, Ca. 92121 phone: +1-858-812-1757
-------------------------------------------------------------
More information about the Bioperl-l
mailing list