[MOBY-dev] XML parsing takes forever

José María Fernández González jmfernandez at cnio.es
Thu Oct 30 19:54:33 UTC 2008


Hi everybody,
	I'm using the XML::LibXML SAX parser for other projects and tasks, and it can
be really, really slow. You don't realize that until you are parsing a large
file from a pipe, and you see that most of the time the program which is
parsing is CPU (and not I/O) bounded.

	Best Regards,
		José María

Edward Kawas wrote:
> I think that you are right about the parser not going into an infinite loop.
> For some reason, the SAX parser seems to be really slow (although, I don't
> want to say that until I know for sure).
> 
> Eddie
> 
> -----Original Message-----
> From: moby-dev-bounces at lists.open-bio.org
> [mailto:moby-dev-bounces at lists.open-bio.org] On Behalf Of Kenny Billiau
> Sent: October-30-08 11:56 AM
> To: Core developer announcements
> Subject: Re: [MOBY-dev] XML parsing takes forever
> 
> Hi,
> 
> thx for helping to investigate the problem!
> 
> I'm pretty sure the parser gets out of it's mess. When you truncate the 
> XML file by removing all but a few ElementContainers, it takes awhile, but 
> it actually returns without error.
> 
> And memory consuption doesn't seem to be a problem either, as mostly 'top' 
> reveals that the invoked webservice script uses a steady amount of memory.
> 
> thx,
> Kenny
> 
> On Thu, 30 Oct 2008, Edward Kawas wrote:
> 
>> So this one might take some time... from what I can see, our parser goes
>> into an infinite loop (although, I am not certain about that ...).
>>
>> I will play around with this XML file and get back to you.
>>
>> Thanks,
>>
>> Eddie
>>
>> -----Original Message-----
>> From: moby-dev-bounces at lists.open-bio.org
>> [mailto:moby-dev-bounces at lists.open-bio.org] On Behalf Of Kenny Billiau
>> Sent: October-30-08 8:03 AM
>> To: Core developer announcements
>> Subject: Re: [MOBY-dev] XML parsing takes forever
>>
>> Hi,
>>
>> here you go,
>>
>> Kenny
>>
>> On Thu, 30 Oct 2008, Edward Kawas wrote:
>>
>>> Hi,
>>>
>>> This is the right list! Could you send me the XML for your second
> service?
>>> Thanks,
>>>
>>> Eddie
>> _______________________________________________
>> MOBY-dev mailing list
>> MOBY-dev at lists.open-bio.org
>> http://lists.open-bio.org/mailman/listinfo/moby-dev
>>
> 

-- 
"There is no reason why anybody would want a computer in their home" -
	Ken Olson, founder of DEC 1977
"640K ought to be enough for anybody" - Bill Gates, 1981
"Nobody will ever outgrow a 20Mb hard drive." - ???

"Premature optimization is the root of all evil." - Donald Knuth

José María Fernández González
Tlfn: (+34) 91 732 80 00 / 91 224 69 00 (ext 3061)
e-mail: jmfernandez at cnio.es		Fax: (+34) 91 224 69 76
Unidad del Instituto Nacional de Bioinformática
Biología Estructural y Biocomputación	Structural Biology and Biocomputing
Centro Nacional de Investigaciones Oncológicas
C.P.: 28029				Zip Code: 28029
C/. Melchor Fernández Almagro, 3	Madrid (Spain)

**NOTA DE CONFIDENCIALIDAD** Este correo electrónico, y en su caso los ficheros adjuntos, pueden contener información protegida para el uso exclusivo de su destinatario. Se prohíbe la distribución, reproducción o cualquier otro tipo de transmisión por parte de otra persona que no sea el destinatario. Si usted recibe por error este correo, se ruega comunicarlo al remitente y borrar el mensaje recibido.
**CONFIDENTIALITY NOTICE** This email communication and any attachments may contain confidential and privileged information for the sole use of the designated recipient named above. Distribution, reproduction or any other use of this transmission by any party other than the intended recipient is prohibited. If you are not the intended recipient please contact the sender and delete all copies.




More information about the MOBY-dev mailing list