[Bioperl-l] query name in xml blast report
gert thijs
gert.thijs@esat.kuleuven.ac.be
Mon, 21 Oct 2002 13:36:52 +0200
Hi,
I am parsing blast reports in XML format with bioperl and this works fine
except from the extraction of the query ID and description line. If I ask for
the query name ($r->query_name ) and description ($r->query_description), I
get empty strings and not values that are in the respective fields in the xml
file.
In the XML file I find the following lines describing my query sequence:
----
<BlastOutput_query-ID>lcl|QUERY</BlastOutput_query-ID>
<BlastOutput_query-def>AF059581|INCLUSive|gene|47|1504|1|+|.|id SAHH ;
number 1 ; query 'AF059581 - SAHH';</BlastOutput_query-def>
<BlastOutput_query-len>701</BlastOutput_query-len>
----
I guess the value in the 'BlastOutput_query-ID' field is set by the NCBI blast
server while the value in the 'BlastOutput_query-def' field matches the header
of my query sequence, but the quotes are changed to '
Has anyone a suggestion on how to get the full description line from the xml
report?
Gert
--
+ Gert Thijs
+ K.U.Leuven
+ ESAT-SCD
+ Kasteelpark Arenberg 10
+ B-3001 Leuven-Heverlee
+ Belgium
+
+ Tel : +32 16 32 85 88
+ Fax : +32 16 32 19 70
+ email: gert.thijs@esat.kuleuven.ac.be
+
+ http://www.esat.kuleuven.ac.be/~thijs
+ http://www.esat.kuleuven.ac.be/~dna/BioI/
+