[EMBOSS] refseq IDs via SRSWWW

Peter Rice pmr at ebi.ac.uk
Fri Mar 26 09:57:12 UTC 2004


F.J. Novo wrote:

> Hello, I've just configured a remote access to genbank in my 2.8.0 
> installation, as suggested:
> 
> DB genbank [ type: N method: srswww format: genbank release: NCBI
>    comment: "Genbank from NCBI"
>    url: "http://cbr-rbc.nrc-cnrc.gc.ca/srs6bin/cgi-bin/wgetz"
> ]
> 
> It works fine with EMBL-type accession nos. (AJXXXXXX), but it seems 
> unable to retrieve refseq IDs (NM_XXXXX). Am I missing something?

By default, the srswww access method queries by id and acc, so you can 
query a remote SRS database for any ID or ACC value.

If you define some extra fields, you can query by description, organism 
and keyword.
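
As a rough sketch, the extra fields go into the database definition in 
emboss.default. The field names below (des, org, key) are the usual EMBOSS 
ones; whether the remote SRS server indexes all of them for this database 
is an assumption you would need to check:

```
DB genbank [ type: N method: srswww format: genbank release: NCBI
   comment: "Genbank from NCBI"
   fields: "des org key"
   url: "http://cbr-rbc.nrc-cnrc.gc.ca/srs6bin/cgi-bin/wgetz"
]
```

A query by description should then look something like 
"seqret genbank-des:kinase", where "kinase" is only a placeholder search 
term.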

The NM_xxxxx codes are NCBI's REFSEQ identifiers. GenBank entries may 
refer to them, but GenBank is not indexed by them, so they cannot be 
used as an ID or ACC query against it.

At the Canadian SRS server, the REFSEQ DNA entries are in REFSEQNT, so 
you can define that as a database. Use dbalias:refseqnt in the EMBOSS db 
definition if you want to use a different database name locally.
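
A definition along these lines might work as a starting point. The local 
name "refseq" and the comment text are made up for illustration, and 
format: genbank is an assumption based on RefSeq entries using the GenBank 
flat-file layout; the url is the one from your genbank definition:

```
DB refseq [ type: N method: srswww format: genbank
   dbalias: refseqnt
   comment: "RefSeq nucleotide entries from the Canadian SRS server"
   url: "http://cbr-rbc.nrc-cnrc.gc.ca/srs6bin/cgi-bin/wgetz"
]
```

With that in place, a request such as "seqret refseq:NM_XXXXX" would be 
sent to REFSEQNT on the remote server rather than to GenBank.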

We have considered providing an Entrez access method to try retrieving 
these strange identifiers (refseq, GI numbers, and so on) from NCBI but 
(a) they change the Entrez interface each time I look and (b) it 
requires far too much rewriting of parsers each time to get to the 
actual entries. Some day, if it becomes stable and if one can query by 
such fields and get straight to the entries, then we may implement it.

Hope this helps,

Peter Rice
