[Bioperl-l] NCBI/Swissprot cross-ref

Ewan Birney birney at ebi.ac.uk
Fri Dec 3 03:57:51 EST 2004



On Thu, 2 Dec 2004, Fontaine, Burr R wrote:

> Hi,
>
> Does anyone know if BioPerl can help me cross-reference gene and SNP
> IDs between NCBI and Swissprot? I can't find anything at NCBI or
> Swissprot that does this directly.
>

Do you mean mapping dbSNP IDs to variation IDs in Swissprot? Some variants
in the Swissprot files do carry dbSNP IDs (in the feature table, I believe),
and I think there is a goal to improve this coverage in Swissprot in future.
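
As a rough illustration of the feature-table point, here is a minimal Python
sketch that scans Swissprot flat-file text for VARIANT features and collects
any dbSNP IDs and FTId values attached to them. The sample entry, its
accession, positions and rs numbers are all made up, and the "dbSNP:rs..."
and "/FTId=VAR_..." patterns are assumptions about how the cross-reference
appears on the FT lines, so check them against a real entry first.

```python
import re

# Hypothetical excerpt in the Swissprot flat-file style. The accession,
# positions, rs number and FTIds are invented for illustration only.
SAMPLE = """\
ID   EXAMPLE_HUMAN  STANDARD;  PRT;  292 AA.
AC   P00000;
FT   VARIANT     101    101       G -> R (in dbSNP:rs0000001).
FT                                /FTId=VAR_000001.
FT   VARIANT     150    150       A -> T (in a patient).
FT                                /FTId=VAR_000002.
"""

# Assumed patterns for the cross-reference and the variant identifier.
DBSNP = re.compile(r"dbSNP:(rs\d+)")
FTID = re.compile(r"/FTId=(VAR_\d+)")


def variant_xrefs(flatfile_text):
    """Collect VARIANT features with their FTId and any dbSNP IDs."""
    variants = []
    current = None
    for line in flatfile_text.splitlines():
        if not line.startswith("FT"):
            continue
        # The feature key sits in a fixed column; it is blank on
        # continuation lines of the same feature.
        key = line[5:13].strip()
        if key:
            current = None
            if key == "VARIANT":
                fields = line[13:].split()
                # Positions are kept as strings, since real entries may
                # use non-numeric markers for uncertain positions.
                current = {"start": fields[0], "end": fields[1],
                           "ftid": None, "rsids": []}
                variants.append(current)
        if current is not None:
            current["rsids"].extend(DBSNP.findall(line))
            ftid = FTID.search(line)
            if ftid:
                current["ftid"] = ftid.group(1)
    return variants


for variant in variant_xrefs(SAMPLE):
    print(variant["ftid"], variant["start"], variant["rsids"])
```

Variants that come back with an empty rsids list would be the
literature-only ones, which have no dbSNP ID to join on.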


Swissprot definitely holds many more variants that are mentioned only in
papers, and these are often the ones with phenotypic effects.



>
>
> The closest thing we've found so far for this is the kgXref table at
> UCSC, but this table does not include SNPs. Also, this table appears
> to include Swiss-Prot IDs for both proteins and genes in the same
> field, and I'm not sure how to sort these out.
>
>
>
> #kgID     mRNA      spID      spDisplayID  geneSymbol  refseq     protAcc    description
> AY231461  AY231461  AAO84335  AAO84335     TAZ         NM_000116  NP_000107  Tafazzin exon 5 deleted variant long form.
> AY231462  AY231462  AAO84336  AAO84336     TAZ         NM_000116  NP_000107  Tafazzin exon 7 deleted variant long form.
> AY231463  AY231463  Q86XR0    Q86XR0       TAZ         NM_000116  NP_000107  Tafazzin exon 5 and exon 7 deleted variant long form.
> AY258036  AY258036  Q86XQ9    Q86XQ9       TAZ         NM_000116  NP_000107  Tafazzin short form.
> AY258037  AY258037  Q86XQ8    Q86XQ8       TAZ         NM_000116  NP_000107  Tafazzin exon 5 and exon 7 deleted variant short form.
> AY258038  AY258038  Q86XQ7    Q86XQ7       TAZ         NM_000116  NP_000107  Tafazzin exon 7 deleted variant short form.
> AY258039  AY258039  Q86XQ6    Q86XQ6       TAZ         NM_000116  NP_000107  Tafazzin exon 5 deleted variant short form.
> BC005062  BC005062  Q7Z6N8    Q7Z6N8       TAZ         NM_000116  NP_000107  Tafazzin, isoform 5.
> BC011515  BC011515  Q96F92    Q96F92       TAZ         NM_000116  NP_000107  Similar to tafazzin (cardiomyopathy, dilated 3A (X-linked), endocardial fibroelastosis 2, Barth syndrome).
> X92762    X92762    Q16635    TFZ_HUMAN    TAZ         NM_000116  NP_000107  tafazzin (cardiomyopathy, dilated 3A (X-linked); endocardial fibroelastosis 2; Barth syndrome)
>
>
>
> Thanks in advance for your help.
>
>
>
> Burr Fontaine
>
>
>
>
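
On sorting out the spID column in the kgXref rows quoted above: one possible
approach is to classify each value by its shape. In the quoted rows the
values look like a mix of Swissprot/TrEMBL accessions (Q86XR0), GenBank
protein accessions (AAO84335) and, in spDisplayID, a Swissprot entry name
(TFZ_HUMAN). The Python sketch below is only a guess along those lines. The
three regular expressions are my assumptions about the identifier formats,
not something taken from UCSC documentation, and the rows are split on
whitespace because the pasted copy is space-aligned (the real table may be
tab-separated).

```python
import re

# Assumed identifier shapes, inferred from the quoted rows.
UNIPROT_ACC = re.compile(r"^[OPQ][0-9][A-Z0-9]{3}[0-9]$")  # e.g. Q86XR0
GENBANK_PROT = re.compile(r"^[A-Z]{3}[0-9]{5}$")           # e.g. AAO84335
ENTRY_NAME = re.compile(r"^[A-Z0-9]+_[A-Z0-9]+$")          # e.g. TFZ_HUMAN

COLUMNS = ["kgID", "mRNA", "spID", "spDisplayID",
           "geneSymbol", "refseq", "protAcc", "description"]

# Three rows copied from the quoted kgXref excerpt.
ROWS = """\
#kgID mRNA spID spDisplayID geneSymbol refseq protAcc description
AY231461 AY231461 AAO84335 AAO84335 TAZ NM_000116 NP_000107 Tafazzin exon 5 deleted variant long form.
AY231463 AY231463 Q86XR0 Q86XR0 TAZ NM_000116 NP_000107 Tafazzin exon 5 and exon 7 deleted variant long form.
X92762 X92762 Q16635 TFZ_HUMAN TAZ NM_000116 NP_000107 tafazzin (cardiomyopathy, dilated 3A (X-linked); endocardial fibroelastosis 2; Barth syndrome)
"""


def classify(identifier):
    """Guess what kind of identifier this is from its shape alone."""
    if UNIPROT_ACC.match(identifier):
        return "uniprot_accession"
    if GENBANK_PROT.match(identifier):
        return "genbank_protein"
    if ENTRY_NAME.match(identifier):
        return "swissprot_entry_name"
    return "unknown"


def parse_rows(text):
    """Split whitespace-aligned rows into dicts keyed by column name."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split only the first seven fields so the free-text
        # description stays in one piece.
        fields = line.split(None, len(COLUMNS) - 1)
        rows.append(dict(zip(COLUMNS, fields)))
    return rows


for row in parse_rows(ROWS):
    print(row["kgID"], row["spID"], classify(row["spID"]),
          row["spDisplayID"], classify(row["spDisplayID"]))
```

Rows whose spID classifies as a GenBank protein accession would then need a
separate lookup, since they are not Swissprot identifiers at all.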


