[Bioperl-l] Ensembl stable_id_event table
Nathan (Nat) Goodman
ngoodman at systemsbiology.org
Tue Mar 22 13:57:50 UTC 2011
With apologies, this is not technically a bioperl question, but hopefully close enough. I'm trying to map old Ensembl IDs to current ones. Table stable_id_event on their ftp site seems to be the place to get this information, but the data in that table (for human build 61) is cyclic, meaning that old IDs are sometimes mapped to new ones which in turn are mapped back to the old one. This seems wrong... And, it's not a rare occurrence: 18% of IDs in the table lie on cycles, the largest of which contains more than 6000 IDs!
To see one short cycle, run this query (against human build 61).
mysql> select * from stable_id_event where
-> (old_stable_id='ENSP00000399673' or new_stable_id='ENSP00000399673')
-> and (old_stable_id='ENSP00000365536' or new_stable_id='ENSP00000365536');
The results are
+-----------------+-------------+-----------------+-------------+--------------------+-------------+-------+
| old_stable_id | old_version | new_stable_id | new_version | mapping_session_id | type | score |
+-----------------+-------------+-----------------+-------------+--------------------+-------------+-------+
| ENSP00000399673 | 2 | ENSP00000365536 | 3 | 380 | translation | 1 |
| ENSP00000399673 | 2 | ENSP00000365536 | 3 | 381 | translation | 1 |
| ENSP00000365536 | 3 | ENSP00000399673 | 2 | 381 | translation | 1 |
| ENSP00000399673 | 2 | ENSP00000365536 | 3 | 382 | translation | 1 |
| ENSP00000365536 | 3 | ENSP00000399673 | 2 | 382 | translation | 1 |
+-----------------+-------------+-----------------+-------------+--------------------+-------------+-------+
The first and last lines reveal the cycle:
line 1) ENSP00000399673-> ENSP00000365536
line 5) ENSP00000365536-> ENSP00000399673
If anyone can shed some light, I would be most grateful.
Thanks very much,
Nat Goodman
More information about the Bioperl-l
mailing list