[GSoC] GSoC python variant update 7
Lenna Peterson
arklenna at gmail.com
Tue Jul 17 17:48:33 UTC 2012
Hi all,
New blog post: http://arklenna.tumblr.com/post/27418058203/
Last week, Reece suggested trying to represent a variety of variants
with just five identifiers: accession, start, stop, pre_seq, and
post_seq. I've started a very minimal Variant object (in
https://github.com/lennax/biopython/blob/variant2/Bio/Variant/variant.py),
using `FeatureLocation` for its location. This uses zero-based,
right-open coordinates, similar to array counting in Python. In
contrast, HGVS and VCF both count from 1.
I've created a list of variant types each represented in HGVS, VCF (if
possible), and my new Python representation. It can be found on the
blog post. Please let me know if there are any errors in my
interpretation of these variant types.
Thanks,
Lenna
More information about the GSoC
mailing list