[GSoC] GSoC python variant update 6
Lenna Peterson
arklenna at gmail.com
Sat Jun 30 06:15:05 UTC 2012
Post: http://arklenna.tumblr.com/post/26196310200/
I've made a new branch (variant2:
https://github.com/lennax/biopython/tree/variant2 ) which has a very
skeletal outline of a set of Python objects designed to store
variants. One might note many similarities to the organization of
PyVCF. One thing SQL did neatly was store per-allele data with the
allele, rather than with the site, and I'm envisioning doing this in
Python, as well.
For a Python variant object, are there any organizational choices that
would make it easier for future conversion of a variant to HGVS
syntax? (this is primarily directed at Reece but I'm open to all
suggestions)
Another question that may reveal my complete ignorance of haplotypes
and such: could a polyploid site ever be partially phased? e.g. a
triploid genotype of 0/1|0?
Looking forward to any and all questions, comments, concerns, etc.
Lenna
More information about the GSoC
mailing list