[Biopython] SeqIO.parse for imgt
Peter Cock
p.j.a.cock at googlemail.com
Fri Nov 11 17:14:36 UTC 2016
Thank you Chang, James,
Those details are just the kind of thing I was hoping for.
I don't have time to make the IMGT parser changes today,
so have opened an issue for this on the Biopython GitHub:
https://github.com/biopython/biopython/issues/988
Peter
On Fri, Nov 11, 2016 at 5:00 PM, Liu, Chang <cliu32 at wustl.edu> wrote:
> Thank you very much, James!
> Hi, Peter, here you go - thank you in advance for updating the 'imgt' parser. I really appreciate it. Please let me know if I can be of any assistance!
> Chang
>
> -----Original Message-----
> From: James Robinson [mailto:jrobinso at ebi.ac.uk]
> Sent: Friday, November 11, 2016 10:54 AM
> To: Liu, Chang <cliu32 at wustl.edu>
> Cc: p.j.a.cock at googlemail.com
> Subject: Re: [IPD #99553] hla.dat file and biopython, follow up
>
> Hi,
>
> The key changes post 3.16 are the addition of an SV value to the ID line, these additions should make the format more similar to the ENA style.
>
> ID HLA00001 standard; DNA; HUM; 3503 BP.
>
> becomes
>
> ID HLA00001; SV 1; standard; DNA; HUM; 3503 BP.
>
> We have also added the SV value as a line in the file;
>
> SV HLA00001.1
>
> this is added between the AC and DT lines.
>
> The other change, is the removal of a third DT line, we previously had 3 lines, but have reduced this to two;
>
> DT 01-AUG-1989 (Rel. 1.0.0, Created, Version 1)
> DT 16-DEC-1998 (Rel. 1.0.0, Sequence Updated, Version 1)
> DT 14-APR-2014 (Rel. 3.16.0, Current Release, Version 1)
>
> becomes
>
> DT 01-AUG-1989 (Rel. 1.0.0, Created, Version 1)
> DT 14-OCT-2016 (Rel. 3.26.0, Last Updated, Version 1)
>
> In addition the text within the CC lines has changed from;
>
> CC --------------------------------------------------------------------------
> CC Copyrighted by the IMGT/HLA Database, Distributed under the Creative
> CC Commons Attribution-NoDerivs License, see;
> CC http://www.ebi.ac.uk/imgt/hla/licence.html for further details.
> CC --------------------------------------------------------------------------
>
> to
>
> CC --------------------------------------------------------------------------
> CC IPD-IMGT/HLA Release Version 3.26.0
> CC --------------------------------------------------------------------------
> CC Copyrighted by the IPD-IMGT/HLA Database, Distributed under the Creative
> CC Commons Attribution-NoDerivs License, see;
> CC http://www.ebi.ac.uk/ipd/imgt/hla/licence.html for further details.
> CC --------------------------------------------------------------------------
>
> Thanks
>
> James
More information about the Biopython
mailing list