[Biopython-dev] Updates to the tutorial for parsing GenBank files

Tue Dec 13 17:23:11 EST 2005

Are there any others on the list interested in parsing GenBank files who 
wouldn't mind proofreading/commenting on this change to the 
Tutorial/Cookbook?

i.e. Changes to this document, section 3.4 GenBank:

http://www.biopython.org/docs/tutorial/Tutorial004.html#toc13
http://www.biopython.org/docs/tutorial/Tutorial.pdf

The patch is on the mailing list archive here:

http://www.biopython.org/pipermail/biopython-dev/2005-November/002193.html

Or I could log a bug & attach the patch to it.

Would I be better off asking on the Discussion List, rather than the 
Development List for this sort of question?

Bonus question: where could I find multi-record GenBank files?

Peter

On 10 Nov 2005, I wrote:
> There should be a patch attached for Biopython Doc/Tutorial.tex which 
> tries to clarify GenBank parsing.
> 
> Created on Windows using:-
> 
> diff cvs_Tutorial.tex new_Tutorial.tex -E -Naur > patch.txt
> 
> In particular, I have tried make it clear that GenBank.Iterator() and 
> GenBank.index_file() are overkill/unnecessary when dealing with GenBank 
> files which contain only single record (which is the typical case in my 
> personal experience).
> 
> My changes add an introductory example: parsing a small bacterial genome 
> (a single large GenBank record), before moving on to the 
> GenBank.Iterator() and GenBank.index_file() examples.
> 
> I have also pointed out that the multi-record example GenBank file used 
> in these examples (cor6_6.gb) is included in the downloadable BioPython 
> source code.
> 
> Plus there is a minor correction to the GenBank.index_file example, 
> len(gb_dict) gives 6, not 7.
> 
> Peter