<div dir="ltr"><div><div>Hello everyone.<br></div>Is any support for FASTA dialects, so to say, in Biopython? For example, NCBI headers include GI/new ID, human-readable sequence name, and a good deal of them include species name in square brackets. Ones on JGI site include two of their sequence IDs and a shortened species name. MMETSP consists of lots and lots of tags. And so on and so forth, most databases have some internal standart for FASTA headers that potentially includes useful information.<br></div><div>Looking up docs, I found only SeqRecord.id and SeqRecord.description. If I understood correctly, this just means "Stuff before or after first \s, respectively". Can I get more fine-grained features without cooking up my own parser?<br></div><br clear="all"><div><div><div><br>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><font face="arial,helvetica,sans-serif">Alexey Morozov,<br>LIN SB RAS, bioinformatics group.<br>Irkutsk, Russia.<br></font></div>
</div></div></div></div>