[Bioperl-l] modify sequence name
yang liu
yang.liu0508 at gmail.com
Fri Mar 9 19:25:50 UTC 2012
Dear colleagues,
When I do Sanger sequencing, I get hundreds of sequences named by DNA
Numbers, and for several genes. I need to add taxon name manually for each
sequence. I wonder is there a way to change the names automatically?
I have two .txt files.
file 1, with seqeucens named by DNA Number:
>2863
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTC
>2864
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTCGTATTACTCCACAACCAGGTGTAGAT
........
file 2, with DNA Number and taxa names, seperated by tabs
2863 Gelidium
2864 Poa
........
I hope the final file to be like this,
>Gelidium-2863
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTC
>Poa-2864
AGGATTAAAAATCAACGCTATGAATCTGGTGTAATTCCATATGCTAAAATGGGCTATTGGGATCCTAATT
ATGCAATTAAAGAAACTGATGTATTAGCATTATTTCGTATTACTCCACAACCAGGTGTAGAT
Any ideas? Anything help would be appreciated.
Yang.
More information about the Bioperl-l
mailing list