[Bioperl-l] preparing nexus files for Mr. Bayes
Jason Stajich
jason at bioperl.org
Wed Jun 11 15:13:20 UTC 2008
Giovanni -
Bits and pieces already exist, there is little to handle the text
blocks that exist in NEXUS files because BioPerl has has been focused
on the alignment data not the analysis. Arlin Stolzfus has started a
Bio::NEXUS project that is supposed to fully parse all NEXUS data in
and out but I don't know what the status is right now.
If you look at the Bio::AlignIO::nexus documentation you need to turn
off symbols and endblock when writing for MrBayes NEXUS.
my $out = Bio::AlignIO->new(-format => 'nexus',
-show_symbols => 0,
-show_endblock => 0);
# after you have written out the alignment
$out->write_aln;
# you can then print out whatever execution blocks you want with
standard print statements.
print $out->_fh "begin mrbayes;\n"; ...
As for the other things, the Bio::SimpleAlign module lets you swap
the match character (see map_char and gap_char methods).
For joining alignments and setting up partitions, you can join
multiple alignments by making a new Bio::SimpleAlign and adding
concatenated sequences together.
my %matrix;
for my $aln ( @alns ) {
for my $seq ( $aln->each_seq ) {
my $id = $seq->id;
$matrix{$id} .= $seq->seq;
}
}
my $bigaln = Bio::SimpleAlign->new;
while( my ($id,$seq) = each %matrix ) {
$bigaln->add_seq(Bio::LocatableSeq->new(-id => $id,
-seq => $seq));
}
In general there is not a single solution to a lot of these tasks
(although there should be one that concatenates a set of alignments)
so there are not ready-to-use functions. I have my custom code I use
to join datasets and establish partitions but much is specific to how
I organize the data on my system so I don't know how informative it
would be.
I'm not sure python code would be much good here... =) if you jump
ship you should talk to Frank Kauff who has written some python code
to manipulate alignments for biopython.
-jason
On Jun 11, 2008, at 2:46 AM, Giovanni Marco Dall'Olio wrote:
> Hi,
> I was wondering whether there is any bioperl module I can use to
> handle nexus files for Mr.Bayes.
>
> mr. Bayes is a program for bayesian estimation of phylogeny, which
> uses an alignment file in a customized nexus format as input.
> I would need a module to prepare these files, doing tasks like:
> - joining more than an alignment in a single file/line
> - substitute matching chars with '.'
> - customizing parameters displayed in the headers of the output
> nexus file
> - manage mrbayes' extensions, like adding information about
> partitions and taxas
> - adding batch instructions for the mr bayes interpreter
> - and similar stuff.
>
> I have tried Bio::AlignIO but I see it doesn't handle all of this,
> what I am looing for is bit more specific.
> Moreover, I found a bug in the way mrbayes recognizes the output from
> Bio::AlignIO (https://sourceforge.net/tracker/index.php?
> func=detail&aid=1990655&group_id=129302&atid=714418).
>
> Is there any existing module already available?
> If there is not, I am going to have to write such scripts anyway.
> I would like to contribute them to bioperl, even if I am not a very
> good perl programmer, I prefer python.
>
> --
> -----------------------------------------------------------
>
> My Blog on Bioinformatics (italian): http://bioinfoblog.it
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l
More information about the Bioperl-l
mailing list