Bio::Tools::Run::Phylo::PAML
Evolver
Toolbar
Summary
Bio::Tools::Run::Phylo::PAML::Evolver - Wrapper aroud the PAML program evolver
Package variables
No package variables defined.
Included modules
Bio::AlignIO
Bio::Root::Root
Bio::SeqIO
Bio::Tools::Phylo::PAML
Bio::Tools::Run::WrapperBase
Bio::TreeIO
Cwd
Inherit
Bio::Root::Root Bio::Tools::Run::WrapperBase
Synopsis
use Bio::Tools::Run::Phylo::PAML::Evolver;
my $evolver = Bio::Tools::Run::Phylo::PAML::Evolver->new();
# Get a $tree object somehow
$evolver->tree($tree);
# FIXME: evolver generates a tree (first run with option 1 or 2)?
# One or more alns are created
my @alns = $evolver->run();
####
# Or with all the data coming from a previous PAML run
my $parser = Bio::Tools::Phylo::PAML->new
(
-file => "$inputfile",
);
my $result = $parser->next_result();
my $tree = $result->next_tree;
$evolver->tree($tree);
my @codon_freqs = $result->get_CodonFreqs();
$evolver->set_CodonFreqs(\@codon_freqs);
my $val = $evolver->prepare();
# FIXME: something similar for nucleotide frequencies:
# Option (5) Simulate nucleotide data sets (use MCbase.dat)?
# FIXME: something similar for aa parameters:
# Option (7) Simulate amino acid data sets (use MCaa.dat)?
# FIXME: With an initial RootSeq.txt
Description
This is a wrapper around the evolver program of PAML (Phylogenetic
Analysis by Maximum Likelihood) package of Ziheng Yang. See
http://abacus.gene.ucl.ac.uk/software/paml.html for more information.
This module is more about generating the properl MCmodel.ctl file and
will run the program in a separate temporary directory to avoid
creating temp files all over the place.
Methods
Methods description
Title : program_name Usage : $factory->program_name() Function: holds the program name Returns: string Args : None |
Title : program_dir Usage : ->program_dir() Function: returns the program directory, obtained from ENV variable. Returns: string Args : |
Title : new Usage : my $obj = Bio::Tools::Run::Phylo::PAML::Evolver->new(); Function: Builds a new Bio::Tools::Run::Phylo::PAML::Evolver object Returns : Bio::Tools::Run::Phylo::PAML::Evolver -save_tempfiles => boolean to save the generated tempfiles and NOT cleanup after onesself (default FALSE) -tree => the Bio::Tree::TreeI object (FIXME: optional if this is done in a first run) -params => a hashref of PAML parameters (all passed to set_parameter) -executable => where the evolver executable resides
See also: Bio::Tree::TreeI |
Title : prepare Usage : my $rundir = $evolver->prepare($aln); Function: prepare the evolver analysis using the default or updated parameters the alignment parameter must have been set Returns : value of rundir Args : Bio::Align::AlignI object, Bio::Tree::TreeI object [optional] |
Title : run Usage : my ($rc,$parser) = $evolver->run(); Function: run the evolver analysis using the default or updated parameters the alignment parameter must have been set Returns : Return code, Bio::Tools::Phylo::PAML Args : Bio::Align::AlignI object, Bio::Tree::TreeI object [optional] |
Title : error_string Usage : $obj->error_string($newval) Function: Where the output from the last analysis run is stored. Returns : value of error_string Args : newvalue (optional) |
Title : tree Usage : $evolver->tree($tree, %params); Function: Get/Set the Bio::Tree::TreeI object Returns : Bio::Tree::TreeI Args : [optional] $tree => Bio::Tree::TreeI, [optional] %parameters => hash of tree-specific parameters: branchLengths: 0, 1 or 2 out
Comment : We could potentially add support for running directly on a file
but we shall keep it simple
See also: Bio::Tree::Tree |
Title : get_parameters Usage : my %params = $self->get_parameters(); Function: returns the list of parameters as a hash Returns : associative array keyed on parameter names Args : none |
Title : set_parameter Usage : $evolver->set_parameter($param,$val); Function: Sets a evolver parameter, will be validated against the valid values as set in the %VALIDVALUES class variable. The checks can be ignored if one turns off param checks like this: $evolver->no_param_checks(1) Returns : boolean if set was success, if verbose is set to -1 then no warning will be reported Args : $param => name of the parameter $value => value to set the parameter to See also: no_param_checks() |
Title : set_default_parameters Usage : $evolver->set_default_parameters(0); Function: (Re)set the default parameters from the defaults (the first value in each array in the %VALIDVALUES class variable) Returns : none Args : boolean: keep existing parameter values |
Title : no_param_checks Usage : $obj->no_param_checks($newval) Function: Boolean flag as to whether or not we should trust the sanity checks for parameter values Returns : value of no_param_checks Args : newvalue (optional) |
Title : set_CodonFreqs Usage : $obj->set_CodonFreqs($newval) Function: Get/Set the Codon Frequence table Returns : value of set_CodonFreqs (a scalar) Args : on set, new value (a scalar or undef, optional) |
Title : get_CodonFreqs Usage : my @codon_freqs = $evolver->get_CodonFreqs() Function: Get the Codon freqs Returns : Array Args : none |
Title : indel Usage : $obj->indel($newval) Function: this is only useful if using evolver_indel instead of main evolver package: Exploring the Relationship between Sequence Similarity and Accurate Phylogenetic Trees Brandi L. Cantarel, Hilary G. Morrison and William Pearson Example : Returns : value of indel (a scalar) Args : on set, new value (a scalar or undef, optional) |
Methods code
BEGIN {
$MINNAMELEN = 25;
$PROGRAMNAME = 'evolver' . ($^O =~ /mswin/i ?'.exe':'');
if( defined $ENV{'PAMLDIR'} ) {
$PROGRAM = Bio::Root::IO->catfile($ENV{'PAMLDIR'},$PROGRAMNAME). ($^O =~ /mswin/i ?'.exe':'');;
}
my $rand = int(time);
%VALIDVALUES =
(
'outfmt' => [0,1],
'seed' => "$rand",
'nuclsites' => '1000',
'replicates' => '1',
'tree_length' => '1.5',
'omega' => '0.3',
'kappa' => '5',
);
} |
sub program_name
{ return 'evolver'; } |
sub program_dir
{ return Bio::Root::IO->catfile($ENV{PAMLDIR}) if $ENV{PAMLDIR};} |
sub new
{ my($class,@args) = @_;
my $self = $class->SUPER::new(@args);
my ($tree, $st, $params, $exe)
= $self->_rearrange([qw(TREE SAVE_TEMPFILES PARAMS EXECUTABLE)],
@args);
defined $tree && $self->tree($tree);
defined $st && $self->save_tempfiles($st);
defined $exe && $self->executable($exe);
$self->set_default_parameters();
if( defined $params ) {
if( ref($params) !~ /HASH/i ) {
$self->warn("Must provide a valid hash ref for parameter -FLAGS");
} else {
map { $self->set_parameter($_, $$params{$_}) } keys %$params;
}
}
return $self; } |
sub prepare
{ my ($self,$aln,$tree) = @_;
unless ( $self->save_tempfiles ) {
$self->cleanup();
}
$tree = $self->tree unless $tree;
my ($tempdir) = $self->tempdir();
my $evolver_ctl = "$tempdir/MCcodon.dat";
my $evolverfh;
open($evolverfh, ">$evolver_ctl") or $self->throw("cannot open $evolver_ctl for writing");
my %params = $self->get_parameters;
print $evolverfh "$params{outfmt}\n";
print $evolverfh "$params{seed}\n";
my $numseq = scalar($tree->get_leaf_nodes);
print $evolverfh "$numseq ";
print $evolverfh "$params{nuclsites} ";
print $evolverfh "$params{replicates}\n\n";
print $evolverfh "$params{tree_length}\n";
my $treeout = Bio::TreeIO->new
('-format' => 'newick',
'-fh' => $evolverfh,
-PRE =>'>>',
'-flush',
);
$treeout->write_tree($tree);
open($evolverfh, ">>$evolver_ctl") or $self->throw("cannot open $evolver_ctl for writing");
print $evolverfh "\n$params{omega}\n";
print $evolverfh "$params{kappa}\n";
my @codon_freqs = $self->get_CodonFreqs();
foreach my $firstbase (@codon_freqs) {
foreach my $element (@$firstbase) {
print $evolverfh " $element";
}
print $evolverfh "\n";
}
unless (@codon_freqs) {
print $evolverfh
"0.00983798 0.01745548 0.00222048 0.01443315\n",
"0.00844604 0.01498576 0.00190632 0.01239105\n",
"0.01064012 0.01887870 0 0\n",
"0.00469486 0.00833007 0 0.00688776\n",
"0.01592816 0.02826125 0.00359507 0.02336796\n",
"0.01367453 0.02426265 0.00308642 0.02006170\n",
"0.01722686 0.03056552 0.00388819 0.02527326\n",
"0.00760121 0.01348678 0.00171563 0.01115161\n",
"0.01574077 0.02792876 0.00355278 0.02309304\n",
"0.01351366 0.02397721 0.00305010 0.01982568\n",
"0.01702419 0.03020593 0.00384245 0.02497593\n",
"0.00751178 0.01332811 0.00169545 0.01102042\n",
"0.02525082 0.04480239 0.00569924 0.03704508\n",
"0.02167816 0.03846344 0.00489288 0.03180369\n",
"0.02730964 0.04845534 0.00616393 0.04006555\n",
"0.01205015 0.02138052 0.00271978 0.01767859\n";
}
print $evolverfh "\n// end of file.\n";
close($evolverfh);
} |
sub run
{
my $self = shift;
my $rc = (1);
{
my $exit_status;
my ($tmpdir) = $self->tempdir();
chdir($tmpdir);
my $evolverexe = $self->executable();
$self->throw("unable to find or run executable for 'evolver'") unless $evolverexe && -e $evolverexe && -x _;
open(RUN, "$evolverexe 6 MCcodon.dat |") or $self->throw("Cannot open exe $evolverexe");
my @output = <RUN>;
$exit_status = close(RUN);
$self->error_string(join('',@output));
if ( (grep { /\berr(or)?: /io } @output) || !$exit_status) {
$self->warn("There was an error - see error_string for the program output");
$rc = 0;
}
unless ($self->indel) {
my $in = Bio::AlignIO->new('-file' => "$tmpdir/mc.paml",
'-format' => 'phylip');
my $aln = $in->next_aln();
$self->alignment($aln);
}
}
return $rc;} |
sub error_string
{ my ($self,$value) = @_;
if( defined $value) {
$self->{'error_string'} = $value;
}
return $self->{'error_string'};} |
sub alignment
{ my ($self,$aln) = @_;
if( defined $aln ) {
if( -e $aln ) {
$self->{'_alignment'} = $aln;
} elsif( !ref($aln) || ! $aln->isa('Bio::Align::AlignI') ) {
$self->warn("Must specify a valid Bio::Align::AlignI object to the alignment function not $aln");
return undef;
} else {
$self->{'_alignment'} = $aln;
}
}
return $self->{'_alignment'};} |
sub tree
{ my ($self, $tree, %params) = @_;
if( defined $tree ) {
if( ! ref($tree) || ! $tree->isa('Bio::Tree::TreeI') ) {
$self->warn("Must specify a valid Bio::Tree::TreeI object to the alignment function");
}
$self->{'_tree'} = $tree;
}
return $self->{'_tree'};} |
sub get_parameters
{ my ($self) = @_;
return %{ $self->{'_evolverparams'} }; } |
sub set_parameter
{ my ($self,$param,$value) = @_;
unless ($self->{'no_param_checks'} == 1) {
if ( ! defined $VALIDVALUES{$param} ) {
$self->warn("unknown parameter $param will not be set unless you force by setting no_param_checks to true");
return 0;
}
if ( ref( $VALIDVALUES{$param}) =~ /ARRAY/i &&
scalar @{$VALIDVALUES{$param}} > 0 ) {
unless ( grep { $value eq $_ } @{ $VALIDVALUES{$param} } ) {
$self->warn("parameter $param specified value $value is not recognized, please see the documentation and the code for this module or set the no_param_checks to a true value");
return 0;
}
}
}
$self->{'_evolverparams'}->{$param} = $value;
return 1;} |
sub set_default_parameters
{ my ($self,$keepold) = @_;
$keepold = 0 unless defined $keepold;
while( my ($param,$val) = each %VALIDVALUES ) {
next if( defined $self->{'_evolverparams'}->{$param} && $keepold);
if(ref($val)=~/ARRAY/i ) {
$self->{'_evolverparams'}->{$param} = $val->[0];
} else {
$self->{'_evolverparams'}->{$param} = $val;
}
}} |
sub no_param_checks
{ my ($self,$value) = @_;
if( defined $value) {
$self->{'no_param_checks'} = $value;
}
return $self->{'no_param_checks'};} |
sub set_CodonFreqs
{ my $self = shift;
return $self->{'_codonfreqs'} = shift if @_;
return $self->{'_codonfreqs'};} |
sub get_CodonFreqs
{ my ($self) = @_;
return @{$self->{'_codonfreqs'} || []};} |
sub DESTROY
{ my $self= shift;
unless ( $self->save_tempfiles ) {
$self->cleanup();
}
$self->SUPER::DESTROY();} |
sub indel
{ my $self = shift;
return $self->{'indel'} = shift if @_;
return $self->{'indel'};
}
1;} |
General documentation
User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to
the Bioperl mailing list. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/MailList.shtml - About the mailing lists
Please direct usage questions or support issues to the mailing list:
bioperl-l@bioperl.org
rather than to the module maintainer directly. Many experienced and
reponsive experts will be able look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.
Report bugs to the Bioperl bug tracking system to help us keep track
of the bugs and their resolution. Bug reports can be submitted via the
web:
http://bioperl.org/bioperl-bugs/
| AUTHOR - Albert Vilella | Top |
Email avilella-AT-gmail-DOT-com
Additional contributors names and emails here
The rest of the documentation details each of the object methods.
Internal methods are usually preceded with a _
Valid and default values for evolver programs are listed below. The
default values are always the first one listed. These descriptions
are essentially lifted from the example MCcodon.dat file and pamlDOC
documentation provided by the author.
Stub:
option1 specifies something.
option2 specifies something else.
INCOMPLETE DOCUMENTATION OF ALL METHODS
| Bio::Tools::Run::WrapperBase methods | Top |
Title : save_tempfiles
Usage : $obj->save_tempfiles($newval)
Function:
Returns : value of save_tempfiles
Args : newvalue (optional)
Title : outfile_name
Usage : my $outfile = $evolver->outfile_name();
Function: Get/Set the name of the output file for this run
(if you wanted to do something special)
Returns : string
Args : [optional] string to set value to
Title : tempdir
Usage : my $tmpdir = $self->tempdir();
Function: Retrieve a temporary directory name (which is created)
Returns : string which is the name of the temporary directory
Args : none
Title : cleanup
Usage : $evolver->cleanup();
Function: Will cleanup the tempdir directory after a PAML run
Returns : none
Args : none