Bio::Tools::EUtilities
EUtilParameters
Toolbar
Summary
Bio::Tools::EUtilities::EUtilParameters - Manipulation of NCBI eutil-based
parameters for remote database requests.
Package variables
Privates (from "my" definitions)
$HOSTBASE = 'http://eutils.ncbi.nlm.nih.gov/entrez/eutils/'
%NCBI_DATABASE = ( 'protein' => 'text', 'nucleotide' => 'text', 'nuccore' => 'text', 'nucgss' => 'text', 'nucest' => 'text', 'structure' => 'text', 'genome' => 'text', 'gene' => 'asn1', 'journals' => 'text', )
%MODE = ( 'einfo' => { 'mode' => ['GET'], 'location' => 'einfo.fcgi', 'params' => [qw(db tool email)], }, 'epost' => { 'mode' => ['POST','GET'], 'location' => 'epost.fcgi', 'params' => [qw(db retmode id tool email WebEnv query_key)], }, 'efetch' => { 'mode' => ['GET','POST'], 'location' => 'efetch.fcgi', 'params' => [qw(db retmode id retmax retstart rettype strand seq_start seq_stop complexity report tool email WebEnv query_key)], }, 'esearch' => { 'mode' => ['GET','POST'], 'location' => 'esearch.fcgi', 'params' => [qw(db retmode usehistory term field reldate mindate maxdate datetype retmax retstart rettype sort tool email WebEnv query_key)], }, 'esummary' => { 'mode' => ['GET','POST'], 'location' => 'esummary.fcgi', 'params' => [qw(db retmode id retmax retstart rettype tool email WebEnv query_key)], }, 'elink' => { 'mode' => ['GET','POST'], 'location' => 'elink.fcgi', 'params' => [qw(db retmode id reldate mindate maxdate datetype term dbfrom holding cmd version tool email linkname WebEnv query_key)], }, 'egquery' => { 'mode' => ['GET','POST'], 'location' => 'egquery.fcgi', 'params' => [qw(term retmode tool email)], }, 'espell' => { 'mode' => ['GET','POST'], 'location' => 'espell.fcgi', 'params' => [qw(db retmode term tool email )], })
@PARAMS;
Included modules
Inherit
Synopsis
# Bio::Tools::EUtilities::EUtilParameters implements Bio::ParameterBaseI
my @params = (-eutil => 'efetch',
db => 'nucleotide',
id => \@ids,
email => 'me@foo.bar',
retmode => 'xml');
my $p = Bio::Tools::EUtilities::EUtilParameters->new(@params);
if ($p->parameters_changed) {
# ...
} # state information
$p->set_parameters(@extra_params); # set new NCBI parameters, leaves others preset
$p->reset_parameters(@new_params); # reset NCBI parameters to original state
$p->to_string(); # get a URI-encoded string representation of the URL address
$p->to_request(); # get an HTTP::Request object (to pass on to LWP::UserAgent)
Description
Bio::Tools::EUtilities::EUtilParameters is-a Bio::ParameterBaseI implementation
that allows simple manipulation of NCBI eutil parameters for CGI-based queries.
SOAP-based methods may be added in the future.
For simplicity parameters do not require dashes when passed and do not need URI
encoding (spaces are converted to '+', symbols encoded, etc). Also, the
following extra parameters can be passed to the new() constructor or via
set_parameters() or reset_parameters():
eutil - the eutil to be used. The default is 'efetch' if not set.
correspondence - Flag for how IDs are treated. Default is undef (none).
history - a Bio::Tools::EUtilities::HistoryI object. Default is undef (none).
At this point minimal checking is done for potential errors in parameter
passing, though these should be easily added in the future when necessary.
Methods
Methods description
Title : set_parameters Usage : $pobj->set_parameters(@params); Function: sets the NCBI parameters listed in the hash or array Returns : None Args : [optional] hash or array of parameter/values. Note : This sets any parameter passed but leaves previously set data alone. In addition to regular eutil-specific parameters, you can set the following:
-eutil - the eUtil to be used (default 'efetch')
-history - pass a HistoryI-implementing object, which
sets the WebEnv, query_key, and possibly db and linkname
(the latter two only for LinkSets)
-correspondence - Boolean flag, set to TRUE or FALSE; indicates how
IDs are to be added together for elink request where
ID correspondence might be needed
(default 0) |
Title : reset_parameters Usage : resets values Function: resets parameters to either undef or value in passed hash Returns : none Args : [optional] hash of parameter-value pairs Note : This sets any parameter passed, but resets all others (deletes them). In addition to regular eutil-specific parameters, you can set the following:
-eutil - the eUtil to be used (default 'efetch')
-history - pass a HistoryI-implementing object, which
sets the WebEnv, query_key, and possibly db and linkname
(the latter two only for LinkSets)
-correspondence - Boolean flag, set to TRUE or FALSE; indicates how
IDs are to be added together for elink request where
ID correspondence might be needed
(default 0) |
Title : carryover Usage : $obj->carryover(qw(email tool db)) Function : Carries over the designated parameters when using reset_parameters() Returns : a list of carried-over parameters Args : An array reference of parameters to carry over, followed optionally by the mode ('add' or 'delete', indicating whether to append to or remove the specified values passed in). To clear all values, pass in an empty array reference (the mode in this case doesn't matter). In addition to the normal eUtil-specific parameters, the following additional parameters are allowed: -eutil - the eUtil to be used (default 'efetch') -history - pass a HistoryI-implementing object, which sets the WebEnv, query_key, and possibly db and linkname (the latter two only for LinkSets) -correspondence - Boolean flag, set to TRUE or FALSE; indicates how IDs are to be added together for elink request where ID correspondence might be needed (default 0) Default : None (no carried over parameters) Status : NYI (dev in progress, carry on, nothing to see here) |
Title : request_mode Usage : $obj->request_mode Function : get/set the mode for the user agent to use for generating a request Returns : either a preset mode (checked against the eutil) or a best-possible option based upon the currently-set parameters Args : Status : |
Title : parameters_changed Usage : if ($pobj->parameters_changed) {...} Function: Returns TRUE if parameters have changed Returns : Boolean (0 or 1) Args : [optional] Boolean |
Title : available_parameters Usage : @params = $pobj->available_parameters() Function: Returns a list of the available parameters Returns : Array of available parameters (no values) Args : [optional] A string with the eutil name (for returning eutil-specific parameters) |
Title : get_parameters Usage : @params = $pobj->get_parameters; %params = $pobj->get_parameters; Function: Returns list of key/value pairs, parameter => value Returns : Flattened list of key-value pairs. All key-value pairs returned, though subsets can be returned based on the '-type' parameter. Data originally set as an array ref are returned based on whether the '-join_id' flag is set (default is the same array ref). Args : -type : the eutil name (Default: returns all). Use of '-list' supercedes this -list : array ref of specific parameters -join_ids : Boolean; join IDs based on correspondence (Default: no join) |
Title : to_string Usage : $string = $pobj->to_string; Function: Returns string (URL only in this case) Returns : String (URL only for now) Args : [optional] 'all'; build URI::http using all parameters Default : Builds based on allowed parameters (presence of history data or eutil type in %MODE). Note : Changes state of object. Absolute string |
Title : to_request Usage : $uri = $pobj->to_request; Function: Returns HTTP::Request object Returns : HTTP::Request Args : [optional] 'all'; builds request using all parameters Default : Builds based on allowed parameters (presence of history data or eutil type in %MODE). Note : Changes state of object (to boolean FALSE). Used for CGI-based GET/POST TODO : esearch, esummary, elink now accept POST for batch submission (something NCBI apparently allowed but didn't advertise). Should we switch most of these to utilize POST instead, or make it dep on the number of submitted IDs? |
Title : eutil Usage : $p->eutil('efetch') Function: gets/sets the eutil for this set of parameters Returns : string (eutil) Args : [optional] string (eutil) Throws : '$eutil not supported' if eutil not present Note : This does not reset retmode to the default if called directly. |
Title : history Usage : $p->history($history); Function: gets/sets the history object to be used for these parameters Returns : Bio::Tools::EUtilities::HistoryI (if set) Args : [optional] Bio::Tools::EUtilities::HistoryI Throws : Passed something other than a Bio::Tools::EUtilities::HistoryI Note : This overrides WebEnv() and query_key() settings when set. This caches the last history object passed and returns like a Get/Set |
Title : correspondence Usage : $p->correspondence(1); Function: Sets flag for posting IDs for one-to-one correspondence Returns : Boolean Args : [optional] boolean value |
Title : id_file Usage : $p->id_file('<foo'); Function: convenience method; passes in file containing a list of IDs for searches (one per line), sets id() to list Returns : none Args : either string indicating file to use, a file handle, or an IO::Handle object Note : use of this overrides concurrent use of the '-id' parameter when both are passed. The filename is not retained, merely parsed for IDs. |
Title : url_base_address Usage : $address = $p->url_base_address(); Function: Get URL base address Returns : String Args : None in this implementation; the URL is fixed |
Title : set_default_retmode Usage : $p->set_default_retmode(); Function: sets retmode to default value specified by the eutil() and the value in %NCBI_DATABASE (for efetch only) if called Returns : none Args : none |
Methods code
BEGIN { @PARAMS = qw(db id email retmode rettype usehistory term field tool
reldate mindate maxdate datetype retstart retmax sort seq_start seq_stop
strand complexity report dbfrom cmd holding version linkname WebEnv
query_key);
for my $method (@PARAMS) {
eval <<END; sub $method { my (\$self, \$val) = \@_; if (defined \$val) { if ((!defined \$self->{'_$method'}) || (defined \$self->{'_$method'} && \$self->{'_$method'} ne \$val)) { \$self->{'_statechange'} = 1; \$self->{'_$method'} = \$val; } } return \$self->{'_$method'}; } END
} |
sub new
{ my ($class, @args) = @_;
my $self = $class->SUPER::new(@args);
my ($retmode) = $self->_rearrange(["RETMODE"],@args);
$self->_set_from_args(\@args,
-methods => [@PARAMS, qw(eutil history correspondence id_file request_mode)]);
$self->eutil() || $self->eutil('efetch');
$self->tool() || $self->tool('BioPerl');
$self->set_default_retmode if (!$retmode);
$self->{'_statechange'} = 1;
return $self;} |
sub set_parameters
{ my ($self, @args) = @_;
my ($newmode,$file) = $self->_rearrange([qw(RETMODE ID_FILE)],@args);
$self->_set_from_args(\@args, -methods => [@PARAMS, qw(eutil correspondence history)]);
$self->set_default_retmode unless $newmode;
$file && $self->id_file($file);
return;} |
sub reset_parameters
{ my ($self, @args) = @_;
my ($retmode,$file) = $self->_rearrange([qw(RETMODE ID_FILE)],@args);
map { defined $self->{"_$_"} && undef $self->{"_$_"} } (@PARAMS, qw(eutil correspondence history_cache request_cache));
$self->_set_from_args(\@args, -methods => [@PARAMS, qw(eutil correspondence history)]);
$self->eutil() || $self->eutil('efetch');
$self->set_default_retmode unless $retmode;
$file && $self->id_file($file);
$self->{'_statechange'} = 1;} |
sub carryover
{ my ($self, $params, $mode) = @_;
my %allowed = map {$_ => 1} (@PARAMS, qw(eutil history correspondence));
if ($params) {
$self->throw("Must pass in an array ref of parameters") unless
ref($params) eq 'ARRAY';
my $mode ||= 'add';
$self->throw("Mode must be 'add' or 'delete'") unless $mode eq 'add' || $mode eq 'delete';
if (!scalar(@$params)) { $self->{_carryover} = {};
} else {
for my $p (@$params) {
if (!exists $allowed{$p}) {
$self->warn("$p is not a recognized eutil parameter");
next;
}
if ($mode eq 'add') {
$self->{_carryover}->{$p} = 1;
} else {
delete $self->{_carryover}->{$p} if exists
$self->{_carryover}->{$p};
}
}
}
}
sort keys %{$self->{_carryover}} || ();} |
sub _reset_except_carryover
{ my $self = shift;
} |
sub request_mode
{ my ($self, $mode) = @_;
$mode = uc $mode if defined $mode;
my $eutil = $self->eutil;
if ($mode) {
my %valid = map {$_ => 1} @{$MODE{$eutil}{mode}};
$self->throw("Mode $mode not supported for $eutil") unless
exists $valid{$mode};
$self->{_request_mode} = $mode;
}
return $self->{_request_mode} if $self->{_request_mode};
if (scalar(@{$MODE{$eutil}{mode}}) > 1) { my ($id, $term) = ($self->id || [], $self->term || '');
if (ref $id eq 'ARRAY' && scalar(@$id) > 200 || CORE::length($term) > 300) {
return 'POST'
}
}
$MODE{$eutil}{mode}[0];
} |
sub parameters_changed
{ my ($self) = @_;
$self->{'_statechange'};} |
sub available_parameters
{ my ($self, $type) = @_;
$type ||= 'all';
if ($type eq 'all') {
return @PARAMS;
} else {
$self->throw("$type parameters not supported") if !exists $MODE{$type};
return @{$MODE{$type}->{params}};
}} |
sub get_parameters
{ my ($self, @args) = @_;
my ($type, $list, $join) = $self->_rearrange([qw(TYPE LIST JOIN_IDS)], @args);
$self->throw("Parameter list not an array ref") if $list && ref $list ne 'ARRAY';
$type ||= '';
my @final = $list ? grep {$self->can($_)} @{$list} : $self->available_parameters($type);
my @p;
for my $param (@final) {
if ($param eq 'id' && $self->id && $join) {
my $id = $self->id;
if ($self->correspondence && $self->eutil eq 'elink') {
for my $id_group (@{ $id }) {
if (ref($id_group) eq 'ARRAY') {
push @p, ('id' => join(q(,), @{ $id_group }));
}
elsif (!ref($id_group)) {
push @p, ('id' => $id_group);
}
else {
$self->throw("Unknown ID type: $id_group");
}
}
} else {
push @p, ref $id eq 'ARRAY' ?
($param => join(',', grep {defined($_)} @{ $id })):
($param => $id);
}
}
elsif ($param eq 'db' && $self->db && $join) {
my $db = $self->db;
push @p, (ref $db eq 'ARRAY') ?
($param => join(',', @{ $db })) :
($param => $db) ;
}
else {
push @p, ($param => $self->{"_$param"}) if defined $self->{"_$param"};
}
}
return @p;} |
sub to_string
{ my ($self, @args) = @_;
if ($self->parameters_changed || !defined $self->{'_string_cache'}) {
my $string = $self->to_request(@args)->uri->as_string;
$self->{'_statechange'} = 0;
$self->{'_string_cache'} = $string;
}
return $self->{'_string_cache'};} |
sub to_request
{ my ($self, $type) = @_;
if ($self->parameters_changed || !defined $self->{'_request_cache'}) {
my $eutil = $self->eutil;
$self->throw("No eutil set") if !$eutil;
$type ||= $eutil;
my ($location, $mode) = ($MODE{$eutil}->{location}, $self->request_mode);
my $request;
my $uri = URI->new($self->url_base_address . $location);
if ($mode eq 'GET') {
$uri->query_form($self->get_parameters(-type => $type, -join_ids => 1) );
$request = HTTP::Request->new($mode => $uri);
$self->{'_request_cache'} = $request;
} elsif ($mode eq 'POST') {
$request = HTTP::Request->new($mode => $uri->as_string);
$uri->query_form($self->get_parameters(-type => $type, -join_ids => 1) );
$request->content_type('application/x-www-form-urlencoded');
$request->content($uri->query);
$self->{'_request_cache'} = $request;
} else {
$self->throw("Unrecognized request mode: $mode");
}
$self->{'_statechange'} = 0;
$self->{'_request_cache'} = $request;
}
return $self->{'_request_cache'};} |
sub eutil
{ my ($self, $eutil) = @_;
if ($eutil) {
$self->throw("$eutil not supported") if !exists $MODE{$eutil};
if (!defined $self->{'_eutil'} || ($self->{'_eutil'} && $self->{'_eutil'} ne $eutil)) {
$self->{'_eutil'} = $eutil;
$self->{'_statechange'} = 1;
}
}
return $self->{'_eutil'};} |
sub history
{ my ($self, $history) = @_;
if ($history) {
$self->throw('Not a Bio::Tools::EUtilities::HistoryI object!') if
!$history->isa('Bio::Tools::EUtilities::HistoryI');
my ($webenv, $qkey) = $history->history;
$self->WebEnv($webenv);
$self->query_key($qkey);
$self->{'_statechange'} = 1;
$self->{'_history_cache'} = $history;
}
return $self->{'_history_cache'};} |
sub correspondence
{ my ($self, $corr) = @_;
if (defined $corr) {
$self->{'_correspondence'} = $corr;
$self->{'_statechange'} = 1;
}
return $self->{'_correspondence'};} |
sub id_file
{ my ($self, $file) = @_;
if ($file) {
my $io = $self->_io;
$io->_initialize_io(-input => $file);
my @ids;
while (my $line = $io->_readline) {
chomp $line;
push @ids, $line;
}
$self->_io->close;
$self->id(\@ids);
}} |
sub url_base_address
{ my ($self, $address) = @_;
return $HOSTBASE;
}} |
sub set_default_retmode
{ my $self = shift;
if ($self->eutil eq 'efetch') {
my $db = $self->db || return; my $mode = exists $NCBI_DATABASE{$db} ? $NCBI_DATABASE{$db} : 'xml';
$self->retmode($mode);
} else {
$self->retmode('xml');
}
}} |
sub _io
{ my $self = shift;
if (!defined $self->{'_io'}) {
$self->{'_io'} = Bio::Root::IO->new();
}
return $self->{'_io'};
}
1;} |
General documentation
User feedback is an integral part of the
evolution of this and other Bioperl modules. Send
your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation
is much appreciated.
bioperl-l@lists.open-bio.org - General discussion
http://www.bioperl.org/wiki/Mailing_lists - About the mailing lists
Please direct usage questions or support issues to the mailing list:
bioperl-l@bioperl.org
rather than to the module maintainer directly. Many experienced and
reponsive experts will be able look at the problem and quickly
address it. Please include a thorough description of the problem
with code and data examples if at all possible.
Report bugs to the Bioperl bug tracking system to
help us keep track the bugs and their resolution.
Bug reports can be submitted via the web.
https://redmine.open-bio.org/projects/bioperl/
Email cjfields at bioperl dot org
The rest of the documentation details each of the
object methods. Internal methods are usually
preceded with a _
| Bio::ParameterBaseI implemented methods | Top |
| Implementation-specific to_* methods | Top |
| Implementation specific-methods | Top |