Bio
RangeI
Summary
Bio::RangeI - Range interface
Package variables
Globals (from "use vars" definitions)
%STRAND_OPTIONS
Included modules
Inherit
Synopsis
#Do not run this module directly
Description
This provides a standard BioPerl range interface that should be
implemented by any object that wants to be treated as a range. This
serves purely as an abstract base class for implementers and can not
be instantiated.
Ranges are modeled as having (start, end, length, strand). They use
Bio-coordinates - all points >= start and <= end are within the
range. End is always greater-than or equal-to start, and length is
greater than or equal to 1. The behaviour of a range is undefined if
ranges with negative numbers or zero are used.
So, in summary:
length = end - start + 1
end >= start
strand = (-1 | 0 | +1)
Methods
Methods description
Title : start Usage : $start = $range->start(); Function: get/set the start of this range Returns : the start of this range Args : optionally allows the start to be set using $range->start($start) |
Title : end Usage : $end = $range->end(); Function: get/set the end of this range Returns : the end of this range Args : optionally allows the end to be set using $range->end($end) |
Title : length Usage : $length = $range->length(); Function: get/set the length of this range Returns : the length of this range Args : optionally allows the length to be set using $range->length($length) |
Title : strand Usage : $strand = $range->strand(); Function: get/set the strand of this range Returns : the strandedness (-1, 0, +1) Args : optionally allows the strand to be set using $range->strand($strand) |
Title : overlaps Usage : if($r1->overlaps($r2)) { do stuff } Function: tests if $r2 overlaps $r1 Args : arg #1 = a range to compare this one to (mandatory) arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore') Returns : true if the ranges overlap, false otherwise |
Title : contains Usage : if($r1->contains($r2) { do stuff } Function: tests whether $r1 totally contains $r2 Args : arg #1 = a range to compare this one to (mandatory) alternatively, integer scalar to test arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore') Returns : true if the argument is totally contained within this range |
Title : equals Usage : if($r1->equals($r2)) Function: test whether $r1 has the same start, end, length as $r2 Args : arg #1 = a range to compare this one to (mandatory) arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore') Returns : true if they are describing the same range |
Title : intersection Usage : ($start, $stop, $strand) = $r1->intersection($r2); OR ($start, $stop, $strand) = Bio::Range->intersection(\@ranges); OR my $containing_range = $r1->intersection($r2); OR my $containing_range = Bio::Range->intersection(\@ranges); Function: gives the range that is contained by all ranges Returns : undef if they do not overlap, or the range that they do overlap (in the form of an object like the calling one, OR a three element array) Args : arg #1 = [REQUIRED] a range to compare this one to, or an array ref of ranges arg #2 = optional strand-testing arg ('strong', 'weak', 'ignore') |
Title : union Usage : ($start, $stop, $strand) = $r1->union($r2); : ($start, $stop, $strand) = Bio::Range->union(@ranges); my $newrange = Bio::Range->union(@ranges); Function: finds the minimal Range that contains all of the Ranges Args : a Range or list of Range objects Returns : the range containing all of the range (in the form of an object like the calling one, OR a three element array) |
Title : overlap_extent Usage : ($a_unique,$common,$b_unique) = $a->overlap_extent($b) Function: Provides actual amount of overlap between two different ranges Example : Returns : array of values containing the length unique to the calling range, the length common to both, and the length unique to the argument range Args : a range |
Title : disconnected_ranges Usage : my @disc_ranges = Bio::Range->disconnected_ranges(@ranges); Function: finds the minimal set of ranges such that each input range is fully contained by at least one output range, and none of the output ranges overlap Args : a list of ranges Returns : a list of objects of the same type as the input (conforms to RangeI) |
Methods code
BEGIN { %STRAND_OPTIONS = map { $_, '_' . $_ }
(
'strong', 'weak', 'ignore', ); } |
sub _strong
{ my ($r1, $r2) = @_;
my ($s1, $s2) = ($r1->strand(), $r2->strand());
return 1 if $s1 != 0 && $s1 == $s2;} |
sub _weak
{ my ($r1, $r2) = @_;
my ($s1, $s2) = ($r1->strand(), $r2->strand());
return 1 if $s1 == 0 || $s2 == 0 || $s1 == $s2;} |
sub _testStrand()
{ my ($r1, $r2, $comp) = @_;
return 1 unless $comp;
my $func = $STRAND_OPTIONS{$comp};
return $r1->$func($r2);} |
sub start
{ shift->throw_not_implemented(); } |
sub end
{ shift->throw_not_implemented(); } |
sub length
{ shift->throw_not_implemented(); } |
sub strand
{ shift->throw_not_implemented(); } |
sub overlaps
{ my ($self, $other, $so) = @_;
$self->throw("start is undefined") unless defined $self->start;
$self->throw("end is undefined") unless defined $self->end;
$self->throw("not a Bio::RangeI object") unless defined $other &&
$other->isa('Bio::RangeI');
$other->throw("start is undefined") unless defined $other->start;
$other->throw("end is undefined") unless defined $other->end;
return
($self->_testStrand($other, $so)
and not (
($self->start() > $other->end() or
$self->end() < $other->start() )
));} |
sub contains
{ my ($self, $other, $so) = @_;
$self->throw("start is undefined") unless defined $self->start;
$self->throw("end is undefined") unless defined $self->end;
if(defined $other && ref $other) { $other->throw("Not a Bio::RangeI object: $other") unless $other->isa('Bio::RangeI');
$other->throw("start is undefined") unless defined $other->start;
$other->throw("end is undefined") unless defined $other->end;
return ($self->_testStrand($other, $so) and
$other->start() >= $self->start() and
$other->end() <= $self->end());
} else { $self->throw("'$other' is not an integer.\n") unless $other =~ /^[-+]?\d+$/;
return ($other >= $self->start() and $other <= $self->end());
}} |
sub equals
{ my ($self, $other, $so) = @_;
$self->throw("start is undefined") unless defined $self->start;
$self->throw("end is undefined") unless defined $self->end;
$other->throw("Not a Bio::RangeI object") unless $other->isa('Bio::RangeI');
$other->throw("start is undefined") unless defined $other->start;
$other->throw("end is undefined") unless defined $other->end;
return ($self->_testStrand($other, $so) and
$self->start() == $other->start() and
$self->end() == $other->end() );} |
sub intersection
{ my ($self, $given, $so) = @_;
$self->throw("missing arg: you need to pass in another feature") unless $given;
my @ranges;
if ($self eq "Bio::RangeI") {
$self = "Bio::Range";
$self->warn("calling static methods of an interface is deprecated; use $self instead");
}
if (ref $self) {
push(@ranges, $self);
}
ref($given) eq 'ARRAY' ? push(@ranges, @{$given}) : push(@ranges, $given);
$self->throw("Need at least 2 ranges") unless @ranges >= 2;
my $intersect;
while (@ranges > 0) {
unless ($intersect) {
$intersect = shift(@ranges);
$self->throw("Not an object: $intersect") unless ref($intersect);
$self->throw("Not a Bio::RangeI object: $intersect") unless $intersect->isa('Bio::RangeI');
$self->throw("start is undefined") unless defined $intersect->start;
$self->throw("end is undefined") unless defined $intersect->end;
}
my $compare = shift(@ranges);
$self->throw("Not an object: $compare") unless ref($compare);
$self->throw("Not a Bio::RangeI object: $compare") unless $compare->isa('Bio::RangeI');
$self->throw("start is undefined") unless defined $compare->start;
$self->throw("end is undefined") unless defined $compare->end;
return unless $compare->_testStrand($intersect, $so);
my @starts = sort {$a <=> $b} ($intersect->start(), $compare->start());
my @ends = sort {$a <=> $b} ($intersect->end(), $compare->end());
my $start = pop @starts; my $end = shift @ends;
my $intersect_strand; if (defined($intersect->strand) && defined($compare->strand) && $intersect->strand == $compare->strand) {
$intersect_strand = $compare->strand;
}
else {
$intersect_strand = 0;
}
if ($start > $end) {
return;
}
else {
$intersect = $self->new(-start => $start,
-end => $end,
-strand => $intersect_strand);
}
}
if (wantarray()) {
return ($intersect->start, $intersect->end, $intersect->strand);
}
else {
return $intersect;
}} |
sub union
{ my $self = shift;
my @ranges = @_;
if ($self eq "Bio::RangeI") {
$self = "Bio::Range";
$self->warn("calling static methods of an interface is deprecated; use $self instead");
}
if(ref $self) {
unshift @ranges, $self;
}
my @start = sort {$a<=>$b}
map( { $_->start() } @ranges);
my @end = sort {$a<=>$b}
map( { $_->end() } @ranges);
my $start = shift @start;
while( !defined $start ) {
$start = shift @start;
}
my $end = pop @end;
my $union_strand;
foreach(@ranges) {
if(! defined $union_strand) {
$union_strand = $_->strand;
next;
} else {
if(not defined $_->strand or $union_strand ne $_->strand) {
$union_strand = 0;
last;
}
}
}
return unless $start or $end;
if( wantarray() ) {
return ( $start,$end,$union_strand);
} else {
return $self->new('-start' => $start,
'-end' => $end,
'-strand' => $union_strand
);
}} |
sub overlap_extent
{ my ($a,$b) = @_;
$a->throw("start is undefined") unless defined $a->start;
$a->throw("end is undefined") unless defined $a->end;
$b->throw("Not a Bio::RangeI object") unless $b->isa('Bio::RangeI');
$b->throw("start is undefined") unless defined $b->start;
$b->throw("end is undefined") unless defined $b->end;
if( ! $a->overlaps($b) ) {
return ($a->length,0,$b->length);
}
my ($au,$bu) = (0, 0);
if( $a->start < $b->start ) {
$au = $b->start - $a->start;
} else {
$bu = $a->start - $b->start;
}
if( $a->end > $b->end ) {
$au += $a->end - $b->end;
} else {
$bu += $b->end - $a->end;
}
my $intersect = $a->intersection($b);
my $ie = $intersect->end;
my $is = $intersect->start;
return ($au,$ie-$is+1,$bu);} |
sub disconnected_ranges
{ my $self = shift;
if ($self eq "Bio::RangeI") {
$self = "Bio::Range";
$self->warn("calling static methods of an interface is deprecated; use $self instead");
}
my @inranges = @_;
if(ref $self) {
unshift @inranges, $self;
}
my @outranges = ();
foreach my $inrange (@inranges) {
my $intersects = 0;
my @outranges_new = ();
my @intersecting_ranges = ();
for (my $i=0; $i<@outranges; $i++) {
my $outrange = $outranges[$i];
my $intersection = $inrange->intersection($outrange);
if ($intersection) {
$intersects = 1;
my $union = $inrange->union($outrange);
push(@intersecting_ranges, $union);
}
else {
push(@outranges_new, $outrange);
}
}
@outranges = @outranges_new;
if (@intersecting_ranges) {
if (@intersecting_ranges > 1) {
my $merged_range =
$self->union(@intersecting_ranges);
push(@outranges, $merged_range);
}
else {
push(@outranges, @intersecting_ranges);
}
}
else {
push(@outranges,
$self->new('-start'=>$inrange->start,
'-end'=>$inrange->end,
'-strand'=>$inrange->strand,
));
}
}
return @outranges;} |
General documentation
User feedback is an integral part of the evolution of this and other
Bioperl modules. Send your comments and suggestions preferably to one
of the Bioperl mailing lists. Your participation is much appreciated.
bioperl-l@bioperl.org - General discussion
http://bioperl.org/wiki/Mailing_lists - About the mailing lists
Report bugs to the Bioperl bug tracking system to help us keep track
the bugs and their resolution. Bug reports can be submitted via the
web:
http://bugzilla.bioperl.org/
| AUTHOR - Heikki Lehvaslaiho | Top |
Email: heikki-at-bioperl-dot-org
The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _
These methods must be implemented in all subclasses.
These methods return true or false. They throw an error if start and
end are not defined.
$range->overlaps($otherRange) && print "Ranges overlap\n";
These methods do things to the geometry of ranges, and return
Bio::RangeI compliant objects or triplets (start, stop, strand) from
which new ranges could be built.