Multiple and predicate dispatching
Single, dynamic dispatching is used by OOP languages to call a designated method based on the runtime type of an object currently at a reference. When the language also features static (compile time) type checking the object's actual type will generally be either the nominal type of the reference or a subclass of that type. In either case, if the object does not have a method matching the static signature of the called method a compile time or runtime error results. For applications written using so called managed-code an exception is thrown at runtime. Using native code languages that do not provide this level of implicit type checking during execution, the error may vary from the method operating on spurious data to uncontrolled execution and possible program termination by the operating system.
In a language that features static type checking and whatever the actual mechanism involved, we can model single dispatching by giving all objects a reference to a table of methods for their class. The compiler stores the table offset of any particular method and when a call against a reference nominally to an object of a particular class is encountered, code is generated to retrieve the entry point for the method from the table. In the model a base class and each subclass has its own table - perhaps not the case as implemented. Overriding a method in a subclass the base-class method is replaced by the subclass method in the subclass' table. If a nominal superclass reference cites a subclass object at runtime, the subclass overridden method entry point is got from the table and this method is executed.
Multiple dispatch extends this method selection mechanism to include the runtime type of method arguments. Extending the model: single dispatching in a statically type-checked environment requires a single method selection table with one dimension. Without static checking an additional dimension is needed related to runtime type. Multiple dispatch requires a table with as many additional dimensions as there are arguments significant to the selection process. The table can be implemented as one per class or one per base class and there can be a table for each multi-method or a single combined table with other tables referenced by its elements.
To prove the concept we will look at implementing a single-method table for a method with a single significant argument in addition to object type. This implementation can be extended to include additional multi-methods by introducing additional tables.
Predicate dispatch extends the multiple dispatch concept by allowing the runtime values of arguments and logical relationships between types and/or values to be significant for method selection. In OOP languages the multiple dispatch and predicate concepts are quite blurred. A reference can cite an object of its nominal class or a subtype and the value of a reference is the object it refers to, one value of which is its type. Ability to include logical selection statements that deal with type and/or other values appears to be the distinguishing factor.
Implementing multiple dispatch
There are a number of possibilities. To do multiple dispatch according to the book we are going to have to use the java.lang.reflect package. If you haven't tried any of this stuff already to get an idea of what it entails you can Google 'Java Reflection' and have a look at some of Sun's postings.
For a more low level view Dennis Sosnoski's articles on IBM developerWorks go down into the world of bytecode and beyond, up to replacing reflection with runtime code generation.
As Dennis shows, using standard reflection is slow, very slow. Reflection may have a use for class initialization but what we are going to look at here is somthing far simpler and more akin to a native code approach - a DIY method table. A great advantage of implementing the table in Java is that it is pretty well guaranteed to be robust but there a number of difficulties to be overcome - methods not being a first-class data type for one.
The binary method problem
A binary method is a method that operates on two or more objects of the same type. It has at least two arguments, perhaps one implicit - the reference to the current instance, the other the same type or to a related class. Suppose we write a base class with a binary method, how can we allow for this method being overridden in a way that works for instances of any subclass including those that have not yet been written and without changing code in the base class?
Using instanceof and typecasting is one way, discussed briefly later, but what is needed is something more akin to overriding methods but related to argument types as well as the instance type. So, how can you override a method substituting covariant arguments that can be used straight off with no typecasts? Without too much difficulty it turns out.
In their paper presenting MultiJava, a Java extension supporting open classes and multiple dispatch, the authors cite the example of a Shape class with subclasses Rectangle, Circle etc. Shape has a binary method intersect(Shape) that returns true if two shapes overlap. I am going to nick this example in a slightly modified form to show how we can implement multiple dispatch in bog standard code. The methodology is kept simple and easy to explain. Perhaps a more generalized solution could be posted later but currently it is worth noting that open classes and multiple dispatch are strongly related to not implementing the Visitor design pattern, enough said!
Using double dispatch
First let's firm up what we are trying to achieve. In Shape nothing is known about rules for checking subclass intersection but in the first subclass, Rectangle say, we can write a method intersect(Rectangle).
Using single dispatching we would override intersect(Shape) from the base class:
@Override public boolean intersect(Shape argument) {
if (argument instanceof Rectangle)
return intersect((Rectangle)argument);
return argument.intersect(this);
}
In fact this is programmed double dispatching. If the current class does not have a method to compute intersection with the actual type of the argument, argument.intersect(this) is called to get the result: the method called depends on the argument type. Under this scheme each new subclass includes methods to compute intersection with an instance of itself and all the types that have already been written.This approach is fine: new shapes can be added as required without recoding any existing shapes and there is no problem with the kind of objections often made to using instanceof selection. The only drawback is that if a class does not implement a method to handle intersection for all the other classes written to date, after an instance of one of those other classes calls its intersect() the subsequent call to argument.intersect(this) gets unterminated recursion and stack overflow. An unavoidable fact of life, but things do not look quite so good when we get up to ten or twelve shapes:
@Override public boolean intersect(Shape argument) {
if (argument instanceof Rectangle)
return intersect((Rectangle)argument);
if (argument instanceof Square)
return intersect((Square)argument);
if (argument instanceof Circle)
return intersect((Circle)argument);
if (argument instanceof Triangle)
return intersect((Triangle)argument);
//..........................
return argument.intersect(this);
}
If this sort of thing is considered an 'anathema to Object Oriented Design' for unary methods why should it be OK for binary methods?Apart from the call to argument.intersect(this) there is really no element of multiple dispatching in the intersect(Shape) method. Case selection using instanceof or even a more efficient isA construct [here] does not qualify. Multiple dispatching supplies immediate direction of execution to an appropriate method based on argument type. What is needed is a way of calling the required intersect(type) method directly.
Initially we need to map type to the required offset in a table of methods and the least complicated way of doing it is to provide each class with an a property mtOffset() : method table offset - the offset of references to methods with an argument of that class' type in one or more method tables.
The Shape base class
Let's see how this works out in the abstract base class Shape:
public abstract class Shape { public static final int MAX_SHAPES = 10; protected static int classMTOffset; protected static int getOffset() { return classMTOffset++; } protected abstract int mtOffset();For simplicity the maximum number of shape classes that can be written is the constant MAX_SHAPES. It looks probable that the only way to make the size of the multi-method table fit runtime requirements exactly is to give it a nominal initial size and then expand it if required. For the moment we skip this complication. The method getOffset() is called by each subclass in turn when the class is initialized. getOffset() returns the value of the classMTOffset variable which is also incremented. Subclasses then implement the abstract mtOffset() method to return the value got during initialization.
Now we come to the base type for intersect() methods:
protected static abstract class Intersect <T extends Shape, S extends Shape> { abstract boolean intersect(T a, S b) ; // abstract boolean contains(T a, S b, int tolerance); // possibly }No first-class method type in Java so methods are implemented in nested classes. I am going to call these types Method Classes. There are pros and cons. A big pro is that any number of methods can be defined in a single class. An intersect(Shape) multimethod only requires a single sub-method but we could include a method contains(T, S) returning true if the S type shape fits entirely within T type shape for example, or similar overloaded methods with additional arguments, a tolerance factor maybe.
The characteristic shared by all intersect(T a, S b) methods as overridden in Intersect subclasses is that they all have two arguments that are covariant with Shape. Argument type is constrained by the declaration of the generic parameters to be a Shape or a subclass. Constraint is optional but shows the intention. IntersectRectangle for example is declared as:
protected static class IntersectRectangle
extends Intersect<Rectangle, Rectangle> {
@Override public boolean intersect(Rectangle t, Rectangle r) {
//..............
}
}
The important factor is that declaring Intersect as a generic class allows intersect(T, S) to be overridden in this way. If for example, we write Intersect ref = new IntersectRectangle();
Boolean b = ref.intersect(t, s);
the method called is intersect(Rectangle t, Rectangle r). Not a typecast in sight but the danger is that any subclass instance can be passed as an argument and passing the wrong kind gets a runtime error. We can design around this problem but a con is that this for the enclosing-class object must be passed as an argument. Of course it gets passed anyway. The dot before a method call can be seen as passing the reference that precedes it.In this implementation method classes are static nested classes rather than inner classes. Inner classes would be nice: the method body has implicit access to outer-class fields but, because every instance of an inner class carries a reference to the outer-class instance that created it, every outer-class object would need its own method table. Even worse, the execution cost of creating this table is added to the construction cost for every instance. Here a single table is used located in Shape:
protected static Intersect[][] intersectTable = new Intersect[MAX_SHAPES][];The table has two dimensions one for each argument type. If a method has more arguments subject to multiple dispatch the table is implemented with that number of dimensions. The array of method classes for each class is added to the table when the class is initialized. The index of a method-class array depends on the value of mtOffset() for the class, which results from class initialization order and is arbitrary.
The method-class array for all classes is the same size irrespective of how many Intersect descendants the class implements. If a method to suite a particular combination of arguments is not present the corresponding element will be null, a characteristic used implementing the intersect(Shape) method:
public boolean intersect(Shape s) { if (this == s) return true; // get method table offsets for the two types int thisMTO = this.mtOffset(); int sMTO = s.mtOffset(); // thisMTO indexes the table for the current class of object // sMTO the offset for a method to process the argument's type if (intersectTable[thisMTO][sMTO] != null) { // this object has a method so call it return intersectTable[thisMTO][sMTO].intersect(this, s); } // otherwise call the argument's method return intersectTable[sMTO][thisMTO].intersect(s, this); } } // end of ShapeThere is no possibility that the wrong method is called - it is selected programmatically rather than by some scatterbrained programmer ;).
The same strategy shown for instanceof method selection is used to allow additional subclasses to be written: if a null element is found in the current instance's table it does not have a method. It is assumed that the argument implements a suitable method but if it does not a null pointer exception will be thrown rather than stack overflow: still an error but a more traceable result.
Implementing a Rectangle class
MT_OFFSET is initialized by calling Shape.getOffset() and this class' entry in the multy-method table is done by Rectangle.initMT().
public class Rectangle extends Shape { public static final int MT_OFFSET = getOffset(); public static final boolean CLASS_INITIALIZED = initMT(); private static boolean initMT() { intersectTable[MT_OFFSET] = new Intersect[MAX_SHAPES]; intersectTable[MT_OFFSET][MT_OFFSET] = new IntersectRectangle(); return true; } @Override protected int mtOffset() { return MT_OFFSET; } public final int top, left, width, height, right, bottom; public Rectangle(int left, int top, int width, int height) { this.left = left; this.top = top; this.width = width; this.height = height; right = left + width; bottom = top + height; } public static class IntersectRectangle extends Intersect<Rectangle, Rectangle> { @Override public boolean intersect(Rectangle t, Rectangle r) { return (t.right > r.left && t.left < r.right && t.bottom > r.top && t.top < r.bottom); } } @Override public String toString() { return left + ", " + top + ", " + width + ", " + height; } }A new vector of sub-method references is created at the MT_OFFSET for this class. The class only declares a single sub-method of the intersect(Shape, Shape) multi-method leaving other entries in the vector blank. The implemented method is selected by the inherited intersect(Shape) method when this is a Rectangle and its argument is also a Rectangle.
Implementing other shapes
Each new Shape implements sub-methods of intersect(Shape, Shape) to compare instances of the class with the instances of that class and other shapes implemented to date. Square declares two sub-methods:
public class Square extends Shape { public static final int MT_OFFSET = getOffset(); public static final boolean CLASS_INITIALIZED = initMT(); static boolean initMT() { intersectTable[MT_OFFSET] = new Intersect[MAX_SHAPES]; intersectTable[MT_OFFSET][Rectangle.MT_OFFSET] = new IntersectRectangle(); intersectTable[MT_OFFSET][MT_OFFSET] = new IntersectSquare(); return true; } @Override protected int mtOffset() { return MT_OFFSET; } public final int top, left, side, right, bottom; public Square(int left, int top, int side) { this.left = left; this.top = top; this.side = side; right = left + side; bottom = top + side; } static class IntersectSquare extends Intersect<Square, Square> { @Override public boolean intersect(Square t, Square s) { return (t.right > s.left && t.left < s.right && t.bottom > s.top && t.top < s.bottom); } } static class IntersectRectangle extends Intersect<Square, Rectangle> { @Override public boolean intersect(Square t, Rectangle r) { return (t.right > r.left && t.left < r.right && t.bottom > r.top && t.top < r.bottom); } } @Override public String toString() { return left + ", " + top + ", " + side; } }Circle has three intersect() sub-methods:
public class Circle extends Shape { public static final int MT_OFFSET = getOffset(); public static final boolean CLASS_INITIALIZED = initMT(); static boolean initMT() { intersectTable[MT_OFFSET] = new Intersect[MAX_SHAPES]; intersectTable[MT_OFFSET][Rectangle.MT_OFFSET] = new IntersectRectangle(); intersectTable[MT_OFFSET][Square.MT_OFFSET] = new IntersectSquare(); intersectTable[MT_OFFSET][MT_OFFSET] = new IntersectCircle(); return true; } @Override protected int mtOffset() { return MT_OFFSET; } public final int left, top, radius, cx, cy ; public Circle(int left, int top, int radius) { this.left = left; this.top = top; this.radius = radius; cx = left + radius; cy = top + radius; } static class IntersectCircle extends Intersect<Circle, Circle>{ @Override public boolean intersect(Circle t, Circle c) { int dx = c.cx - t.cx; int dy = c.cy - t.cy; double distance = Math.sqrt(dx*dx + dy*dy); return (distance < t.radius + c.radius); } } static class IntersectRectangle extends Intersect<Circle, Rectangle>{ @Override public boolean intersect(Circle t, Rectangle r) { return t.intRect(r.left, r.top, r.right, r.bottom); } } static class IntersectSquare extends Intersect<Circle, Square>{ @Override public boolean intersect(Circle t, Square s) { return t.intRect(s.left, s.top, s.right, s.bottom); } } @Override public String toString() { return left + ", " + top + ", " + radius; } private int distance(int x, int y) { int dx = cx - x; int dy = cy - y; return (int)Math.sqrt(dx*dx + dy*dy); } private boolean intRect(int left, int top, int right, int bottom) { int lr = left - radius; int rr = right + radius; int tr = top - radius; int br = bottom + radius; if (cx <= lr || cx >= rr || cy <= tr || cy >= br) return false; if (cx <= left) { if (cy <= top) return distance(left, top) < radius; if (cy >= bottom) return distance(left, bottom) < radius; } else if (cx >= right) { if (cy <= top) return distance(right, top) < radius; if (cy >= bottom) return distance(right, bottom) < radius; } return true; } }
Testing it out
First sketch some shapes and modify the test function from the implementing equals() article then code main to compare the shapes.
By the way, I make no claims that the intersect algorithms are efficient or even correct: Let's see how this works out in the abstract base class Shape:
public class TestIntersect { public static void interPrint(Shape a, Shape b) { String ac = a.getClass().getCanonicalName(); if (ac == null) ac = "?"; else ac = ac.substring(ac.indexOf('.') + 1); String bc = b.getClass().getCanonicalName(); if (bc == null) bc = "?"; else bc = bc.substring(bc.indexOf('.') + 1); System.out.println(ac + " " + a.toString() + " a intersect() " + bc + " " + b.toString() + " b is " + a.intersect(b)); System.out.println("aoff: " + a.mtOffset() + " boff: " + b.mtOffset()+ ", " + bc + " b intersect() " + ac + " a is " + b.intersect(a) ); } public static void main(String[] args) { Square square1 = new Square(10, 10, 40); Circle circle1 = new Circle(30, 30, 20); Rectangle rect1 = new Rectangle(40, 40, 20, 80); interPrint(square1, circle1); interPrint(circle1, rect1); interPrint(square1, rect1); System.out.println(); Square square2 = new Square(60, 120, 40); Circle circle2 = new Circle(94, 85, 20); Rectangle rect2 = new Rectangle(103, 5, 20, 80); interPrint(square2, rect1); interPrint(square2, circle2); interPrint(circle2, rect2); interPrint(square2, rect2); } }Works O.K. Note that the Square's mtOffset() is zero because it is initialized first. Changing instance construction order in main() will change these values but functionality is not affected.
Square 10, 10, 40 a intersect() Circle 30, 30, 20 b is true
aoff: 0, boff: 2. Circle intersect() Square is true
Circle 30, 30, 20 a intersect() Rectangle 40, 40, 20, 80 b is true
aoff: 2, boff: 1. Rectangle intersect() Circle is true
Square 10, 10, 40 a intersect() Rectangle 40, 40, 20, 80 b is true
aoff: 0, boff: 1. Rectangle intersect() Square is true
Square 60, 120, 40 a intersect() Rectangle 40, 40, 20, 80 b is false
aoff: 0, boff: 1. Rectangle intersect() Square is false
Square 60, 120, 40 a intersect() Circle 94, 85, 20 b is false
aoff: 0, boff: 2. Circle intersect() Square is false
Circle 94, 85, 20 a intersect() Rectangle 103, 5, 20, 80 b is false
aoff: 2, boff: 1. Rectangle intersect() Circle is false
Square 60, 120, 40 a intersect() Rectangle 103, 5, 20, 80 b is false
aoff: 0, boff: 1. Rectangle intersect() Square is false
Summary
Generic nested classes provide a way of declaring methods capable of multiple dispatch selection in standard Java. Without performance testing and as-is the technique looks faster than selection from a list of isA statements.