Geon (psychology)Geons are the simple 2D or 3D forms such as cylinders, bricks, wedges, cones, circles and rectangles corresponding to the simple parts of an object in Biederman's recognition-by-components theory.[1] The theory proposes that the visual input is matched against structural representations of objects in the brain. These structural representations consist of geons and their relations (e.g., an ice cream cone could be broken down into a sphere located above a cone). Only a modest number of geons (< 40) are assumed. When combined in different relations to each other (e.g., on-top-of, larger-than, end-to-end, end-to-middle) and coarse metric variation such as aspect ratio and 2D orientation, billions of possible 2- and 3-geon objects can be generated. Two classes of shape-based visual identification that are not done through geon representations, are those involved in: a) distinguishing between similar faces, and b) classifications that don’t have definite boundaries, such as that of bushes or a crumpled garment. Typically, such identifications are not viewpoint-invariant. Properties of geonsThere are 4 essential properties of geons:
Derivation of invariant properties of geonsViewpoint invariance: The viewpoint invariance of geons derives from their being distinguished by three nonaccidental properties (NAPs) of contours that do not change with orientation in depth:
NAPs can be distinguished from metric properties (MPs), such as the degree of non-zero curvature of a contour or its length, which do vary with changes in orientation in depth. Invariance to lighting direction and surface characteristicsGeons can be determined from the contours that mark the edges at orientation and depth discontinuities of an image of an object, i.e., the contours that specify a good line drawing of the object’s shape or volume. Orientation discontinuities define those edges where there is a sharp change in the orientation of the normal to the surface of a volume, as occurs at the contour at the boundaries of the different sides of a brick. A depth discontinuity is where the observer’s line of sight jumps from the surface of an object to the background (i.e., is tangent to the surface), as occurs at the sides of a cylinder. The same contour might mark both an orientation and depth discontinuity, as with the back edge of a brick. Because the geons are based on these discontinuities, they are invariant to variations in the direction of lighting, shadows, and surface texture and markings. Geons and generalized conesThe geons constitute a partition of the set of generalized cones,[2] which are the volumes created when a cross section is swept along an axis. For example, a circle swept along a straight axis would define a cylinder (see Figure). A rectangle swept along a straight axis would define a "brick" (see Figure). Four dimensions with contrastive values (i.e., mutually exclusive values) define the current set of geons (see Figure):
These variations in the generating of geons create shapes that differ in NAPs. Experimental tests of the viewpoint invariance of geonsThere is now considerable support for the major assumptions of geon theory (See Recognition-by-components theory). One issue that generated some discussion was the finding[3] that the geons were viewpoint invariant with little or no cost in the speed or accuracy of recognizing or matching a geon from an orientation in depth not previously experienced. Some studies[4] reported modest costs in matching geons at new orientations in depth but these studies had several methodological shortcomings.[5][6] Research on geonsThere is much research out about geons and how they are interpreted. Kim Kirkpatrick-Steger, Edward A. Wasserman and Irving Biederman have found that the individual geons along with their spatial composition are important in recognition.[7] Furthermore, the findings in this research seem to indicate that non-accidental sensitivity can be found in all shape discriminating species.[8] Notes
|