Kent Dahlgren is the strategic planner for graphics products at Paradise Systems, a manufacturer of graphics devices. He can be reached at 800 E. Middlefield Rd., Mountain View, CA 94043.
Inspired by the success of the Apple Macintosh family with its windowed user interface, the personal computer industry has adopted graphical interfaces as one of its dominant trends. The market is rapidly advancing to a point where advanced graphics will be expected of all new applications. Unfortunately, the proliferation of windowing standards, graphics libraries, and printer interfaces has created confusion among both hardware and software developers. A wide range of interfaces is available, with an equally wide range of functionality. In addition, there are compatibility issues between the various subroutine libraries and windowing interfaces. This article provides programmers with an overview of some of the more important graphical interfaces, as well as considerations in selecting one.
Consider a model representing the components that might be present in a graphical interface package. This will provide a common frame of reference for the purposes of discussion, the same way that the OSI networking model serves as a means of describing networking interfaces. Figure 1, page 33, shows the components of the model, as well as their relationships.
Figure 1: Components of the graphical interface model. User -> Application/User Interface -> Window Manager -> [API] -> Display List Manager -> Mapping/Translation Layer -> [VDI] -> Rendering Interface -> Drivers (A, B) -> Devices (A, B)
In addition to the components themselves, consider the terminology describing two of the key interfaces in most graphics system implementations. The Applications Programmer's Interface (or API) is a set of routines that allows an application programmer to communicate with both the Window Manager and the Graphics Engine. Systems programmers who are porting new graphical interfaces to a machine, as well as hardware vendors who are interfacing graphics hardware, are concerned with the Virtual Device Interface (VDI). These represent the front end and back end of a computer graphics system.
In windowing environments, the user interface is commonly referred to as the "look and feel." This is the channel through which the user communicates with the Window Manager. This channel allows the user to alter the size, shape, and arrangement of windows, as well as open and close them. The user interface also handles pull-down and pop-up menus, dialog boxes, and other graphical elements of communication.
Note that the user interface is, for the most part, separate from the remainder of the windowing system. In many systems, the differentiation is purely conceptual. On the other hand, some systems (such as X Windows) don't define any user interface as part of the specification. In those cases, the implementer must decide how screen actions will affect the system state.
As a result of this separation between the User Interface and the Window Manager, you can map a common look and feel onto differing windowing systems. Or, you can map several user interfaces onto a single windowing system. If done properly, such differences between user interfaces are transparent to applications.
The recent popularity of X Windows in the Unix community has led to the development of the Open Look user interface, championed by Sun Microsystems and AT&T. The rival OSF Unix camp has been examining contenders for its own X Windows user interface. (There is considerable speculation that OSF is considering the IBM/Microsoft Presentation Manager as its standard user interface.) The ability to tailor a distinctive user interface in order to differentiate products in the marketplace is one reason why X Windows is popular with Unix systems vendors.
The Window Manager is responsible for the abstraction of a bit-mapped display image to multiple, virtual display surfaces. It maintains the system's data structures and informs both the user interface and the applications about the size, shape, and visibility of the various windows displayed on the screen. In the case of Microsoft Windows, this is the portion of the system that layers multitasking capabilities on top of DOS.
Two classes of interactions occur between applications and the Window Manager. Applications request the manager's services through a library of subroutines that handle such nondrawing activities as the opening and closing of windows. The Window Manager, in turn, responds to user-generated events by sending messages to the affected applications. Clicking on a window's close box, for example, sends a close message to the application that owns that window; applications owning windows uncovered by the closing window receive messages indicating what needs to be redrawn.
This type of asynchronous, event-driven environment requires program structures closer to that of real-time control environments than to typical applications programs. Figure 2, page 33, shows the basic flow of a typical windowing application as expressed in pseudocode.
BEGIN
    Initialize data structures
    Set up menus
    Open main window
    WHILE (not done)
    BEGIN
        Get next message
        CASE (message type) OF
            type_A_message : execute_A_handler( );
            type_B_message : execute_B_handler( );
            type_C_message : execute_C_handler( );
            type_D_message : done = TRUE;
            default        : execute_default_handler( );
        END
    END
    Clean up environment
END
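The structure above can be sketched as runnable code. The message names and handler functions here are hypothetical illustrations, not part of any particular windowing API.

```python
# A minimal event-driven application skeleton, mirroring the pseudocode above.
# Message names and handlers are invented for illustration.

def run_event_loop(get_message, handlers):
    """Dispatch incoming messages to handlers until a QUIT message arrives."""
    done = False
    while not done:
        msg = get_message()          # block until the window manager sends a message
        if msg == "QUIT":
            done = True              # the type_D case in the pseudocode: exit the loop
        else:
            handler = handlers.get(msg, lambda: None)  # default handler ignores unknown messages
            handler()
    # clean up the environment here (close windows, free resources)

# Usage: drive the loop with a scripted message stream.
events = iter(["REDRAW", "KEYPRESS", "QUIT"])
log = []
run_event_loop(
    get_message=lambda: next(events),
    handlers={"REDRAW": lambda: log.append("redrew"),
              "KEYPRESS": lambda: log.append("key")},
)
```

Note that the application never decides *when* things happen; it only decides *what* to do when told, which is the inversion of control the article describes.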
Although the structure of the program requires rethinking, the windowing system itself typically offloads many common chores from the application. For example, a single function call within the Macintosh programming environment initiates the opening of a window that allows the user to select a file. Putting this level of functionality in one call not only simplifies programming, but also ensures that the user will encounter the same menu structure in different applications. This uniformity allows Macintosh users to quickly learn new applications.
The Display List Manager, Mapping/Translation Layer, and Rendering Interface are collectively referred to as the Graphics Engine. Through its drivers the Graphics Engine handles the task of putting graphical objects on the screen. Applications use the Graphics Engine to display objects within their windows. The user interface uses it to construct window borders, menus, screen backgrounds, and other visual elements.
In nonwindowed graphical systems, the User Interface and the Window Manager are not present, and the entire interface consists of components of the Graphics Engine. The Graphics Engine itself is also considerably simplified in these cases since there is no requirement to map multiple logical screens to the physical screen.
The Display List Manager decouples the application's generation of drawing requests from the hardware (or software) that performs the rendering. This requires some form of buffering, which could be as simple as a queue or as sophisticated as a hierarchical object-oriented database.
Even a simple queuing arrangement can be beneficial when a graphics coprocessor performs the rendering. A display list queue eliminates the need for the host CPU to wait for completion of one drawing command before issuing the next. Applications tend to issue drawing commands in bursts; the display list queue evens out the workload.
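The queuing arrangement can be sketched in a few lines; the command names and the simulated coprocessor are illustrative only.

```python
from collections import deque

# A display list as a simple FIFO queue of drawing commands. The host CPU
# enqueues commands in bursts; the (simulated) coprocessor drains them at
# its own pace. Command names are invented for illustration.

class DisplayList:
    def __init__(self):
        self.queue = deque()

    def issue(self, command, *args):
        """Host side: returns immediately instead of waiting for the renderer."""
        self.queue.append((command, args))

    def drain(self, render):
        """Coprocessor side: execute queued commands in order."""
        while self.queue:
            command, args = self.queue.popleft()
            render(command, args)

dl = DisplayList()
dl.issue("line", 0, 0, 100, 100)   # a burst of commands from the application
dl.issue("rect", 10, 10, 50, 50)
rendered = []
dl.drain(lambda cmd, args: rendered.append(cmd))
```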
More sophisticated drawing interfaces (such as GKS, PHIGS, and HOOPS) allow the application to group drawing primitives and manipulate them as single objects. PHIGS and HOOPS carry this idea one step further by arranging these display list groupings into hierarchies that allow children to inherit characteristics from their parents. Inheritance is particularly important in 3-D graphics, where the inheritance allows the programmer to manipulate one component, several related elements, or an entire complex object, all with equal ease.
An example is the image of a robot arm, which might consist of a base, an upper arm, a lower arm, and a hand. All of these parts share a positional relationship to each other. If one rotates the base, all the other components remain fixed with respect to one another and move as a unit. Therefore, the base is the parent node of the hierarchy, and all other parts inherit the attribute of position from the base. Each child component also has some freedom of movement, which affects its children but not its parents. If you move the lower arm, for example, the hand must go with it--obeying the law of inheritance--but the upper arm and base are unaffected.
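The robot-arm hierarchy can be modeled directly. For brevity, the sketch below inherits only a 2-D translation; a real PHIGS- or HOOPS-style system would compose full transformation matrices.

```python
# A toy display-list hierarchy in which children inherit position from their
# parents. Only translation is inherited here, to keep the example short.

class Node:
    def __init__(self, name, offset, parent=None):
        self.name = name
        self.offset = list(offset)   # position relative to the parent
        self.parent = parent

    def world_position(self):
        """Compose offsets up the hierarchy: a child moves with its parents."""
        x, y = self.offset
        if self.parent is not None:
            px, py = self.parent.world_position()
            x, y = x + px, y + py
        return (x, y)

# The robot arm from the text, as a chain of parent-child links.
base = Node("base", (0, 0))
upper = Node("upper_arm", (0, 10), base)
lower = Node("lower_arm", (0, 8), upper)
hand = Node("hand", (0, 4), lower)

base.offset[0] += 5    # moving the base carries every descendant with it
lower.offset[0] += 2   # moving the lower arm moves the hand, not the upper arm
```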
Most graphical interfaces map drawing primitives to the display through layers of coordinate transformations. Figure 3, page 35, shows how GKS implements coordinate transformations in a two-dimensional space. The coordinates that the application uses to describe objects are referred to as the World Coordinate (WC) system and use a floating-point representation. GKS maps these points onto an internal abstract display using what are called Normalized Device Coordinates (NDC). These coordinates are unsigned values normalized between 0 and 1 in both the X and Y directions. The rendering interface then maps NDC to the actual Device Coordinates (DC) of the output medium. This two-stage mapping allows GKS applications to zoom or pan the viewing area over the database simply by changing transformation parameters.
Figure 3: 2-D coordinate transformations in GKS
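The two-stage mapping can be sketched as follows; the window and device dimensions are arbitrary example values, not part of GKS itself.

```python
# A simplified GKS-style two-stage mapping: world coordinates (WC) are first
# normalized to [0, 1] (NDC) against an application-chosen window, then scaled
# to integer device coordinates (DC). Panning or zooming only requires
# changing the window parameters, not the application's data.

def wc_to_ndc(x, y, window):
    """window = (xmin, xmax, ymin, ymax) in world coordinates."""
    xmin, xmax, ymin, ymax = window
    return ((x - xmin) / (xmax - xmin), (y - ymin) / (ymax - ymin))

def ndc_to_dc(nx, ny, width, height):
    """Map normalized coordinates onto a width-by-height pixel device."""
    return (round(nx * (width - 1)), round(ny * (height - 1)))

window = (0.0, 200.0, 0.0, 100.0)        # the world-space viewing window
nx, ny = wc_to_ndc(100.0, 50.0, window)  # the center of the window
dx, dy = ndc_to_dc(nx, ny, 640, 480)     # ...lands at the center of the screen
```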
While the bulk of the older interfaces supports only two-dimensional drawing spaces, the increasing interest in CAD-type software has created a demand for three-dimensional graphics capabilities in new interfaces. The development of personal computers with the power of a workstation has only recently made three-dimensional graphics practical on small systems.
Handling three-dimensional display lists is not the challenge. Rather, the problem is mapping the data to the screen and rendering it in the display buffer. To give the WC system sufficient dynamic range, floating-point coordinates are normally used. The transformation of each point requires at least one matrix multiplication (and usually several), and hidden-line removal and surface shading incur significant additional overhead. Programmers who use 3-D graphics on the present generation of personal computers must either be content with non-real-time image creation or invest in expensive special-purpose hardware.
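The per-point cost is easy to see in code: transforming a single point by a 4x4 homogeneous matrix takes 16 multiplications and 12 additions, and a complex scene contains thousands of points.

```python
# Transforming one 3-D point by a 4x4 homogeneous matrix.

def transform(m, p):
    """Apply a 4x4 row-major matrix m to the point p = (x, y, z, 1)."""
    return tuple(sum(m[r][c] * p[c] for c in range(4)) for r in range(4))

# A translation by (10, 20, 30) expressed as a homogeneous matrix.
translate = [[1, 0, 0, 10],
             [0, 1, 0, 20],
             [0, 0, 1, 30],
             [0, 0, 0, 1]]
moved = transform(translate, (1, 2, 3, 1))
```

In practice several such matrices (modeling, viewing, projection) are composed per point, which is why dedicated transformation hardware pays off.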
The-dimensional graphics standards (whether formal or vendor-specified) have several attractive features. For one, the programmer does not have to deal with the complex issues of transforming internal representations of objects into visual images. These tasks are done by making simple API subroutines, passing the names of an object to be manipulated, and various control parameters. For another, the level of data abstraction allows you to achieve performance improvements transparently as new and faster hardware becomes available--assuming, of course, that the new platform supports the graphical system under which the application and its data were developed.
Two methods exist for displaying text in graphics systems. The most common on personal computers is raster text. Fonts are stored in memory as arrays of bitmaps indexed by character value and placed in the desired position on the screen with a copy operation. If the graphics hardware includes logic for performing these memory transfers, the operation occurs quickly. The primary problem with this representation is the difficulty of rotating the character images and scaling them to different point sizes.
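A raster-text draw is essentially a block copy, as the sketch below shows; the 3x3 glyph is toy data invented for illustration.

```python
# Raster text: a font is an array of bitmaps indexed by character code, and
# drawing a character is a block copy into the frame buffer.

FONT = {"I": [[0, 1, 0],
              [0, 1, 0],
              [0, 1, 0]]}   # a toy 3x3 glyph

def draw_char(framebuffer, ch, x, y):
    """Copy the glyph bitmap for ch into the frame buffer at (x, y)."""
    glyph = FONT[ch]
    for row, bits in enumerate(glyph):
        for col, bit in enumerate(bits):
            if bit:
                framebuffer[y + row][x + col] = 1

fb = [[0] * 8 for _ in range(4)]   # a tiny monochrome frame buffer
draw_char(fb, "I", 2, 0)
```

Note that nothing in this scheme knows about the *shape* of the character, which is exactly why rotation and rescaling are hard: there is only a grid of bits at one fixed size.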
More sophisticated environments use stroke or vector fonts instead of bitmaps. Here the outline of a character is formed from straight and curved line segments and then filled. The beauty of this approach is that, unlike raster fonts, the characters can be rotated and scaled cleanly and the fonts are not intimately linked to the resolution of the device.
Metafiles provide a means of archiving collections of graphic primitives, either for future use by the same application or for use by other applications (image libraries). Since collections of primitives are inherently device-independent, image libraries can be used by other systems that support the same graphical interface. The key international standard for this capability is the Computer Graphics Metafile (CGM), a specification approved by both ISO (ISO 8632) and ANSI (ANSI/X3.122).
Having covered the general characteristics of the components and issues surrounding graphical interfaces, let's survey the most significant standards in use today.
GKS was the first formally approved two-dimensional graphics interface standard (ISO 7942, ANSI/X3.124). It provides a rich set of graphics primitives and a flexible mapping scheme. It also includes facilities for applying geometric transformations to primitives or groups of primitives. Beyond these fundamental concepts, GKS supports a limited form of object-oriented graphics programming: a series of calls can be recorded in a structure referred to as a segment, which can then be replayed to reproduce a complex series of operations.
This facility has some serious limitations. Once a segment has been defined, you have no way to edit it. If changes have to be made, you must delete it and rebuild it from scratch. Another limitation is that segments cannot be defined hierarchically, which means that segments cannot contain references to other segments. Later standards (such as PHIGS) remove these limitations.
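The segment facility and its limitations can be sketched as follows. All names here are illustrative, not the actual GKS language binding.

```python
# GKS-style segments sketched as recorded call sequences: once closed, a
# segment can be replayed or deleted, but not edited in place.

class SegmentStore:
    def __init__(self):
        self.segments = {}
        self.open_segment = None

    def create_segment(self, seg_id):
        self.open_segment = seg_id
        self.segments[seg_id] = []

    def record(self, call, *args):
        """While a segment is open, drawing calls are recorded into it."""
        self.segments[self.open_segment].append((call, args))

    def close_segment(self):
        self.open_segment = None

    def replay(self, seg_id, draw):
        """Reproduce the recorded operations in order."""
        for call, args in self.segments[seg_id]:
            draw(call, args)

    def delete_segment(self, seg_id):
        del self.segments[seg_id]   # the only way to change a segment: delete and rebuild

store = SegmentStore()
store.create_segment("house")
store.record("polyline", [(0, 0), (10, 0), (10, 10)])
store.record("fill_area", [(2, 2), (8, 2), (8, 8)])
store.close_segment()
replayed = []
store.replay("house", lambda call, args: replayed.append(call))
```

Notice there is no way for one segment to refer to another; the flat namespace is precisely the non-hierarchical limitation that PHIGS structures remove.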
PHIGS is a draft standard (ANSI/X3.144) developed to fill the need for a three-dimensional graphical interface standard, as well as to correct some of the deficiencies of GKS. Its drawing model was derived from GKS and the syntax of the function calls is similar, but PHIGS goes far beyond the capabilities of its predecessor.
The core of the PHIGS Display List Manager is the Centralized Structure Store (CSS). The CSS is a hierarchical database for maintaining models of graphical objects. The segments used by GKS are referred to as structures in PHIGS. Not only has the name changed, but structures can be edited and can refer to other structures. The only limitation on structure references is that recursion is not permitted (that is, an object cannot be defined in terms of itself). PHIGS has the additional capability of using "generalized structure elements," which allow the inclusion of implementation-dependent extensions within the database.
Such tight coupling of the graphical database and the drawing components greatly simplifies the development of applications that must deal with complex objects. For example, you can construct an image of a complete jet aircraft from the specifications of its individual parts. The entire image can be rotated as an entity by using one system call, with PHIGS responsible for translating all of the components. The application is only required to convert the data format of the parts into a PHIGS representation and define their interrelationships.
The downside, of course, is the amount of horsepower required to support such comprehensive functionality. Running PHIGS on anything less than a 68020- or 80386-based platform results in unacceptable performance.
A set of proposed extensions to PHIGS is currently referred to collectively as PHIGS+. The chief thrust of these extensions is the addition of shading capabilities. The shading algorithms include both Gouraud and Phong, but not ray tracing. This is consistent with the philosophy of keeping PHIGS an interactive standard, inasmuch as the gigaflops required to produce ray-traced images in real time are not likely to be readily available for several years.
PostScript was developed in 1982 by Adobe Systems. It is two-dimensional in nature, with an imaging model based on concepts derived from the graphic arts. It is entirely output-oriented and currently has no constructs for user interaction. It is an interpreted language with a Forth-like syntax.
Currently the bulk of PostScript implementations are printer-based. It is a testimony to the language's power and elegance that it can be found in everything from Apple's LaserWriter to Linotronic typesetting equipment. A screen-based derivative is used by Sun Microsystems as the drawing interface for its NeWS windowing environment. The official Adobe version, Display PostScript, is also the graphical interface on the recently released NeXT personal computer.
PostScript's approach to handling fonts represents one of the most sophisticated text-rendering interfaces developed to date. Characters are described by Bezier cubic splines, which allow very precise images of complex shapes to be specified with a few control points. In addition to this parametric information, PostScript font files contain heuristic rules for transforming characters through rotation and scaling. These rules allow the interpreter to correct anomalies that may creep in as a result of rasterization.
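The underlying curve mathematics is standard: four control points define a cubic Bezier segment, evaluated below in Bernstein form. The square-ish control polygon is example data, not taken from any real font.

```python
# A cubic Bezier spline of the kind used for outline character descriptions:
# four control points define the curve.

def bezier(p0, p1, p2, p3, t):
    """Return the point on the cubic Bezier at parameter t in [0, 1]."""
    u = 1.0 - t
    b = (u**3, 3 * u**2 * t, 3 * u * t**2, t**3)   # Bernstein basis weights
    return (sum(w * p[0] for w, p in zip(b, (p0, p1, p2, p3))),
            sum(w * p[1] for w, p in zip(b, (p0, p1, p2, p3))))

# Endpoints are interpolated exactly; interior control points shape the curve.
start = bezier((0, 0), (0, 1), (1, 1), (1, 0), 0.0)
mid = bezier((0, 0), (0, 1), (1, 1), (1, 0), 0.5)
end = bezier((0, 0), (0, 1), (1, 1), (1, 0), 1.0)
```

Because the description is parametric, rotating or scaling a character means transforming a handful of control points rather than resampling a bitmap, which is why outline fonts survive transformation cleanly.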
The primary factor limiting the growth of PostScript is that, until recently, Adobe Systems was the sole source for ports to new hardware. Several software houses (including Phoenix Technologies) have released or will soon release compatible implementations.
CGI is the ISO/ANSI draft of a VDI specification (ISO DP9636, ANSI/X3.122). It was designed to be the VDI for GKS, and its functionality is tailored for that purpose, although considerable effort was expended to make the interface useful to other higher-level standards. CGI currently supports a rich set of drawing primitives, as well as attribute and segment manipulation functions.
CGI has influenced the development of interfaces from Digital Research, Microsoft, GSS, Nova Graphics, and others. Although these interfaces had their origins in CGI, most have diverged from it. One reason is that few developers are able to wait for a proposed standard to follow the long and winding road to official adoption. Another is that many developers want to strip out functionality so that the implementation runs efficiently on the lowest common denominator (a 4.77-MHz 8088). At this point, the $64,000 question is whether these specifications will move closer to CGI as the standard is finalized and higher-performance hardware becomes the norm.
Pixar's RenderMan rendering interface was designed to address the needs of applications that want to present a photorealistic representation of objects. Unlike DGIS and CGI, RenderMan is specifically designed to operate in a 3-D graphics environment. In many areas, RenderMan and the proposed PHIGS+ extensions overlap, but overall RenderMan is far more sophisticated in its imaging model, as is apparent in its support for ray tracing and numerous shading models. RenderMan's emphasis is on image quality rather than on real-time interactivity, which is the focus of PHIGS+. Furthermore, RenderMan makes no attempt to be a complete interactive graphics environment: it does not support user input, text, or nonsurface primitives (such as lines and curves).
The Macintosh Finder is the user interface for the Apple Macintosh family of computers. It consists of a set of modules that includes the Window Manager, Resource Manager, Font Manager, Control Manager, Menu Manager, and so on. The Finder handles graphics through a 2-D interface called QuickDraw, which (like the bulk of the system software) resides in ROM. The amount of ROM-based firmware (256K in the Mac II) is one of the reasons no one has cloned the Macintosh. The Finder allows multiple overlapping windows, and the recently introduced MultiFinder allows limited multitasking. Graphical information may be transferred between windows through the clipboard, which can be thought of as a graphical paste buffer.
With the Mac II, Apple introduced an enhanced version of the Rendering Interface called Color QuickDraw. As the name suggests, the major improvement is enhanced color support. The original QuickDraw interface supported only eight colors, which was seldom a limitation since earlier Macs had a monochrome display. With Color QuickDraw, applications specify colors from a 48-bit color space. The system firmware is responsible for mapping these logical colors to physical colors on the screen.
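The idea of a logical color space mapped to whatever the device can actually show can be illustrated by nearest-palette-entry matching. This is a generic sketch, not Color QuickDraw's actual algorithm.

```python
# Mapping a logical color (three 16-bit components, as in a 48-bit color
# space) onto the nearest color a device actually supports, here by finding
# the closest entry in a small device palette.

def nearest(palette, rgb):
    """Return the index of the palette entry closest in RGB distance."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(c, rgb))
    return min(range(len(palette)), key=lambda i: dist(palette[i]))

# A 4-entry device palette, components on the 16-bit 0..65535 scale.
palette = [(0, 0, 0),              # black
           (65535, 0, 0),          # red
           (0, 65535, 0),          # green
           (65535, 65535, 65535)]  # white
idx = nearest(palette, (60000, 5000, 5000))   # a "nearly pure red" request
```

Because the application deals only in logical colors, the same request renders sensibly on an 8-color display, a 256-color display, or a monochrome one; only this mapping step changes.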
Another feature of the Mac II drawing environment is seamless support for multiple-display adaptors. All of the screens attached to the system are logically concatenated to form a single display space. The user can drag windows between displays and can even have windows straddle screens. The logical color space makes differences in the color depth of the physical displays transparent to the application. From the application standpoint, the elegance and simplicity of this approach is only matched by its complexity from the implementation viewpoint.
After the Macintosh Finder, the most prevalent windowing interface for personal computers is Microsoft Windows. Unlike the Finder, Windows has had nonpreemptive multitasking capabilities built into it from the beginning. Although its user interface intentionally differs from Finder in numerous ways, an experienced Macintosh user can quickly become comfortable in Windows. In fact, several applications (including Microsoft Excel and Aldus PageMaker) are nearly identical under the two interfaces.
To complement its multitasking capabilities, Windows supports an inter-process communications protocol called the Dynamic Data Exchange. This protocol is like X Windows in that it is based on a client-server model. It differs from the X protocol in that it is used for both graphical data and general-purpose interprocess communication. The reason for such a difference is that Windows runs under DOS, a single-tasking operating system. On the other hand, X Windows was designed to be hosted by the Unix system, which has a complete set of interprocess communications utilities already in place.
The Windows Graphics Interface is known as the Graphics Device Interface (GDI). It is a 2-D interface that uses a one-stage logical-to-device coordinate conversion. Like QuickDraw, Windows has a somewhat limited set of drawing primitives, but it does feature very powerful and flexible raster operations.
The most noteworthy aspect of Windows is its driver interface. The designers of Windows were faced with supporting devices as simple as the CGA and as complex as the PGA with its graphics coprocessor. The challenge was to design a driver interface that would allow the vendors of simple display controllers to write simple drivers with a few capabilities (since every function has to be performed in software), while at the same time be able to take advantage of highly sophisticated graphics devices.
The solution was to create an interface that requires any driver to perform only a small number of essential functions. Beyond this core is a wide range of functions that the driver can optionally support. When the GDI wants to perform an operation, it checks a data structure that indicates which services the driver supports. If the particular operation is supported, the GDI calls the driver directly; otherwise, it emulates the function through calls to other services that the driver does furnish. For example, if the driver doesn't support circles, the GDI constructs a circle via multiple calls to the line-drawing functions (which every driver is required to support).
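This capability-check-with-fallback dispatch can be sketched as follows. The function names, the capability table, and the polygonal circle emulation are all illustrative, not Windows' actual internals.

```python
import math

# A GDI-style dispatch: the engine checks the driver's capability table and
# calls the driver directly when it supports an operation; otherwise it
# emulates the operation using the small required core (here, draw_line).

class Engine:
    def __init__(self, driver):
        self.driver = driver   # operation name -> function; 'draw_line' is required

    def circle(self, cx, cy, r, segments=16):
        if "circle" in self.driver:
            self.driver["circle"](cx, cy, r)   # hardware-assisted path
            return
        # Software fallback: approximate the circle with line segments.
        pts = [(cx + r * math.cos(2 * math.pi * i / segments),
                cy + r * math.sin(2 * math.pi * i / segments))
               for i in range(segments + 1)]
        for a, b in zip(pts, pts[1:]):
            self.driver["draw_line"](a, b)

calls = []
simple_driver = {"draw_line": lambda a, b: calls.append("line")}
Engine(simple_driver).circle(0, 0, 10)   # emulated: many line calls
smart_driver = {"draw_line": lambda a, b: calls.append("line"),
                "circle": lambda cx, cy, r: calls.append("hw_circle")}
Engine(smart_driver).circle(0, 0, 10)    # one hardware call
```

The application code is identical in both cases; only the driver's advertised capabilities determine whether the fast path is taken.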
X Windows originated as part of Project Athena at MIT. It was developed with the help of several computer manufacturers and has become the windowing interface of choice for Unix workstations. It is supported (in one form or another) on systems produced by DEC, HP, Apollo, Sun, and others. Because the drawing interface for X is fairly minimal, most implementers have extended it, usually by the addition of a supplementary interface.
One of the most significant contributions of X is its client-server architecture, which enables nodes to transparently interchange graphical information over a network. This is one of the architectural features that differentiates the X Window System from Microsoft Windows and the Macintosh environments. When an application opens a window under X, it specifies the network address of the server that will do the actual rendering, and the system automatically routes the application's graphics output calls to that node. Of course, a high-performance network is required to handle all the traffic resulting from such a facility. For this reason, such a feature is not likely to be added to any of the PC-based windowing environments in the immediate future.
The choice of a graphics system depends on a number of factors, the goal being to match capabilities as closely as possible with known and probable requirements. For example, if the primary purpose is to provide a consistent user interface through the use of windows and menus, Microsoft Windows or Finder is more appropriate than, say, PostScript or HALO, which have no explicit user interface support. On the other hand, if highly realistic three-dimensional graphics is the chief goal, then something like PHIGS or RenderMan is a better choice. Table 1, this page, lists the major features of several commercial graphics interface systems.
Table 1: Major features of several graphic rendering packages
Support for various hardware/software platforms and data portability among them might also be an important consideration in selecting a graphics system. For example, a work group might use both 386-based PCs running DOS and Sun workstations running Unix, and need to share data and programs between them. In that case, Finder is definitely out of the running, since it is only available on Macintosh machines, but something like the widely implemented HOOPS might be ideal.
Another very important factor is ease of programming and the quality of documentation. One of the chief objectives in using a packaged graphics interface is productivity: relieving programmers of the complex tedium required to get images onto the screen or manage the user interface (or both). A well-designed API allows the programmer to concentrate on the purpose of the application rather than on display management, thus achieving the goal of increased productivity. One that is poorly designed--or, even worse, badly documented--merely replaces one set of complexities with another.
Graphical interfaces are powerful and complicated toolsets. The key to selecting a graphics package and using it effectively is knowing what components it has and how they interact to solve your programming problems.