ACCESSING HARDWARE FROM 80386 PROTECTED MODE PART I

Understanding the 386 architecture may simply be a matter of building on what you already know

Figure 1 shows the local descriptors for a program running under Phar Lap with the no-page switch on. These values were obtained by running an NDP C program under the Phar Lap 386DEBUG program and using the dl command to dump the local descriptor table. The selector numbers on the left side of the table are the values that a programmer loads into the 386 segment registers to activate a segment. Because 386DEBUG was invoked with paging off, the BASE values in Figure 1 correspond to physical addresses.

Figure 1: The local descriptors for a program running under Phar Lap

  Selector        BASE     Limit     Flags     Use     Gran     Comment
  ------------------------------------------------------------------------
     04          53030        FF       92       32     BYTE     DOS EXTEND
     0C         100000       2FF       9A       32     BYTE     USER CODE
     14         100000       2FF       92       32     BYTE     USER DATA
     1C          B8000       FFF       92       32     BYTE     DOS SCREEN
     24          53030        FF       92       32     BYTE     DOS EXTEND
     2C          52F60        B9       92       32     BYTE     DOS EXTEND
     34         000000     FFFFF       92       32     BYTE     1st MEG
     3C       C0000000      FFFF       92       32     BYTE     WEITEK
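
The selector numbers in Figure 1 step by 8, starting at 04, because a 386 selector is not a simple table index: bits 15-3 hold the index, bit 2 says whether the entry lives in the local or the global descriptor table, and bits 1-0 carry the requested privilege level. As a minimal sketch in C (the structure and function names here are our own, purely for illustration, and are not part of any Phar Lap or NDP interface), the fields can be pulled apart like this:

```c
#include <stdint.h>

/* Fields packed into a 16-bit 80386 segment selector.
   The type and function names are hypothetical, chosen for this sketch. */
typedef struct {
    unsigned index;   /* bits 15..3: entry number in the descriptor table */
    unsigned in_ldt;  /* bit 2 (TI): 1 = local table, 0 = global table    */
    unsigned rpl;     /* bits 1..0: requested privilege level             */
} SelectorFields;

/* Split a selector value into its three fields. */
SelectorFields decode_selector(uint16_t sel)
{
    SelectorFields f;
    f.index  = sel >> 3;
    f.in_ldt = (sel >> 2) & 1u;
    f.rpl    = sel & 3u;
    return f;
}
```

Calling decode_selector(0x0C) reports entry 1 of the local table with a requested privilege level of 0; every selector in Figure 1 has bit 2 set, which is what we would expect from a dump of the local descriptor table.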

The memory map has a number of these selectors pinpointed on its left side. Looking at selectors 0C and 14, we see that their corresponding segments are located at the start of what IBM calls "extended memory" (the start of the second megabyte of memory). If we had invoked 386DEBUG with paging on, the primary difference in our segment memory map would be that 0C and 14 would be moved down into the first megabyte to save memory. With paging enabled, however, it is not possible to read the physical location of a segment from the selector BASE value, because the processor performs an additional address translation. Therefore, we will examine some of the selectors in Figure 1 that have been set up by the DOS extender before going on to see what happens when paging is enabled.

Selector 1C has been set up so that it contains the current screen buffer. This selector has a base that starts at address 0B800:0 (in 8086 notation) and is 0FFFH + 1 bytes in length (4K bytes). The fact that this segment corresponds exactly to the screen buffer was no accident. The Phar Lap DOS Extender queried the system to find out what kind of graphics adapter was active, and based on this information created an entry in the LDT (local descriptor table) that precisely matched the device. It is also important to point out that selector 1CH is preferred over selector 34H (which maps in the entire first megabyte of RAM) for screen buffer accesses, because an out-of-bounds write results in a protection fault when using 1CH, but could have disastrous results if 34H were used.
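
To see why the tighter selector is the safer one, it helps to model the check the processor makes on every access. The following is only a sketch of that rule in C for byte-granular, expand-up segments such as those in Figure 1; the Segment type and the function names are our own inventions, and the real check is of course done in hardware:

```c
#include <stdint.h>

/* A byte-granular segment as listed in Figure 1 (hypothetical type). */
typedef struct {
    uint32_t base;   /* BASE column  */
    uint32_t limit;  /* Limit column: the last valid offset */
} Segment;

/* The limit is the last valid offset, so the size is one more. */
uint32_t segment_size(Segment s)
{
    return s.limit + 1u;
}

/* Returns 1 if an access of nbytes starting at offset stays inside the
   segment, 0 if the processor would raise a protection fault instead. */
int access_ok(Segment s, uint32_t offset, uint32_t nbytes)
{
    if (nbytes == 0u || offset > s.limit)
        return 0;
    return (nbytes - 1u) <= (s.limit - offset);
}
```

With the values from Figure 1, a Segment of {0xB8000, 0xFFF} for selector 1C has a size of 4,096 bytes and rejects a write at offset 1000H, while a Segment of {0, 0xFFFFF} for selector 34 happily accepts the same stray write at B9000H.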

The selectors 0CH and 14H were created for user code and data. Note that these selectors have the same base and limit. In fact, they are identical in every way, except for the attribute flags. The format of the attribute byte is:

     upper                  lower
|P|DPL|DT|             |TYPE|

Looking over the memory map, we see that the flag byte has only two values: 92H and 9AH. The lower nibble in 92H indicates that the segment is of type 2, which means the segment is read/write (for data only). All but one of the segments must therefore contain data. Looking at the map, we discover that the segment that we have identified with "user code" has an attribute of 9AH. The TYPE nibble, 0AH, indicates that the segment behind selector 0CH is execute/read (code).

The upper nibble contains miscellaneous information about the segment, including the present bit, two bits that specify the privilege level, and a bit which, when set, specifies that the descriptor describes memory (as opposed to a task switch or special system entity). The binary translation of 9 is 1001, which means that the segment is marked as present in memory, has a privilege level of 0, and describes memory. Privilege level 0 is the most privileged level available, and is frequently referred to as "ring 0."
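
The two flag values can be taken apart mechanically. Here is a minimal sketch in C of the decoding just described; the AccessByte type and the helper names are ours, chosen only for this illustration:

```c
#include <stdint.h>

/* The attribute byte laid out as |P|DPL|DT|TYPE| (hypothetical type). */
typedef struct {
    unsigned present;    /* bit 7: segment is present in memory        */
    unsigned dpl;        /* bits 6..5: descriptor privilege level      */
    unsigned is_memory;  /* bit 4 (DT): 1 = code or data segment       */
    unsigned type;       /* bits 3..0: TYPE nibble                     */
} AccessByte;

AccessByte decode_access(uint8_t flags)
{
    AccessByte a;
    a.present   = (flags >> 7) & 1u;
    a.dpl       = (flags >> 5) & 3u;
    a.is_memory = (flags >> 4) & 1u;
    a.type      = flags & 0x0Fu;
    return a;
}

/* For memory segments, bit 3 of TYPE separates code (1) from data (0). */
int is_code_segment(AccessByte a)
{
    return a.is_memory && (a.type & 0x8u) != 0u;
}
```

Feeding in 92H yields present = 1, DPL = 0, DT = 1, TYPE = 2 (read/write data), while 9AH differs only in its TYPE of 0AH, which is_code_segment reports as code.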

Segments that run in ring 0 are theoretically capable of creating havoc by playing games with system tables that should only be accessed by the operating system or DOS extender. As a practical matter, the only time we have had to deal with invisible system tables, such as the global descriptors, was in the early 80386 days, before the DOS extenders had calls for mapping in new hardware, such as the Weitek coprocessor (which is now automatically mapped in by all DOS extenders).

As long as the program you write goes through the system calls provided by Phar Lap and Eclipse to modify lower-level system tables, such as the interrupt descriptor table, the resulting program will conform to the VCPI specification, which means it will run with VCPI operating environments, such as Desqview-386, Netware-386, Phar Lap, and Eclipse.

As a point of interest, Eclipse runs programs in ring 3. There is a movement in the 386 extender industry toward running in ring 3 instead of ring 0. As long as the operating environments continue to provide the memory mapping capabilities that are utilized below, we have no objection to running in ring 3 rather than ring 0. However, we think there is, and will continue to be, a need for operating environments that provide direct access to all system resources, as a countermeasure to operating systems such as OS/2 and Unix, which are attempting to shut off access to these facilities.

Real Memory from Protected Mode

To move a block of characters and attributes into screen RAM in an 8086 system, we might employ a block move. This technique is frequently used by spreadsheets that build an image in memory of what the screen is going to contain and then instantaneously move this buffer to screen RAM by using a single processor instruction. To set up a block move in an 8086, we point the ds:si registers at the source, the es:di registers at the destination, place the number of bytes to be moved in cx and then use a rep movsb instruction to have the processor make the transfer for us.

The code for an 80386 block move is identical, except that we now use 32-bit registers to hold 32-bit offsets, and where we used physical paragraphs in ds and es, we now use the appropriate selectors. In addition, where we placed the count in cx, we now place the count in ecx, which is a 32-bit register and makes it possible to move more than 64K with a single instruction. For example, to move a 4K buffer of character/attribute pairs to a text-mode screen buffer located at paragraph B800, we would employ one of the two sequences of code shown in Figure 2, depending on whether we were running in real mode or 80386 32-bit mode under Phar Lap.

Figure 2: 16-bit vs. 32-bit assembly code to move a 4K buffer of character/attribute pairs to a text-mode screen buffer

  Real mode                 32-bit protected mode
  ------------------------------------------------------------------

  mov    ax,0B800H          mov    eax,1CH            ;set destination
  mov    es,ax              mov    es,ax              ;segment
  xor    di,di              xor    edi,edi            ;dest offset = 0
  mov    si,offset buffer   mov    esi,offset buffer  ;set source offset
  mov    cx,1000H           mov    ecx,1000H          ;set count (4K bytes)
  cld                       cld                       ;copy toward higher addresses
  rep    movsb              rep    movsb              ;perform block move

The program assumes that the buffer being moved is contained in the current data segment addressed by ds. It then sets up a FAR pointer to the destination (the screen buffer at B800:0). Note that where the real-mode code used the physical paragraph of the screen buffer, the 80386 code uses the selector set up by Phar Lap. Next, the code points si or esi at the buffer to be moved. Again, note that where the real-mode code used a 16-bit offset, the 32-bit code uses a 32-bit offset. Finally, the program sets the number of bytes to be moved in cx or ecx, and requests the processor to carry out the block move. Except for the first line, these two sequences are virtually identical.
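
For readers who think more readily in C than in assembler, the effect of the final instruction can be sketched as a simple loop. This is only a model of what rep movsb does when copying toward higher addresses, not a replacement for it; the function names are our own, and the parameter names merely echo the registers they stand in for:

```c
#include <stddef.h>
#include <stdint.h>

/* Model of rep movsb copying forward: move ecx bytes from the source
   (ds:esi) to the destination (es:edi), advancing both pointers. */
void rep_movsb(uint8_t *edi, const uint8_t *esi, size_t ecx)
{
    while (ecx != 0) {
        *edi++ = *esi++;
        --ecx;
    }
}

/* Build an off-screen image: each text cell is a character byte
   followed by an attribute byte. */
void fill_cells(uint8_t *buffer, size_t cells, uint8_t ch, uint8_t attr)
{
    size_t i;
    for (i = 0; i < cells; ++i) {
        buffer[2 * i]     = ch;
        buffer[2 * i + 1] = attr;
    }
}
```

A program would typically call fill_cells (or something more elaborate) on a buffer in its own data segment, then hand that buffer to the block move with a count of 1000H, exactly as the assembly in Figure 2 does.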

Because the selectors in Figure 1 can access all of the memory in the first physical megabyte of RAM, we have just demonstrated that it is possible to access all of a system's "real" memory from a program running in protected mode. In our example, the source buffer is contained by the default data segment, 14H, which is located in "extended" memory above the first megabyte.
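
The arithmetic behind the two addressing schemes is worth spelling out. In real mode the paragraph in a segment register is shifted left four bits and added to the offset; in protected mode the descriptor's BASE is added to the offset, and with paging off the result is the physical address. A minimal sketch in C, with function names of our own choosing:

```c
#include <stdint.h>

/* Real mode: physical address = paragraph * 16 + offset. */
uint32_t real_mode_address(uint16_t paragraph, uint16_t offset)
{
    return ((uint32_t)paragraph << 4) + offset;
}

/* Protected mode with paging off: physical address = BASE + offset,
   where BASE comes from the descriptor the selector names. */
uint32_t protected_mode_address(uint32_t base, uint32_t offset)
{
    return base + offset;
}
```

Both real_mode_address(0xB800, 0) and protected_mode_address(0xB8000, 0), the latter using the BASE of selector 1C, arrive at B8000H, while offset 0 of the user data segment (BASE 100000H) lands exactly on the one-megabyte boundary.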

48-bit Address Space?

All that remains in our exposé of the 386's flat model is to explore the operation of ports, interrupts, and paging. However, before we leave segmentation, there is one myth we need to explode. The typical text on the 80386 presents the processor as having three address spaces -- virtual, linear, and physical. Up to this point, we have been exploring the linear and physical spaces, which are identical when paging is disabled. The mythical address space turns out to be the "virtual" one. The myth was born because individuals who were used to programming in the large or huge models on the 8086 asked, "What would happen if we could write large or huge code on an 80386, instead of small code?" They quickly came to the conclusion that programs written with compilers and operating systems that support 48-bit pointers (16 bits of selector plus a 32-bit offset) would be capable of addressing a 48-bit address space, which just happens to contain 64 terabytes! (Strictly speaking, only 14 of the selector's 16 bits choose a segment, so the 64-terabyte figure is 16,384 segments of 4 gigabytes each.)
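
The 64-terabyte figure is easy to check. Of the 16 bits in a selector, 13 index a descriptor table and one picks the local or global table; the remaining two carry a privilege level and select nothing. That leaves 14 bits' worth of segments, each with a 32-bit offset. The sketch below, whose names are our own, simply does the multiplication:

```c
#include <stdint.h>

#define SEGMENT_SELECT_BITS 14  /* 13-bit index plus the table bit */
#define OFFSET_BITS         32  /* 32-bit offset within a segment  */

/* Size of the so-called virtual address space, in bytes. */
uint64_t virtual_space_bytes(void)
{
    return (uint64_t)1 << (SEGMENT_SELECT_BITS + OFFSET_BITS);
}

/* Size of the linear address space, in bytes. */
uint64_t linear_space_bytes(void)
{
    return (uint64_t)1 << OFFSET_BITS;
}

/* Convert a byte count to whole terabytes (2^40 bytes each). */
uint64_t to_terabytes(uint64_t bytes)
{
    return bytes >> 40;
}
```

to_terabytes(virtual_space_bytes()) comes out to 64, whereas a full 48 bits would be 256 terabytes; either way, the number dwarfs the 4 gigabytes returned by linear_space_bytes(), and that gap is the heart of the matter.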

We don't know who created this concept, although we suspect that Intel marketing told its system architects (after the last perceived black eye they got from a segmented architecture) that if they had to resort to segmentation again, they had better have a damn good reason. The reality of the situation is that practical program size is limited by the size of what Intel calls the "linear" address space (32 bits), and that a 48-bit address space will not become a reality until Intel increases the size of the linear address space in a future device.

To prove the point, we did a calculation of what would happen if we took a simple program that performed a matrix multiply and extended it to handle arrays whose total size was greater than 4 gigabytes. As the total size of the arrays in our problem approaches 4 gigabytes (each of the three arrays approaches 1.3 gigabytes), we have to abandon our 80386 small model, and Phar Lap, in favor of a compiler-supported memory model and operating system that utilizes the virtual address space (which is not the same as demand-paged virtual memory, which we commonly refer to as "virtual memory").
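
To put a number on where the small model runs out, suppose (and this is our assumption for the sketch, not a detail of the original calculation) that the three arrays are square matrices of 8-byte double-precision elements. The hypothetical helpers below find the largest matrix order for which all three still fit in a given address space:

```c
#include <stdint.h>

/* Bytes needed for three n-by-n matrices of elem_size-byte elements. */
uint64_t three_matrix_bytes(uint64_t n, uint64_t elem_size)
{
    return 3u * n * n * elem_size;
}

/* Largest order n whose three matrices still fit in space bytes.
   A simple upward search is plenty fast for sizes of this scale. */
uint64_t largest_order(uint64_t space, uint64_t elem_size)
{
    uint64_t n = 0;
    while (three_matrix_bytes(n + 1, elem_size) <= space)
        ++n;
    return n;
}
```

Under those assumptions, largest_order((uint64_t)1 << 32, 8) returns 13,377, which puts each matrix at roughly 1.43 billion bytes; one more row and column, and the three no longer fit in the 32-bit linear space together.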

Once our problem hits the 1.4-gigabyte array size, it is impossible to have all three arrays in our 4-gigabyte linear address space at the same time. So, we take advantage of the present bit in the descriptor table to make it possible for our large model operating system to swap arrays as needed. Our large model operating system makes it possible to run large model virtual segments. When we compute the time required to swap our 1.4-gigabyte segments as required by our algorithm, we discover that, assuming we have the world's fastest hard disks, our code runs 100,000 times slower than it did in the small model currently supported by Phar Lap, Unix, and Xenix.

The largest array that our large model supports is 4 gigabytes, which means our problem will span a tiny (in comparison to 64 terabytes) 12-gigabyte address space. But never fear, we have still not finished digging into our bag of 8086 tricks. By resurrecting FAR pointers, the huge model, and tiling, we can hit our 64-terabyte goal -- and for only a cost factor of 400 percent in code efficiency.

What's Next?

That these systems tricks are crucial for future Intel products is quite evident from the 80486, which, unlike the 80386, achieves its best speed with small model code that limits data accesses to the ds segment register only. It's amazing what happens to the best laid plans of product managers, public relations, and system types, when everyone suddenly discovers that the key to selling systems is simplicity (i.e., RISC)! But I hope to convince you next month in Part II of this article that the only use for FAR pointers in 80386 code appears in operating system kernels.