LETTERS

C++ and the 386

Dear DDJ,

As one of what I would imagine is a very small number of Intek C++ users, I read with interest the exchange between Mac Cutchins of Intek and Al Stevens in your March 1990 "Letters" section. When I first received the Intek package I had many of the same problems that Al did, including a not-so-amusing little bug that resulted in the compiler only working every second time. Several of their header files would not compile due to typos and bugs of one sort or another. Intek technical support was always polite but rarely helpful, and I sometimes got the impression that I was the only user they had actually suckered into buying the product. To be fair, they did supply an upgrade when I complained that the old version of the PharLap binder they had used prevented its working correctly with the VCPI standard and hence precluded the use of QEMM and DESQview. I was also notified (by telephone no less) of their upgrade to 2.0, and it was reasonably priced and delivered promptly.

I was eventually able to contrive the necessary patches, batch files, and bug fixes so that it would reliably compile things from my Brief editor and I could pretend I was working with a real development tool. Still one might ask why bother with it when there are other alternatives.

First and foremost, because Intek C++ is the only product that works with the MetaWare High C or WATCOM compilers; thus it is the only way to produce code for 386 protected-mode programs running under DOS extenders. Also, as Mac Cutchins points out, Intek's use of 386 protected mode means you are not concerned with running out of memory when compiling large source modules. The large and numerous header files that C++ encourages can quickly exhaust the memory of real-mode compilers such as Zortech. Many of my large library modules would have to be split up and might still give problems compiling under Zortech. Finally, and unexpectedly, the translator itself is quite robust once it is running. It correctly compiled code segments where Zortech version 1.2 gave spurious errors. Version 2.0 of Zortech seemed more robust in my limited testing of it, but could not compile many of my files due to the memory limitation problem.

I am the only one in our shop using C++ at the moment, but that will change in the near future and I dread having to invest in more copies of the Intek product. With each new issue of DDJ I carefully scan all the ads and announcements for a Turbo C++ 386 or something similar. The OS/2 version of Zortech is tempting, but I need to use too many PharLap programs to make that feasible just yet. When all is said and done, Intek has the singular advantage of being the only product available under DOS for creating really large C++ applications. If something else is available I would love to know about it.

Craig Morris

Calgary, Alberta, Canada

DDJ responds: Thanks for your insights, Craig. Just within the last few days, DDJ contributing editor Andrew Schulman started an in-depth look at C++ implementations for the 386, beginning with Intek C++ and MicroWay's NDP C++. We're looking forward to sharing his findings sometime in the near future.

Trick Trade-offs

Dear DDJ,

This message is in regards to Tim Paterson's article "Assembly Language Tricks of the Trade" in the March 1990 DDJ.

I've always enjoyed reading articles about the tricks and magic that other programmers use. If we assume some things, though, we can do your Binary-To-ASCII Conversion one better.

If we assume that the Carry and Auxiliary Carry are clear, then a binary value in the range 00-0F in AL can be converted to ASCII by:

  daa                         ; 00-09, 10-15
  add          al,0F0h        ; F0-F9 NC, 00-05 CY
  adc          al,040h        ; 30-39, 41-46 ('0'-'9','A'-'F')

Since we usually want to convert a BYTE to two ASCII characters, this is usually preceded by masking and/or shifting some other value. These operations will clear the Carry and Auxiliary Carry, so everything's OK.

Yet another trick: You mention using the AAM and AAD instructions for Binary/Decimal Conversion. There is an undocumented "extension" to these instructions, which is often useful. The opcodes for AAD and AAM are:

  AAD    =    D5 0A
  AAM    =    D4 0A

If the 0As look a little suspicious, it's because they are the divisors used in the conversion. The instruction sequence D4 10 is equivalent to separating the byte in AL into its upper/lower nibbles, placing the upper nibble into the lower nibble of AH and leaving just the lower nibble in AL. This also happens to clear the Carry and Auxiliary Carry flags. Sooooo ... used in conjunction with the binary-to-ASCII conversion code above, it yields an extremely compact, brutally fast byte-to-two-ASCII-digits conversion. Neat, eh?
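For readers without an assembler handy, the combined effect of the two tricks can be sketched in C. This is purely an illustration of the arithmetic (the function names are invented for the example); the real payoff is, of course, the tiny machine-code sequence itself:

```c
/* C sketch of the DAA / ADD / ADC nibble-to-ASCII trick, assuming the
   Carry and Auxiliary Carry flags are clear on entry. */
static unsigned char hex_digit(unsigned char n)  /* n in the range 00h-0Fh */
{
    unsigned v = n;
    if (v > 9)
        v += 6;                           /* daa: 0A-0F become 10-15 */
    unsigned carry = (v + 0xF0) > 0xFF;   /* add al,0F0h: CY set for 0A-0F */
    v = (v + 0xF0) & 0xFF;                /* F0-F9 NC, 00-05 CY */
    v = (v + 0x40 + carry) & 0xFF;        /* adc al,040h: '0'-'9','A'-'F' */
    return (unsigned char)v;
}

/* The "AAM 16" (D4 10) step is just a nibble split: */
static void byte_to_hex(unsigned char b, unsigned char out[2])
{
    out[0] = hex_digit(b >> 4);    /* upper nibble, as AAM 16 leaves in AH */
    out[1] = hex_digit(b & 0x0F);  /* lower nibble, left in AL */
}
```

For example, byte_to_hex(0x3F, buf) leaves '3' and 'F' in buf.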

Keith Moore

Fort Worth, Texas

Tim responds: I am aware of the tricks Keith mentions. However, both rely on undocumented features of the 8086 family, which is a very dangerous practice.

The only instructions which are documented to affect the Auxiliary Carry (AC) flag in a specific way are arithmetic instructions (not including shifts). Masking and shifting instructions are documented as leaving the AC flag undefined. Thus it is very unlikely that the state of the AC flag will be known when Keith's instruction is executed, and the method could easily fail.

Testing with a debugger may leave the impression that masking, for example, leaves the AC flag clear. However, did you check this on an 8088, 8086, 286, or 386? What about the 33-MHz 386, which uses a different mask set than the slower versions? Are you sure the 386SX, 486, and 586 (which no one has seen yet) all work that way?

The same thing can be said for using variants of the AAM and AAD instructions to multiply or divide by something other than ten. Eleven years ago I discovered that the 8086 used the second byte of those instructions as an immediate value. But does a 486? If it does, then the 486 has a bug -- it should perform an invalid-opcode trap if the second byte is not 0AH. Or else Intel needs to document that it works.

There are too many different processors in the family -- and too many different manufacturers -- to consider using undocumented features. Let's all play by the rules.

But Basic Already Does That...

Dear DDJ,

No one is a bigger fan of Jeff Duntemann than I, but he completely missed the boat in his Modula-2 discussion (DDJ, February 1990). As Jeff went over the list of omissions in both Pascal and Modula-2, I kept saying to myself, "But QuickBASIC already does that." In my opinion, Microsoft QuickBASIC overcomes all of the shortcomings of both Pascal and Modula-2, with a language that is both fully structured and incredibly easy to use.

For example, Jeff laments Pascal's inability to view a list of procedures, and praises that feature in Modula-2. But QuickBASIC has had a "View Subs" menu for years. He then compares Pascal's ability to use a varying number and type of parameters for built-in statements, as opposed to Modula with its separate WriteString, ReadInt, and so forth. Again, QuickBASIC (and even interpreted BASIC!) has always had that capability. Worse still, procedures in either language cannot accept a truly "open ended" array. And again, QuickBASIC lets you pass any array -- with any number of dimensions and any range of upper and lower bounds -- to any subroutine. How else could one write a usable sort routine?!

I won't belabor the remaining list of advantages that QuickBASIC has over the "Wirth" languages. No, I won't dwell on QuickBASIC's many data types, automatic support for a coprocessor, TRUE dynamic strings, world-coordinate graphics, or its ability to manage an entire project without requiring all of the files to be in the same directory. (Yeah, that's a good one -- multiple copies of your debugged subroutines scattered all over a disk.) And I won't even belabor QuickBASIC's outstanding support for fully interrupt-driven communications. Where Jeff is bragging about a 100-line Comm program he wrote in an hour using Modula-2, I maintain the same could be done in, say, 20 lines in ten minutes using QuickBASIC.

Indeed, if any language is the rightful successor to "king" Turbo Pascal, surely it is QuickBASIC.

Ethan Winer

Stamford, Connecticut

Editor's note: Ethan is president of Crescent Software, developers of QuickBASIC add-on tools.

Forth-Coming

Dear DDJ,

I read Martin Tracy's article, "Zen Forth," with great interest (DDJ, January 1990). As a Forth programmer myself, I'm interested in Forth systems and applications. I even wrote a Forth system for sale (CorrectForth -- I published it as a product of Correct Software, Inc.). I have a number of comments on the implementation and on what look like bugs in the source code.

First, you could put the address of colon into the register DI. Then colon looks like this:

  LABEL COLON  BP SP XCHG  SI PUSH  BP SP XCHG  SI POP  NEXT  C;

(the CFA code of a colon definition is DI CALL). The result is a system about 2 percent faster than one using a JMP COLON, at the cost of numerous changes to the source code (string operators, FIND, etc.). The changes are minor and would involve saving and restoring DI. Another change would be to use register ES to point to RAM, thus increasing the amount of code space and data space available. Only string operations would be affected, and would involve saving and restoring ES. Then, too, you could dedicate another register to hold the next-to-top-of-stack value. This speeds up the system by 10 percent, since lots of Forth words use two parameters. The system as published in DDJ runs the Sieve of Eratosthenes benchmark in 46 seconds, but the new improved system in 45 seconds. Time counts in real-time applications!

The source code bugs are as follows:

Screen     Page        Bug(s)
13         98          use of TRUE (a code-defined word) in =, <, U<
14         98          same as above, only for 0=, 0<
37         102         use of SP0 in DEPTH

The reason I'd call them bugs is that I don't think the metacompiler Martin was using would execute words defined in the metacompiler's target dictionary. If it did, I'd think twice before I'd use such a "feature" -- I might be cross-compiling for a processor that could not execute host code ...!

Overall, this system sings pretty good. I counted on that -- Mr. Tracy's been in the Forth community much longer than I have. The choice of a DTC (direct threaded code) implementation of Forth is the best in my opinion, since it has the best trade-off of size vs. speed. If you want speed and don't care about size, go for STC (subroutine threaded code), like Small C did. If you want really tight code (say you only have 4K of ROM), go for TTC (token threaded code). If you want speed and just have to have small size, go for DTC. The high-level words run at an acceptable speed, and provided you choose the proper words to code in assembler (CODE definitions, for the knowledgeable), you'll get screaming speed at little cost.
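To make the threading trade-off concrete, here is a toy inner interpreter sketched in C. It is purely illustrative -- the names (lit, plus, halt) are invented for the example, a real DTC Forth such as ZEN jumps straight to machine code rather than through C function pointers, and cell-to-function-pointer casts are implementation-defined rather than strictly portable C:

```c
#include <stdint.h>

/* Each primitive is a C function; a "thread" is an array of cells holding
   either a primitive's address or an inline literal. run() is the classic
   NEXT loop: fetch the next cell, advance, execute. */
typedef void (*prim)(void);

static intptr_t stack[16];
static int sp;
static const intptr_t *ip;                /* thread instruction pointer */

static void push(intptr_t v)   { stack[sp++] = v; }
static intptr_t pop_cell(void) { return stack[--sp]; }

static void lit(void)  { push(*ip++); }   /* next cell holds a literal */
static void plus(void) { intptr_t b = pop_cell(); push(pop_cell() + b); }
static void halt(void) { ip = 0; }        /* stop the inner interpreter */

static void run(const intptr_t *thread)
{
    for (ip = thread; ip != 0; )
        ((prim)*ip++)();                  /* NEXT: fetch, advance, execute */
}
```

Running the thread for the Forth phrase 3 4 + -- that is, { lit, 3, lit, 4, plus, halt } -- leaves 7 on the stack.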

Russell McCale

New York, New York

Martin responds: Thank you for your interest in ZEN Forth. I am writing this letter to answer some of the many questions I have received.

ZEN is a personal dialect I have been developing and porting for several years. Most recently, I have been using it to track the development of the ANSI X3J14 proposed standard. The current state of the standard is reflected in a working document called BASIS. The BASIS changes every three months.

The most recent BASIS is BASIS 10, and I have written ZEN1_10 to match it. ZEN1_10 means version 1, release 10. I have posted ZEN1_10 on GEnie and on BIX, and will continue to post new versions there.

ZEN1_10 is not meant to be a development system, but rather a simple and efficient Forth dialect. I have provided only the source code, for your study, and an executable file that you can use to load a text file to test a program for ANSI compatibility.

Yes, you are missing documentation, assembler, metacompiler, etc. These will not be written until the draft proposal dpANS is ready, which is at least nine months away. The current release was created by a Forth-to-assembler-source translator. The next release will probably be written in Turbo C or C++.

More on Algorithm Patenting

Dear DDJ,

The compression algorithms have been in my conscious path for a search to reduce some of my voluminous writings. I have corresponded with you and Mark Nelson about this, and although I could never get his C program to run with "Let's C" from Mark Williams, I read with interest what some of the law types have to say about it.

Having been in the chemical field for some 30 years, I have come across many snafus of the Patent Office. I leave to your imagination why these snafus occur; not in the least is the heavy burden of research of prior art before patents are granted. Many times patents were granted on chemical procedures or compounds that were in direct conflict with prior art. These were easy to deal with. Usually showing prior art would annul the patent rights right on the spot.

It may have become a bit more difficult today, since our society is the most litigious in the world and lawyers, in and out of government, seem to thrive on perpetuating their own income at the expense of the general population. Lawyers have become the true leeches of this society. I am not surprised that some two-bit lawyer would claim the LZW routine to be patentable, while the real inventors lived some 50 years ago and may have been dead for a while. After all, lawyers have to make money too.

Paul A. Elias

Fountain Hills, Arizona

Location IS Everything

Dear DDJ,

I'm working with Softaid's hardware 8088 emulator, and found Mark Nelson's January 1990 article ("Location is Everything!") on an exe-to-hex locate utility useful and instructive. However, I had to move the STACK segment in his START.ASM file in front of the other data segments to make the locate program behave correctly; this with Borland's TASM 1.0, C 2.0, and TLINK 2.0, which combination I assume uses some slight unanticipated variation of the 5 million sacred ways of ordering segments and groups. Without this change, the stack segment wound up, in a test file, a paragraph after the rest of the data, and since LOCATE uses this value to figure out where all the data is, it wouldn't relocate properly. (This is because -- I would figure, but heaven only knows -- the exe stack record LOCATE uses was in fact the genuine offset of the stack, not of some trifling DGROUP, no matter what START.ASM says.)

Once over that minor difficulty I was able, using various C, TASM, and TLINK debugging options, to include line numbers and globals in an output map file which the Softaid SLD (source level debugger) program and utilities could translate, download, and more or less understand -- that is, I could step and breakpoint in source (public variables wound up in the wrong place, but I'm sure a little more hacking could fix that). SLD is a great and powerful thing capable of much more, or so I am told, and inasmuch as the Softaid system is thousands of dollars, we're spending a few hundred more for a sophisticated locator program. But it's nice to have an extra emergency tool, and using/fiddling Mr. Nelson's program was just the bit of 8088-in-ROM exercise I needed to get in the mood. Thanks for the help.

J.G. Owen

Fort Salonga, New York

Round and Round We Go ... Maybe

Dear DDJ,

Recently I had the chance to put to use the parametric circle algorithm described in Robert Zigon's article in the January issue of DDJ ("Parametric Circles"). Shortly thereafter, I came across Joseph M. Hovanes Jr.'s letter in the March issue, citing the shortcomings of this algorithm when compared to Bresenham's algorithm.

Although Bresenham's algorithm is more efficient, the parametric approach does have several advantages. First, the eight-way symmetry that Mr. Hovanes mentions can be applied when drawing a parametric circle, too. Second, only floating-point additions and multiplications (i.e., no trig functions) are performed inside the loop. If your computer has a floating-point coprocessor, the execution time is within the same order of magnitude as integer arithmetic.

Lastly, if you need to draw only part of a circle (i.e., an arbitrary circular arc), the parametric algorithm can be easily adapted to start and stop where you please. After examining Bresenham's algorithm for quite a while, I'm pretty sure that it can only draw a complete circle, or one of the eight symmetric sectors.

Ben White

Mountain View, California

It's All in the Numbers

Dear DDJ,

The major point Michael Swaine makes in his November 1989 "Swaine's Flames" -- that we should not blindly accept "numerical" answers -- is well taken. Unfortunately, in the second example of incorrect use of numeric things, I believe he is in error and John Paulos is correct. In my 15 years hanging around research laboratories, I have always understood two values to be different by "two orders of magnitude" to mean different by a factor of 10^2, not, as he claims, by 10^100. If this were the case, the term would not come up very often, since 10^100 is a very large number -- about equal to the number of atoms in the universe.

I ran across a better example of incorrect number usage in an IBM ad. This ad states that the footprint of their new printer (291 square inches) is 33 percent smaller than H.P.'s LaserJet (432 square inches). Give or take a square inch, this is correct. However, the ad then concludes from this fact "And that gives you 33 percent more usable workspace." This proclamation, while sounding somehow reasonable, is correct for only one of all possible workspaces.

For example, my computer/printer space is a fairly typical 80 x 32 inches (2560 square inches). If I had a LaserJet, I would have 2560 - 432 = 2128 square inches of "usable" workspace. (Is a printer really useless?) If, according to IBM, I purchase their product to replace the LaserJet, I will have 33 percent more workspace, or 2128 x 1.33 = 2830 square inches more than the area of my table with no printer at all. Good deal, it saves buying a bigger desk!

In fact, if each time I buy an IBM printer, I get 33 percent more workspace, the purchase of 118 of them should give me control of the entire surface of the earth. However, if I need still more room, even if I only purchase one a day, inside of a year I can have the lateral dimensions of my workspace increasing at an average speed greater than light. But that, as we know, would be ridiculous.
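The figure of 118 does check out. Here is a minimal C sketch of the compound-growth arithmetic (the function name is invented for the example, and the Earth's surface is taken as roughly 5.1e8 square kilometers, at about 1.55e9 square inches per square kilometer):

```c
/* How many 33 percent workspace "gains" turn a 2128-square-inch desk
   into the surface of the Earth? */
static int printers_to_cover_earth(void)
{
    double workspace = 2128.0;           /* square inches, LaserJet removed */
    double earth = 5.101e8 * 1.55e9;     /* km^2 times square inches per km^2 */
    int printers = 0;
    while (workspace < earth) {
        workspace *= 1.33;               /* each purchase "adds" 33 percent */
        printers++;
    }
    return printers;                     /* 118, as claimed above */
}
```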

Of course, in my example, what really happens is that after the purchase of an IBM laser printer, I would have 2560 - 291 = 2269 square inches of workspace; 2269/2128 is about 7 percent more than before. This is of some benefit, of course, but it doesn't sound very impressive -- and the point of using numbers at all is to impress people -- right?

Jeffry Stetson

Villigen, Switzerland

I Fought the Law But I Won

Dear DDJ,

I was a little bit surprised by Duntemann's One Law of Portability (DDJ, March 1990): that it's virtually impossible to take source code for an on-line program and recompile it on an entirely different computer with little if any modification.

Actually, I know that it can be done, since I've done exactly that by moving Ryan-McFarland COBOL code between an IBM PC compatible and a minicomputer running Unix. And come to think of it, why can't any higher-level language include verbs that mean "display this on the user's screen" and "place the user's keyboard input into this memory location," regardless of whether the code is compiled and executed on a PC, a VAX, or a 3090?

Jacob Stein

Monsey, New York