GENERATION SCAVENGING

An efficient, unobtrusive, portable garbage collector

Frank Jackson

Frank is a member of the technical staff at ParcPlace Systems and has spent much of his time there designing, building, and evaluating various forms of automatic memory management. He can be reached at ParcPlace Systems, 1550 Plymouth St., Mountain View, CA 94043.


Nobody likes to take out the trash, and programmers are no exception. Wouldn't it be nice if someone else would gather up all the garbage that we create and dispose of it for us? It should come as a welcome relief, then, to discover that the run-time systems of programming languages such as Lisp, Smalltalk, and Prolog generally provide facilities that do exactly that. With the advent of powerful computer workstations, automatic garbage collection has become an important component of many modern interactive programming environments as well as the applications that are built using such environments.

Although traditional programming languages such as C, Fortran, and Pascal do not require the programmer to expend any effort managing the memory occupied by either the data that is allocated on the system's run-time stack or the data that is statically allocated, they do require the programmer to manage any data that is dynamically allocated on the heap. Programmers who use such languages are forced to litter their programs with explicit free statements if they wish to recycle the storage consumed by heap-allocated data that is no longer useful. By having the language's run-time system collect such garbage automatically, a certain class of well-known bugs is eliminated. For example, storage leaks cannot occur in such a system, so valuable memory is not wasted if the programmer neglects to free data that is no longer accessible. Even more important, data cannot be prematurely freed, avoiding the chaos that can result when an application tries to access data that was mistakenly recycled. Finally, the programmer is relieved of the burden of having to explicitly manage the heap, which saves development time and results in less complex code.

Given these benefits, you might expect heap-based garbage collection to be an integral component of most present-day language implementations. This is not the case, however. Most traditional programming languages were not designed with garbage collection in mind, and it is generally difficult to retrofit existing language implementations with an automatic garbage collector. Further, there are a number of serious drawbacks to the classical garbage collection algorithms. These drawbacks include:

    1. High run-time overhead, which slows the overall computation.

    2. Lengthy, unpredictable pauses while the garbage is being reclaimed.

    3. Poor virtual memory behavior, because the collector may end up touching every object in the system.

These drawbacks become even more apparent when the algorithms are deployed in modern interactive environments, given the stringent response-time requirements of these environments.

Significant progress has been made in the past decade, however, and new garbage collection techniques have been developed that all but eliminate the above drawbacks. One of these techniques -- generation scavenging -- not only addresses each of the problems just listed, but it requires no hardware support, making it portable across a wide variety of personal computers and engineering workstations. In this article, I'll discuss some of the historical events that led to the development of the original generation-scavenging algorithm. In addition, I'll describe some of the more recent refinements that significantly enhance the performance of the basic generation scavenger. In particular, the scavenging algorithm described later can be tuned so that the average pause time and the total overhead for collecting garbage can be reduced to an acceptably low level.

Classical Garbage Collection Algorithms

Automatic garbage collection has roughly a 30-year history, starting with the near-simultaneous invention of the two classical approaches to garbage collection from which most modern collection schemes are derived, namely the mark-and-sweep collection algorithm and the reference-counting algorithm. In 1960, Collins2 introduced the notion of using reference counts to determine whether a piece of data could be safely reclaimed. Each piece of data, which I shall refer to hereafter as an object, has associated with it a count of the number of other objects that reference it. When this count drops to zero, the object can be reclaimed automatically by the run-time system.

The primary advantage of the reference-counting approach is that the pauses required for such reclamations are generally imperceptible to the user, because these object reclamations can easily be distributed across the computation. In addition, the space occupied by the garbage objects can be recycled immediately, thereby reducing the total amount of memory required to complete a given computation, although this space advantage is reduced somewhat by the storage required to hold the per-object reference counts.

There are some significant disadvantages to the reference-counting approach, however. Most importantly, it can't reclaim circular garbage, because any object that is indirectly self-referential will never have a reference count of zero, even if the object is no longer accessible to those objects that are still involved in the computation. In addition, special provisions have to be made to reclaim those objects whose reference-count fields have overflowed. Accordingly, most systems that employ reference counting attempt to prevent these storage leaks either by providing a backup garbage collection system that utilizes a different collection algorithm or by incurring the expense of additional recursive scanning.

Reference counting also has high overhead because each store requires the run-time system to decrement the reference count of the object whose reference is being overwritten and to increment the reference count of the object whose reference is being stored. In addition, when an object's reference count drops to zero, the run-time system has to decrement the reference count of every object pointed to by the dying object, possibly causing the reference counts of these objects to drop to zero, forcing the system to decrement the reference counts of still more objects. Finally, additional overhead is engendered by the necessity of recycling the storage occupied by these dead objects in order to avoid running short of memory.
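
To make this bookkeeping concrete, the following minimal C sketch shows the store operation just described; the object layout, the fixed field count, and all names are illustrative assumptions rather than details taken from any particular system.

    #include <stddef.h>
    #include <stdlib.h>

    typedef struct Object {
        size_t refcount;            /* count of incoming references */
        struct Object *fields[4];   /* outgoing references (a fixed
                                       arity keeps the sketch short) */
    } Object;

    static void release(Object *obj);

    /* Every store of a reference into the heap adjusts two counts:
       the object being stored gains a reference, and the object
       whose reference is overwritten loses one. */
    static void store_reference(Object **slot, Object *newref)
    {
        Object *old = *slot;
        if (newref != NULL)
            newref->refcount++;          /* target gains a reference */
        *slot = newref;
        if (old != NULL && --old->refcount == 0)
            release(old);                /* overwritten target died  */
    }

    /* A dying object drops its own references in turn, which may
       cascade through still more objects. */
    static void release(Object *obj)
    {
        for (size_t i = 0; i < 4; i++) {
            Object *child = obj->fields[i];
            if (child != NULL && --child->refcount == 0)
                release(child);
        }
        free(obj);
    }

Note that the new reference is counted before the old one is released, so storing an object into a slot it already occupies cannot free that object prematurely.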

Subsequent refinements to the basic reference-counting algorithm by Deutsch and Bobrow4 in 1976 have succeeded in reducing the total temporal overhead to approximately ten percent, which is still a relatively high price to pay. Consequently, reference counting is no longer widely used in commercially available language implementations. The fact that reference counting permits object reclamation based strictly on local information, however, has made it relevant to systems that must operate in a distributed computing environment.

The other classical garbage collection technique, also proposed in 1960, is McCarthy's7 mark-and-sweep algorithm. Unlike reference counting, which uses local information to make its decisions, the mark-and-sweep algorithm relies on a global traversal of all live objects to decide which objects can be reclaimed. The basic mark-and-sweep algorithm works as follows:

    1. Mark all objects reachable from the system's roots as being live objects.

    2. Sweep memory, unmarking live objects and reclaiming dead objects, possibly performing a simultaneous or subsequent memory compaction.
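
For readers who want to see the shape of these two phases in code, here is a minimal C sketch. It assumes a heap in which every allocated object is chained onto a single allocation list so that the sweep can visit all of memory; that layout, the fixed field count, and all names are illustrative.

    #include <stdbool.h>
    #include <stddef.h>
    #include <stdlib.h>

    typedef struct Object {
        bool marked;
        struct Object *next_alloc;   /* chains every allocated object */
        struct Object *fields[2];    /* outgoing references           */
    } Object;

    static Object *all_objects;      /* head of the allocation chain  */
    static Object *roots[16];        /* the system's root references  */
    static size_t num_roots;

    /* Phase 1: mark everything reachable from the roots. */
    static void mark(Object *obj)
    {
        if (obj == NULL || obj->marked)
            return;
        obj->marked = true;
        for (size_t i = 0; i < 2; i++)
            mark(obj->fields[i]);
    }

    /* Phase 2: sweep all of memory, unmarking the survivors and
       reclaiming everything the mark phase never reached. */
    static void sweep(void)
    {
        Object **link = &all_objects;
        while (*link != NULL) {
            Object *obj = *link;
            if (obj->marked) {
                obj->marked = false;     /* reset for next collection */
                link = &obj->next_alloc;
            } else {
                *link = obj->next_alloc; /* unlink and reclaim        */
                free(obj);
            }
        }
    }

    static void collect(void)
    {
        for (size_t i = 0; i < num_roots; i++)
            mark(roots[i]);
        sweep();
    }

The sketch makes the cost structure plain: the mark phase visits only live objects, but the sweep phase must touch every object ever allocated, live or dead.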

Although the mark-and-sweep algorithm does reclaim circular garbage, it, too, has serious drawbacks:

    1. The entire computation must be halted while the collector runs, and the resulting pauses can be lengthy and disruptive.

    2. The collector's cost is proportional to the size of the entire memory, because the sweep phase must touch every object, live or dead.

In a virtual memory system, these pauses can be exacerbated by the paging overhead required to touch each object in the entire system. The mark phase can cause especially bad paging behavior, because it typically exhibits what is essentially random page-referencing behavior.

Modern Garbage Collection Algorithms

The high cost in time associated with the classical garbage collection algorithms was reduced somewhat by the development of the copying garbage collectors, such as that described in 1969 by Fenichel and Yochelson.5 In its simplest form, a copying collector works in the following manner: The data heap is divided into two semispaces, and object allocations are restricted to a single semispace. When that semispace fills up, the computation is paused, and the garbage collector then traces the system's roots, copying all live objects to the other semispace. Once all of the live objects have been copied, the computation can continue with new objects being allocated in the same semispace as the live objects.

Such a collector is potentially faster than the traditional mark-and-sweep collector because it touches only live objects, making its pause time proportional to the total size of the live objects rather than the size of allocated memory. Dead objects are reclaimed by virtue of the fact that they are not copied to the other semispace. The problem with this approach is twofold. First, dividing the heap into semispaces wastes space, because the computation can utilize only half of the heap at any given time. Second, the pauses required to copy all of the live objects can still be quite lengthy.
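
The following C sketch fleshes out this description using a breadth-first copying traversal in the style usually attributed to Cheney (a detail the discussion above leaves open); the heap layout, sizes, and names are illustrative, and alignment and overflow checks are omitted for brevity.

    #include <stddef.h>
    #include <string.h>

    #define SEMISPACE_BYTES (1 << 20)

    typedef struct Object {
        struct Object *forward;     /* forwarding pointer once copied */
        size_t size;                /* total size in bytes            */
        size_t num_fields;
        struct Object *fields[];    /* outgoing references            */
    } Object;

    static char space_a[SEMISPACE_BYTES], space_b[SEMISPACE_BYTES];
    static char *tospace = space_a;    /* objects are allocated here  */
    static char *fromspace = space_b;  /* idle until the next flip    */
    static char *alloc_ptr = space_a;  /* next free byte in tospace   */

    static Object *allocate(size_t num_fields)
    {
        size_t size = sizeof(Object) + num_fields * sizeof(Object *);
        Object *obj = (Object *)alloc_ptr; /* "space full" check omitted */
        alloc_ptr += size;
        obj->forward = NULL;
        obj->size = size;
        obj->num_fields = num_fields;
        memset(obj->fields, 0, num_fields * sizeof(Object *));
        return obj;
    }

    /* Copy one object to tospace (unless already copied), leaving a
       forwarding pointer at its old address. */
    static Object *copy(Object *obj)
    {
        if (obj == NULL)
            return NULL;
        if (obj->forward != NULL)
            return obj->forward;        /* already evacuated          */
        Object *new_obj = (Object *)alloc_ptr;
        alloc_ptr += obj->size;
        memcpy(new_obj, obj, obj->size);
        new_obj->forward = NULL;
        obj->forward = new_obj;
        return new_obj;
    }

    /* Flip the semispaces, copy the roots, then scan the copied
       objects in order, copying everything they reference; the
       copied region itself serves as the work queue. Garbage is
       simply abandoned in fromspace. */
    static void collect(Object **roots, size_t num_roots)
    {
        char *tmp = fromspace;
        fromspace = tospace;
        tospace = tmp;
        alloc_ptr = tospace;
        char *scan = alloc_ptr;

        for (size_t i = 0; i < num_roots; i++)
            roots[i] = copy(roots[i]);

        while (scan < alloc_ptr) {
            Object *obj = (Object *)scan;
            for (size_t i = 0; i < obj->num_fields; i++)
                obj->fields[i] = copy(obj->fields[i]);
            scan += obj->size;
        }
    }

The forwarding pointer left at an object's old address serves double duty: it marks the object as having been copied, and it records the new location so that every other reference to the object can be updated as the scan encounters it.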

To eliminate the disruptive pauses caused by this sort of stop-and-copy collector, Baker1 proposed an incremental copying collector in 1978. In this approach, the act of copying live objects from one semispace to the other was interleaved with the actual computation. This algorithm imposed some additional forms of overhead on the computation, however, including the need for the computation to monitor every read and write to the heap in order to correctly follow the forwarding pointers that were placed at an object's old address when it was copied to the other semispace. This additional overhead was typically overcome by using hardware support, such as that available on the MIT Lisp machines. Even so, the Baker collector was just as spatially inefficient as the other copying collectors, because it also divided the heap into a pair of semispaces. In addition, the total time required to copy all of the live objects with each collection was still unreasonably long.
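
A rough idea of the per-access cost can be conveyed by the following C fragment, a sketch of the check that must guard every load from the heap in a Baker-style system. The layout and names are illustrative, and Baker's actual scheme goes further, copying a fromspace object to the other semispace the moment the computation first touches it.

    typedef struct Object {
        struct Object *forward;  /* non-NULL once the object has moved */
        /* ...the object's own fields... */
    } Object;

    /* Every load of a reference must check for, and follow, a
       forwarding pointer, because the mutator runs while objects
       are still being copied. */
    static Object *load_reference(Object **slot)
    {
        Object *ref = *slot;
        if (ref != NULL && ref->forward != NULL) {
            ref = ref->forward;  /* follow the forwarding pointer    */
            *slot = ref;         /* repair the slot for future loads */
        }
        return ref;
    }

Executed in software on every heap access, even a check this small adds up quickly, which is why such collectors leaned so heavily on hardware support.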

Generation-Based Garbage Collectors

To alleviate the problems associated with these early copying collectors, language implementors began to exploit the empirical properties of data. Researchers observed that young objects tended to die while still young, whereas older objects were more likely to live on indefinitely. It made sense, then, to devote more effort to collecting young objects, where the return on the system's copying investment was likely to be high, instead of repeatedly copying older objects that simply refused to die.

To make such an approach possible, Lieberman and Hewitt6 proposed in 1983 that objects be segregated into multiple generations, with each generation containing objects of roughly the same age. In addition, they proposed that the system be designed so that each generation could be collected independently. Younger generations could then be collected more often, generally using some sort of copying algorithm. If an object survived enough collections, then it would be promoted to an older generation, where it would be collected with less frequency. This approach saved time because there were fewer objects to copy in the young generations, given the high mortality rate of young objects; it also saved space because in principle the system needed to maintain only enough free space at any given time to copy the live objects in a single generation rather than enough free space to copy all of the objects in the entire system.

The feasibility of the generation-based approach depended in part on the speed with which the garbage collector could identify the live objects in a given generation. Consequently, the implementors of the generation-based systems developed various methods for keeping track of the roots for each generation. This task was typically simplified by keeping track only of the objects in older generations that pointed directly to objects in the younger generations, and being careful to collect all of the younger generations when collecting an older generation. Nevertheless, the task of keeping track of each generation's roots added overhead to the computation. For example, each store into the heap had to be monitored to see if it created the sort of intergenerational reference that increased the number of roots for a given generation. However, the overhead required to monitor each store in such a manner was generally less than the store overhead required by the reference-counting approach. At any rate, the generation-based approach allows the language implementor to stake out an intermediate position between those taken by the reference-counting approach, which relies solely on local knowledge to reclaim garbage, and the mark-and-sweep approach, which has to traverse the entire system to reclaim garbage.
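
The store monitoring described above might look something like the following C sketch. The address-range tests and the flat remembered-set array are simplifying assumptions; real systems use a variety of more compact encodings for both.

    #include <stdbool.h>
    #include <stddef.h>

    typedef struct Object Object;

    /* Bounds of the generations, set when the spaces are created. */
    static char *old_base, *old_limit;      /* older generation   */
    static char *young_base, *young_limit;  /* younger generation */

    static Object *remembered_set[1024];  /* older objects that hold    */
    static size_t remembered_count;       /* references to younger ones */

    static bool in_old(void *p)
    {
        return (char *)p >= old_base && (char *)p < old_limit;
    }

    static bool in_young(void *p)
    {
        return (char *)p >= young_base && (char *)p < young_limit;
    }

    /* Every store of a reference into the heap passes through this
       check. Only the stores that create an old-to-young reference
       pay the cost of recording the holder as a root for the next
       collection (duplicate and overflow checks omitted). */
    static void store_reference(Object *holder, Object **slot, Object *value)
    {
        *slot = value;
        if (value != NULL && in_old(holder) && in_young(value))
            remembered_set[remembered_count++] = holder;
    }

In the common case the store costs only a few comparisons; the slow path is taken only for the comparatively rare stores that actually create an intergenerational reference.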

Even more overhead is incurred by those generation-based systems that reclaim garbage incrementally (as noted in the discussion of the Baker1 algorithm). Such systems, most notably Moon's8 ephemeral garbage collector for Lisp, generally hide much of this overhead by taking advantage of the hardware support provided by modern Lisp machines. Such systems are still in use today, and the language implementors on these machines continue to find ways to further utilize these hardware capabilities (for example, Courts3 has described a way to reduce page faults by dynamically improving the locality of reference of those objects housed on the heap).

Generation Scavenging

Many language implementors, however, especially those developing third-party software that must be deployed on a variety of hardware platforms and operating systems, can't count on having any hardware support for garbage collection at their disposal. Without hardware support, it is quite difficult to implement an efficient incremental garbage collector, primarily because of the extra overhead required to follow forwarding pointers when reading and writing to the heap. The generation-scavenging algorithm was developed to provide language implementors with a garbage collector that, like the incremental collectors, was unobtrusive but, unlike the incremental collectors, did not require hardware support in order to be reasonably efficient.

The basic algorithm, as described by Ungar10 in 1984, requires that the heap be divided into two spaces -- oldspace and newspace. oldspace is generally much larger than newspace, because oldspace is used as the repository for objects that are considered permanent. As such, oldspace is collected infrequently, typically by using a global mark-and-sweep collector. Instead, most reclamation attempts are focused on the objects in newspace. In Ungar's10 original algorithm, newspace is divided into three zones -- a creation zone and two survivor zones. New objects are allocated in the creation zone. Whenever the creation zone fills up, the computation is halted, and the scavenging mechanism copies all live objects in the creation zone to one of the survivor zones. A forwarding pointer is left behind for each object that is copied, and any other objects that reference the old location of a copied object will have these references updated when the scavenger scans them in its search for additional survivors. Once the scavenge is complete, the creation zone will be empty and can be reused. Subsequent scavenges will continue to copy those live objects that can be found in the creation zone and the occupied survivor zone to the empty survivor zone. In this scheme, then, objects are born in the creation zone and thereafter bounce back and forth between the two survivor zones until they are deemed old enough to be promoted to oldspace. Ungar11 refers to this process of promoting an object to oldspace as "tenuring."
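
The zone structure and the evacuation decision just described can be sketched in C as follows. The zone sizes, the fixed tenure threshold, and the bump-allocated oldspace are illustrative assumptions; tracing the roots and scanning the survivors for further live objects would follow the same breadth-first pattern as the copying collector shown earlier.

    #include <stddef.h>
    #include <string.h>

    #define CREATION_BYTES (400 * 1024)
    #define SURVIVOR_BYTES ( 64 * 1024)
    #define OLDSPACE_BYTES (  4 * 1024 * 1024)

    typedef struct Object {
        struct Object *forward;
        unsigned age;               /* scavenges survived so far */
        size_t size;
        size_t num_fields;
        struct Object *fields[];
    } Object;

    static char creation_zone[CREATION_BYTES];
    static char survivor_a[SURVIVOR_BYTES], survivor_b[SURVIVOR_BYTES];
    static char *from_survivor = survivor_a;  /* currently occupied  */
    static char *to_survivor = survivor_b;    /* empty; copy target  */
    static char *survivor_alloc;

    static char oldspace[OLDSPACE_BYTES];
    static char *old_alloc = oldspace;

    static unsigned tenure_threshold = 4; /* scavenges before promotion */

    /* Promote an object to oldspace. A real implementation must also
       remember the promoted object as a newspace root if it still
       references young objects. */
    static Object *tenure(Object *obj)
    {
        Object *new_obj = (Object *)old_alloc;
        old_alloc += obj->size;
        memcpy(new_obj, obj, obj->size);
        new_obj->forward = NULL;
        return new_obj;
    }

    /* Evacuate one live object: objects old enough are tenured;
       the rest are copied to the empty survivor zone. */
    static Object *evacuate(Object *obj)
    {
        if (obj == NULL)
            return NULL;
        if (obj->forward != NULL)
            return obj->forward;          /* already evacuated */

        Object *new_obj;
        if (++obj->age >= tenure_threshold) {
            new_obj = tenure(obj);
        } else {
            new_obj = (Object *)survivor_alloc;
            survivor_alloc += obj->size;
            memcpy(new_obj, obj, obj->size);
            new_obj->forward = NULL;
        }
        obj->forward = new_obj;           /* leave forwarding pointer */
        return new_obj;
    }

    static void begin_scavenge(void)
    {
        survivor_alloc = to_survivor;     /* copy into the empty zone */
    }

    /* Once every live object has been evacuated, the survivor zones
       swap roles, and the creation zone is empty and reusable. */
    static void finish_scavenge(void)
    {
        char *tmp = from_survivor;
        from_survivor = to_survivor;
        to_survivor = tmp;
    }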

Because generation scavenging is a stop-and-copy algorithm, as opposed to being incremental, there is no need for special hardware to follow forwarding pointers, because all references to these forwarding pointers are automatically updated during the scavenge. Like the other generation-based algorithms, however, the system must maintain a list of roots for newspace: It must keep a list of those objects in oldspace that contain references to objects in newspace. Maintaining this list imposes some extra overhead on every store into the heap. Even without special-purpose hardware, however, this overhead doesn't appear to be onerous for the following empirical reasons:

    1. Stores that create references from oldspace into newspace turn out to be relatively rare; the overwhelming majority of stores involve objects that already reside in newspace.

    2. The check required to detect such stores can be performed in just a few instructions, a cost that is modest compared with the per-store cost of reference counting.

In addition, the system must be able to discern the age of an object so that the scavenger can decide when the object should be tenured to oldspace. In Ungar's10 initial implementation, each object had an age field that the scavenger incremented periodically. As we shall see shortly, the requirement that the system keep track of each object's age need not impose any additional storage overhead. Generation scavenging, then, can be viewed as a two-generation system, where oldspace and newspace are the two generations, or a multi-generation system with objects of different generations (that is, different ages) being housed together in newspace.

Since the generation-scavenging algorithm was first published, many variations have been both proposed and implemented. In some systems, newspace is composed of two zones instead of three. Other implementations allow the scavenger to scavenge multiple spaces instead of restricting its purview to a single space. These spaces are sometimes arranged as pairs of semispaces and sometimes as a bucket brigade of consecutive spaces through which the surviving objects are promoted. Some systems eliminate the need for an age field by spatially segregating objects of the same age. (See Wilson and Moher13 for an example of a system that obviates the need for an age field by organizing its spaces into a bucket brigade.) Finally, various schemes have been proposed for efficiently identifying the roots of newspace. Shaw,9 for example, recently suggested combining the store check with the virtual memory mechanism that marks hardware pages as being dirty and then scanning these dirty pages for actual roots at scavenge time.

Tenure Policies

One of the key decisions that a generation scavenger must make is when to tenure an object to oldspace. Early scavengers generally employed a simple fixed-age tenure threshold: They tenured any object that had survived for a fixed amount of time or a fixed number of scavenges. Studies that Ungar and I conducted12 show that such tenure policies are not particularly effective in minimizing the amount of tenured garbage (that is, objects that die after being tenured) or in controlling the length of the pauses required to perform the scavenge. Different applications cause objects to survive for different amounts of time, so no single tenure threshold will perform optimally in all circumstances. If the tenure threshold is set too low, then oldspace will be flooded with objects that die shortly thereafter. (This problem will be further exacerbated by the effects of nepotism. See Figure 1.) And if the tenure threshold is set too high, then the scavenge pauses can easily become disruptive.

Both of the above problems can be solved by employing a tenure policy that modifies its tenure threshold dynamically according to the demographics of the object population currently housed in newspace. I will now describe how one might go about designing such a tenure policy. Because stop-and-copy collectors traditionally have problems with distracting pauses, it is important to provide the scavenger with the means to control the length of its pauses. Assuming that we have determined the maximum pause time that the scavenger can be permitted to take without being considered disruptive, we need to measure how many bytes of surviving objects the scavenger can copy in that amount of time. The scavenger can then control the length of its pauses by using this number as a watermark in the survivor zones. If the aggregate size of the objects in the survivor zone is less than this watermark, then the scavenger doesn't need to tenure any objects during the upcoming scavenge, because the pause required to scavenge these survivors will probably be acceptably brief. If, however, the size of these survivors actually exceeds this watermark, then the scavenger should tenure some objects during the next scavenge to keep the pause times from becoming disruptive.

Because the scavenger tenures objects to keep the duration of its pauses under control, we need to provide it with the means to minimize the amount of tenured garbage that it creates. Rather than tenure objects randomly, which could result in young objects that are unworthy of promotion being tenured, the scavenger uses demographic information to select a tenure threshold that will result in the desired amount of the oldest objects in newspace being tenured. The necessary demographic information can be kept in a table indexed by age that contains the number of data bytes in newspace for each age. This table can either be maintained by the scavenger as a matter of course, or it can be created on the fly by a quick scan of the occupied survivor zone. By scanning this table backwards, the scavenger can then set the appropriate tenure threshold for the ensuing scavenge. These measures permit the scavenger to tenure the minimal number of objects, choosing those with the highest likelihood of surviving (see Figure 2).
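
In C, the threshold selection might look like the sketch below, where bytes_by_age[a] holds the number of newspace data bytes of age a (the demographic table described above); the table size and all names are illustrative.

    #include <stddef.h>

    #define MAX_AGE 32

    static size_t bytes_by_age[MAX_AGE]; /* the demographic table       */
    static size_t survivor_bytes;        /* bytes now in survivor zone  */
    static size_t watermark;             /* most bytes the scavenger can
                                            copy in an acceptable pause */

    /* Choose the tenure threshold for the next scavenge. If the
       survivors already fit under the watermark, tenure nothing.
       Otherwise, scan the table from oldest to youngest, accumulating
       bytes until tenuring everything at or above the resulting age
       would bring the survivors back under the watermark. */
    static unsigned compute_tenure_threshold(void)
    {
        if (survivor_bytes <= watermark)
            return MAX_AGE;                   /* no tenuring needed */

        size_t excess = survivor_bytes - watermark;
        size_t accumulated = 0;
        unsigned age = MAX_AGE;
        while (age > 0 && accumulated < excess) {
            age--;
            accumulated += bytes_by_age[age]; /* oldest bytes first */
        }
        return age;       /* tenure objects whose age >= this value */
    }

Scanning from the oldest age downward ensures that the objects selected for tenuring are exactly those that have already survived the longest, and are therefore the most likely to keep surviving.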

Thus far, we've described a scavenger that can easily be made nondisruptive, even in the face of the varying object demographics, but what about the total scavenging overhead? Given a maximal acceptable pause time of, say, 100 milliseconds, we can drive the total scavenge overhead reasonably low by sizing the creation zone appropriately (assuming that the overhead required to perform the store checks is as low as recent studies seem to suggest). That is, if we were to size the creation zone such that it filled up once per second (and, hence, a scavenge was performed every second or so), then the total overhead for scavenging would be around ten percent. If, however, we sized the creation zone so that it filled up every three to four seconds, then the scavenge overhead would be less than three percent.
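
The arithmetic behind these figures is simply the ratio of pause time to the length of the whole scavenge cycle. The short program below reproduces the example numbers used above; it illustrates the reasoning rather than measuring anything.

    #include <stdio.h>

    int main(void)
    {
        double pause = 0.100;               /* 100-ms scavenge pause    */
        double fill_times[] = { 1.0, 3.5 }; /* seconds to fill the
                                               creation zone (examples) */
        for (int i = 0; i < 2; i++) {
            /* fraction of total time spent scavenging */
            double overhead = pause / (fill_times[i] + pause);
            printf("fill every %.1fs -> %.1f%% scavenge overhead\n",
                   fill_times[i], 100.0 * overhead);
        }
        return 0;
    }

With a one-second fill time the overhead works out to roughly 9 percent, and with a three-and-a-half-second fill time to roughly 2.8 percent, matching the figures quoted above.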

Thus, the generation scavenger described can easily be tuned in two respects: The average pause time required to perform a scavenge and the total scavenge overhead can be controlled by setting the watermark in survivor space and the size of the creation zone, respectively. Of course, the cost for reducing both pause times and scavenge overhead is paid in memory, in terms of both the memory required to size the creation zone appropriately and the space taken up by tenured garbage resulting from the need to keep the total size of the scavenge survivors less than the survivor zone watermark. For example, current Smalltalk implementations on stock hardware that utilize this particular type of scavenger typically have survivor zone watermarks that vary between 50K and 120K, resulting in worst-case pause times of 100 milliseconds, and creation zones between 400K and 800K, resulting in a scavenge overhead of less than three percent.

Because it requires neither hardware assistance nor special operating-system support, this scavenger has been successfully deployed as a component of Objectworks for Smalltalk-80 fielded by ParcPlace Systems. This particular implementation, coded entirely in C, has been ported to a wide variety of personal computers and workstations, including the Apple Macintosh family, most 386-based DOS PCs, and the workstations sold by Sun, Digital Equipment, and Hewlett-Packard.

Trouble in Paradise

Generation scavenging is not without its shortcomings, however. Certain space-consumptive programs may produce so many live objects that their sheer volume simply overwhelms the capacity of the newspace architecture described earlier. These programs typically produce substantial amounts of tenured garbage that can result in wasted memory, poor paging behavior, and lengthy interruptions required to reclaim this garbage. These problems can be mitigated somewhat by utilizing several large additional generations, but not without incurring noticeable pauses when these additional generations are scavenged with the current generation of stop-and-copy algorithms. Finding an efficient way to collect these additional generations without perceptible pauses on stock hardware will require further research.

Furthermore, real-time programs frequently require drastically shorter scavenge pauses than normal interactive programs. The measures required to reduce these pauses can also result in a significant increase in the amount of tenured garbage.

Conclusions

Generation scavenging has proven to be an efficient, unobtrusive technique for reclaiming storage among an object population where deaths outnumber survivors. In addition, it has proven to be popular among language implementors. For example, all of the commercially available Smalltalk implementations use variants of generation scavenging as their primary reclamation systems. This popularity can be attributed both to the algorithm's simplicity and to the fact that it requires no special hardware support. Finally, I expect future developments in the area of generation-based garbage collection to proceed along the following lines:

    1. Finding efficient ways to collect the older generations without perceptible pauses on stock hardware.

    2. Reducing scavenge pause times far enough to satisfy real-time applications without generating excessive amounts of tenured garbage.

    3. Improving the virtual memory behavior of garbage-collected systems, for example by increasing the locality of reference of the objects housed on the heap.

References

    1. Baker, Henry G., Jr. "List Processing in Real Time on a Serial Computer." Communications of the ACM 21(4) (April 1978).

    2. Collins, George E. "A Method for Overlapping and Erasure of Lists." Communications of the ACM 3(12) (December 1960).

    3. Courts, Robert. "Improving Locality of Reference in a Garbage-Collecting Memory Management System." Communications of the ACM 31(9) (September 1988).

    4. Deutsch, L. Peter and Bobrow, Daniel G. "An Efficient, Incremental, Automatic Garbage Collector." Communications of the ACM 19(9) (September 1976).

    5. Fenichel, Robert R. and Yochelson, Jerome C. "A Lisp Garbage Collector for Virtual Memory Computer Systems." Communications of the ACM 12(11) (November 1969).

    6. Lieberman, Henry and Hewitt, Carl. "A Real-Time Garbage Collector Based on the Lifetimes of Objects." Communications of the ACM 26(6) (June 1983).

    7. McCarthy, John. "Recursive Functions of Symbolic Expressions and Their Computation by Machine," part I. Communications of the ACM 3(4) (April 1960).

    8. Moon, David A. "Garbage Collection in a Large Lisp System." Conference Record of the 1984 ACM Symposium on LISP and Functional Programming, pages 235-246, Austin, Texas, August 1984.

    9. Shaw, Robert A. Improving Garbage Collector Performance in Virtual Memory. Technical Report CSL-TR-87-323. Stanford: Stanford University, March 1987.

    10. Ungar, David. "Generation Scavenging: A Non-disruptive High-Performance Storage Reclamation Algorithm." Proceedings of the ACM Symposium on Practical Software Development Environments, Pittsburgh, Penn., April 1984.

    11. Ungar, David. The Design and Evaluation of a High-Performance Smalltalk System. ACM 1986 Distinguished Dissertation. Cambridge, Mass.: MIT Press, 1987.

    12. Ungar, David and Jackson, Frank. "Tenuring Policies for Generation-based Storage Reclamation." OOPSLA'88 Conference Proceedings. ACM, September 1988.

    13. Wilson, Paul R. and Moher, Thomas G. "Design of the Opportunistic Garbage Collector." OOPSLA'89 Conference Proceedings, ACM, October 1989.

Figures 1 and 2 originally appeared in the OOPSLA'88 Conference Proceedings, ACM, September 1988.