| Contents |Prev |Next |Index | The Java Virtual Machine Specification |
CHAPTER 8
This chapter details the low-level actions that may be used to explain the interactionof Java virtual machine threads with a shared main memory. It has been adapted with minimal changes from Chapter 17 of the first edition ofThe Java Language Specification, by James Gosling, Bill Joy, and Guy Steele.
Every thread has aworking memory in which it keeps its ownworking copy of variables that it must use or assign. As the thread executes a program, it operates on these working copies. The main memory contains themaster copy of every variable. There are rules about when a thread is permitted or required to transfer the contents of its working copy of a variable into the master copy or vice versa.
The main memory also containslocks; there is one lock associated with each object. Threads may compete to acquire a lock.
For the purposes of this chapter, the verbsuse,assign,load,store,lock, andunlock name actions that a thread can perform. The verbsread,write,lock, andunlock name actions that the main memory subsystem can perform. Each of these operations is atomic (indivisible).
Ause orassign operation is a tightly coupled interaction between a thread's execution engine and the thread's working memory. Alock orunlock operation is a tightly coupled interaction between a thread's execution engine and the main memory. But the transfer of data between the main memory and a thread's working memory is loosely coupled. When data is copied from the main memory to a working memory, two actions must occur: aread operation performed by the main memory, followed some time later by a correspondingload operation performed by the working memory. When data is copied from a working memory to the main memory, two actions must occur: astore operation performed by the working memory, followed some time later by a correspondingwrite operation performed by the main memory. There may be some transit time between main memory and a working memory, and the transit time may be different for each transaction; thus, operations initiated by a thread on different variables may be viewed by another thread as occurring in a different order. For each variable, however, the operations in main memory on behalf of any one thread are performed in the same order as the corresponding operations by that thread. (This is explained in greater detail later.)
A single thread issues a stream ofuse,assign,lock, andunlock operations as dictated by the semantics of the program it is executing. The underlying Java virtual machine implementation is then required additionally to perform appropriateload,store,read, andwrite operations so as to obey a certain set of constraints, explained later. If the implementation correctly follows these rules and the programmer follows certain other rules of programming, then data can be reliably transferred between threads through shared variables. The rules are designed to be "tight" enough to make this possible, but "loose" enough to allow hardware and software designers considerable freedom to improve speed and throughput through such mechanisms as registers, queues, and caches.
Here are the detailed definitions of each of the operations:
Threads do not interact directly; they communicate only through the shared main memory. The relationships between the actions of a thread and the actions of main memory are constrained in three ways:
In the rules that follow, the phrasing "B must intervene betweenA andC" means that actionB must follow actionA and precede actionC.
+ operator requires that a singleuse operation occur onV ; an occurrence ofV as the left-hand operand of the assignment operator= requires that a singleassign operation occur. Alluse andassign actions by a given thread must occur in the order specified by the program being executed by the thread. If the following rules forbidT to perform a requireduse as its next action, it may be necessary forT to perform aloadfirst in order to make progress.There are also certain constraints on theread andwrite operations performed by main memory:
volatile variables(§8.7).double orlong variable is not declaredvolatile, then for the purposes ofload,store,read, andwrite operations it is treated as if it were two variables of 32 bits each; wherever the rules require one of these operations, two such operationsare performed, one for each 32-bit half. The manner in which the 64 bits of adouble orlong variable are encoded into two 32-bit quantities and the order of the operations on the halves of the variables are not defined byThe Java Language Specification.This matters only because aread orwrite of adouble orlong variable may be handled by an actual main memory as two 32-bitread orwrite operations that may be separated in time, with other operations coming between them. Consequently, if two threads concurrently assign distinct values to the same shared non-volatiledouble orlong variable, a subsequent use of that variable may obtain a value that is not equal to either of the assigned values, but rather some implementation-dependent mixture of the two values.
An implementation is free to implementload,store,read, andwrite operations fordouble andlong values as atomic 64-bit operations; in fact, this is strongly encouraged. The model divides them into 32-bit halves for the sake of currently popular microprocessors that fail to provide efficient atomic memory transactions on 64-bit quantities. It would have been simpler for the Java virtual machine to define all memory transactions on single variables as atomic; this more complex definition is a pragmatic concession to current hardware practice. In the future this concession may be eliminated. Meanwhile, programmers are cautioned to explicitly synchronize access to shareddouble andlong variables.
volatile, then the rules in the previous sections are relaxed slightly to allowstore operations to occur earlier than would otherwise be permitted. The purpose of this relaxation is to allow optimizing compilers to performcertain kinds of code rearrangement that preserve the semantics of properly synchronized programs, but might be caught in the act of performing memory operationsout of order by programs that are not properly synchronized.Suppose that astore byT ofV would follow a particularassign byT ofV according to the rules of the previous sections, with no interveningload orassign byT ofV. Then thatstore operation would send to the main memory the value that theassign operation put into the working memory of threadT. The special rule allows thestore operation actually to occur before theassign operation instead, if the following restrictions are obeyed:
synchronized methods are a convenient way to follow this convention.In other applications, it may suffice to use a single lock to synchronize access to a large collection of objects.If a thread uses a particular shared variable only after locking a particular lock and before the corresponding unlocking of that same lock, then the thread will read the shared value of that variable from main memory after thelock operation, if necessary, and will copy back to main memory the value most recently assigned to that variable before theunlock operation. This, in conjunction with the mutual exclusion rules for locks, suffices to guarantee that values are correctly transmitted from one thread to another through shared variables.
The rules for volatile variables effectively require that main memory be touched exactly once for eachuse orassign of a volatile variable by a thread, and that main memory be touched in exactly the order dictated by the thread execution semantics. However, such memory operations are not ordered with respect toread andwrite operations on nonvolatile variables.
a andb and methodshither andyon:class Sample { int a = 1, b = 2; void hither() { a = b; } void yon() b = a; }}Now suppose that two threads are created and that one thread callshither while the other thread callsyon. What is the required set of actions and what are the orderingconstraints?Let us consider the thread that callshither. According to the rules, this thread must perform ause ofb followed by anassign ofa. That is the bare minimum required to execute a call to the methodhither.
Now, the first operation on variableb by the thread cannot beuse. But it may beassign orload. Anassign tob cannot occur because the program text does not call for such anassign operation, so aload ofb is required. Thisload operation by the thread in turn requires a precedingread operation forb by the main memory.
The thread may optionallystore the value ofa after theassign has occurred. If it does, then thestore operation in turn requires a followingwrite operation fora by the main memory.
The situation for the thread that callsyon is similar, but with the roles ofa andb exchanged.
The total set of operations may be pictured as follows:

Here an arrow from actionA to actionB indicates thatA must precedeB.
In what order may the operations by the main memory occur? The only constraint is that it is not possible both for thewrite ofa to precede theread ofa and for thewrite ofb to precede theread ofb, because the causality arrows in the diagram would form a loop so that an action would have to precede itself, which is not allowed. Assuming that the optionalstore andwrite operations are to occur, there are three possible orderings in which the main memory might legitimately perform its operations. Letha andhb be the working copies ofa andb for thehither thread, letya andyb be the working copies for theyon thread, and letma andmb be the master copies in main memory. Initiallyma=1 andmb=2. Then the three possible orderings of operations and the resulting states are as follows:
a
reada,readb
writeb (thenha=2,hb=2,ma=2,mb=2,ya=2,yb=2)a
writea,writeb
readb (thenha=1,hb=1,ma=1,mb=1,ya=1,yb=1)a
writea,readb
writeb (thenha=2,hb=2,ma=2,mb=1,ya=1,yb=1)b is copied intoa,a is copied intob, or the values ofa andb are swapped; moreover, the working copies of the variables might or might not agree. It would be incorrect, of course, to assume that any one of these outcomes is more likely than another. This is one place in which the behavior of a program is necessarily timing-dependent.Of course, an implementation might also choose not to perform thestore andwrite operations, or only one of the two pairs, leading to yet other possible results.
Now suppose that we modify the example to usesynchronized methods:
class SynchSample { int a = 1, b = 2; synchronized void hither() { a = b; } synchronized void yon() b = a; }}Let us again consider the thread that callshither. According to the rules, this thread must perform alock operation (on the instance of classSynchSample on which thehither method is being called) before the body of methodhither is executed. This is followed by ause ofb and then anassign ofa. Finally, anunlock operation on that same instance ofSynchSample must be performed after the body of methodhither completes. That is the bare minimum required to execute a call to the methodhither.As before, aload ofb is required, which in turn requires a precedingread operation forb by the main memory. Because theload follows thelock operation, the correspondingread must also follow thelock operation.
Because anunlock operation follows theassign ofa, astore operation ona is mandatory, which in turn requires a followingwrite operation fora by the main memory. Thewrite must precede theunlock operation.
The situation for the thread that callsyon is similar, but with the roles ofa andb exchanged.
The total set of operations may be pictured as follows:

Thelock andunlock operations provide further constraints on the order of operations by the main memory; thelock operation by one thread cannot occur between thelock andunlock operations of the other thread. Moreover, theunlock operations require that thestore andwrite operations occur. It follows that only two sequences are possible:
a
reada,readb
writeb (thenha=2,hb=2,ma=2,mb=2, ya=2,yb=2)a
writea,writeb
readb (thenha=1,hb=1,ma=1,mb=1,ya=1,yb=1)a andb.a andb and methodsto andfro:class Simple { int a = 1, b = 2; void to() { a = 3; b = 4; } void fro() System.out.println("a= " + a + ", b=" + b); }}Now suppose that two threads are created and that one thread callsto while the other thread callsfro. What is the required set of actions and what are the ordering constraints?Let us consider the thread that callsto. According to the rules, this thread must perform anassign ofa followed by anassign ofb. That is the bare minimum required to execute a call to the methodto. Because there is no synchronization, it is at the option of the implementation whether or not tostore the assigned values back to main memory! Therefore, the thread that callsfro may obtain either1 or3 for the value ofa and independently may obtain either2 or4 for the value ofb.
Now suppose thatto issynchronized butfro is not:
class SynchSimple { int a = 1, b = 2; synchronized void to() { a = 3; b = 4; } void fro() System.out.println("a= " + a + ", b=" + b); }}In this case the methodto will be forced tostore the assigned values back to main memory before theunlock operation at the end of the method. The methodfro must, of course, usea andb (in that order) and so mustload values fora andb from main memory.The total set of operations may be pictured as follows:
Here an arrow from actionA to actionB indicates thatA must precedeB.
In what order may the operations by the main memory occur? Note that the rules do not require thatwritea occur beforewriteb; neither do they require thatreada occur beforereadb. Also, even though methodto is synchronized, methodfro is not synchronized, so there is nothing to prevent theread operations from occurring between thelock andunlock operations. (The point is that declaring one methodsynchronized does not of itself make that method behave as if it were atomic.)
As a result, the methodfro could still obtain either1 or3 for the value ofa and independently could obtain either2 or4 for the value ofb. In particular,fro might observe the value1 fora and4 forb. Thus, even thoughto does anassign toa and then anassign tob, thewrite operations to main memory may be observed by another thread to occur as if in the opposite order.
Finally, suppose thatto andfro are bothsynchronized:
class SynchSynchSimple { int a = 1, b = 2; synchronized void to() { a = 3; b = 4; } synchronized void fro() System.out.println("a= " + a + ", b=" + b); }}In this case, the actions of methodfro cannot be interleaved with the actions of methodto, and sofro will print either "a=1, b=2" or "a=3, b=4".Thread andThreadGroup. CreatingaThread object creates a thread, and that is the only way to create a thread. When the thread is created, it is not yet active; it begins to run when itsstart method is called.Thesynchronized statement computes a reference to an object; it then attempts to perform alock operation on that object and does not proceed further until thelock operation has successfully completed. (Alock operation may be delayed because the rules about locks can prevent the main memory from participating until some other thread is ready to perform one or moreunlock operations.) After the lock operation has been performed, the body of thesynchronized statement is executed. Normally, a compiler for the Java programming language ensures that thelock operation implemented by amonitorenter instruction executed prior to the execution of the body of thesynchronized statement is matched by an unlock operation implemented by amonitorexit instruction whenever thesynchronized statement completes, whether completion is normal or abrupt.
Asynchronized method automatically performs alock operation when it is invoked; its body is not executed until thelock operation has successfully completed. If the method is an instance method, it locks the lock associated with the instance for which it was invoked (that is, the object that will be known asthis during execution of the method's body). If the method isstatic, it locks the lock associated with theClass object that represents the class in which the method is defined. If execution of the method's body is ever completed, either normally or abruptly, anunlock operation is automatically performed on that same lock.
Best practice is that if a variable is ever to be assigned by one thread and used or assigned by another, then all accesses to that variable should be enclosed insynchronized methods orsynchronized statements.
Although a compiler for the Java programming language normally guarantees structured use of locks (seeSection 7.14, "Synchronization"), there is no assurance that all code submitted to the Java virtual machine will obey this property. Implementations of the Java virtual machine are permitted but not required to enforce both of the following two rules guaranteeing structured locking.
LetT be a thread andL be a lock. Then:
Note that the locking and unlocking automatically performed by the Java virtual machine when invoking a synchronized method are considered to occur during the calling method's invocation.
Wait sets are used by the methodswait,notify, andnotifyAll of classObject. These methods also interact with the scheduling mechanism for threads.
The methodwait should be invoked for an object only when the current thread (call itT ) has already locked the object's lock. Suppose that threadT has in fact performedNlock operations on the object that have not been matched byunlock operations on that same object. Thewait method then adds the current thread to the wait set for the object, disables the current thread for thread scheduling purposes, and performsNunlock operations on the object to relinquish the lock on it. Locks having been locked by threadT on objects other than the oneT is to wait on are not relinquished. The threadT then lies dormant until one of three things happens:
notify method for that object, and threadT happens to be the one arbitrarily chosen as the one to notify.notifyAll method for that object.wait method specified a time-out interval, then the specified amount of real time elapses.wait method. Thus, on return from thewait method, the state of the object's lock is exactly as it was when thewait method was invoked.Thenotify method should be invoked for an object only when the current thread has already locked the object's lock, or anIllegalMonitorStateException will be thrown. If the wait set for the object is not empty, then some arbitrarily chosen thread is removed from the wait set and reenabled for thread scheduling. (Of course, that thread will not be able to proceed until the current thread relinquishes the object's lock.)
ThenotifyAll method should be invoked for an object only when the current thread has already locked the object's lock, or anIllegalMonitorStateException will be thrown. Every thread in the wait set for the object is removed from the wait set and reenabled for thread scheduling. (Those threads will not be able to proceed until the current thread relinquishes the object's lock.)
Virtual Machine Specification
Copyright © 1999 Sun Microsystems, Inc.All rights reserved
Please send any comments or corrections through ourfeedback form