BACKGROUND OF THE INVENTION 1. Field of the Invention:
The present invention relates generally to a data processing system. In particular, the present invention relates to a Java™ runtime environment in a data processing system. Still more particularly, the present invention relates to optimizing a classpath environment variable in the Java™ runtime environment by generating a new classpath environment variable that only includes directories containing classes that are required by a current Java™ program.
2. Description of the Related Art:
In the Java™ runtime environment, when a user invokes a Java™ program, the Java™ class loader loads bootstrap classes, extension classes, and user classes. Java™ runtime environment (JRE) is a product available from Sun Microsystems, Inc. Bootstrap classes are classes of the basic Java™ platform. Extension classes are Java™ archive (JAR) files that are required by the Java™ platform. User classes are classes that are defined by program developers. Currently, the location of the user classes is identified by the classpath environment variable, which lists directories and JAR files containing class files. The classpath environment variable is typically set on a user login to be used for all of the Java™ programs.
When Java™ programs are installed, Java™ class loader loads packages containing classes that are required by the Java™ program by traversing the classpath environment variable as defined by the user. In this way, the required classes may be identified by the Java™ class loader. However, as the number of directories or JAR files defined in the classpath environment variable increases or as the number of Java™ programs increases, searching through the classpath environment variable becomes very time consuming. The number of classes or JAR files in a directory also is a factor. Thus, the number of classes, directories, and JAR files may increase as additional Java™ programs are installed.
Furthermore, some Java™ programs may be executed repeatedly with different input data. These programs include programs that process a file of credit card transactions, cumulative rainfall amounts, seismic data, images, or other types of information. These repeatedly executed programs are either time sensitive or run on resources that are constrained by usage or time. If all directories and JAR files in the classpath environment variable are searched each time one of these programs is executed, performance of the program may suffer.
Since the required classes must be loaded before the Java™ program can be executed, it would be advantageous to have the classpath environment variable that only contain relevant classes for the current Java™ program, such that the time spent searching for the required classes in the classpath environment variable may be minimized.
SUMMARY OF THE INVENTION A computer implemented method, a data processing system, and computer usable program code are provided for classpath optimization in a platform independent runtime environment. A required class is identified in an import statement in source code for a software program, wherein the software program requires the required class. In a classpath environment variable, an element containing the required class is identified to form an identified element. Using the identified element, a new classpath environment variable is generated.
BRIEF DESCRIPTION OF THE DRAWINGS The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
FIG. 1 is a block diagram of a data processing system in which exemplary aspects of the present invention may be implemented;
FIG. 2 is a diagram illustrating interactions between aspects of the present invention in accordance with an illustrative embodiment of the present invention;
FIG. 3 is a block diagram illustrating the relationship of software components operating within a computer system invention in accordance with an illustrative embodiment of the present invention;
FIG. 4 is a block diagram of a Java™ virtual machine in accordance with an illustrative embodiment of the present invention;
FIG. 5 is a diagram illustrating a known script for setting the classpath environment variable;
FIG. 6 is a diagram illustrating a known classpath environment variable generated;
FIG. 7 is a diagram illustrating a known classpath environment variable;
FIG. 8 is a diagram illustrating a known Java™ program source file;
FIG. 9 is a diagram illustrating an exemplary classpath environment variable generated by the aspects of the present invention in accordance with an illustrative embodiment of the present invention;
FIG. 10 is a flowchart of an exemplary process for classpath environment variable optimization in accordance with an illustrative embodiment of the present invention; and
FIG. 11 is an exemplary implementation of classpath environment variable optimization in accordance with an illustrative embodiment of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTFIGS. 1-2 are provided as exemplary diagrams of data processing environments in which embodiments of the present invention may be implemented. It should be appreciated thatFIGS. 1-2 are only exemplary and are not intended to assert or imply any limitation with regard to the environments in which aspects or embodiments of the present invention may be implemented. Many modifications to the depicted environments may be made without departing from the spirit and scope of the present invention.
With reference now to the figures,FIG. 1 depicts a pictorial representation of a network of data processing systems in which aspects of the present invention may be implemented. Networkdata processing system100 is a network of computers in which embodiments of the present invention may be implemented. Networkdata processing system100 containsnetwork102, which is the medium used to provide communication links between various devices and computers connected together within networkdata processing system100. Network102 may include connections, such as wire, wireless communication links, or fiber optic cables.
In the depicted example,server104 andserver106 connect tonetwork102 along withstorage108. In addition,clients110,112, and114 connect tonetwork102. Theseclients110,112, and114 may be, for example, personal computers or network computers. In the depicted example,server104 provides data, such as boot files, operating system images, and applications toclients110,112, and114.Clients110,112, and114 are clients to server104 in this example. Networkdata processing system100 may include additional servers, clients, and other devices not shown.
In the depicted example, networkdata processing system100 is the Internet withnetwork102 representing a worldwide collection of networks and gateways that use the Transmission Control Protocol/Internet Protocol (TCP/IP) suite of protocols to communicate with one another. At the heart of the Internet is a backbone of high-speed data communication lines between major nodes or host computers, consisting of thousands of commercial, governmental, educational, and other computer systems that route data and messages. Of course, networkdata processing system100 also may be implemented as a number of different types of networks, such as for example, an intranet, a local area network (LAN), or a wide area network (WAN).FIG. 1 is intended as an example, and not as an architectural limitation for different embodiments of the present invention.
With reference now toFIG. 2, a block diagram of a data processing system is shown in which aspects of the present invention may be implemented.Data processing system200 is an example of a computer, such asserver104 orclient110 inFIG. 1, in which computer usable code or instructions implementing the processes for embodiments of the present invention may be located.
In the depicted example,data processing system200 employs a hub architecture including north bridge and memory controller hub (NB/MCH)202 and south bridge and input/output (I/O) controller hub (SB/ICH)204.Processing unit206,main memory208, andgraphics processor210 are connected to NB/MCH202.Graphics processor210 may be connected to NB/MCH202 through an accelerated graphics port (AGP).
In the depicted example,network adapter212 connects to SB/ICH204.Audio adapter216, keyboard andmouse adapter220,modem222, read only memory (ROM)224, hard disk drive (HDD)226, CD-ROM drive230, universal serial bus (USB) ports andother ports232, and PCI/PCIe devices234 connect to SB/ICH204 throughbus238 andbus240. PCI/PCIe devices may include, for example, Ethernet adapters, add-in cards, and PC cards for notebook computers. PCI uses a card bus controller, while PCIe does not.ROM224 may be, for example, a flash binary input/output system (BIOS).
HDD226 and CD-ROM drive230 connect to SB/ICH204 throughbus240.HDD226 and CD-ROM drive230 may use, for example, an integrated drive electronics (IDE) or serial advanced technology attachment (SATA) interface. Super I/O (SIO)device236 may be connected to SB/ICH204.
An operating system runs onprocessing unit206 and coordinates and provides control of various components withindata processing system200 inFIG. 2. As a client, the operating system may be a commercially available operating system such as Microsoft® Windows® XP (Microsoft and Windows are trademarks of Microsoft Corporation in the United States, other countries, or both). An object-oriented programming system, such as, the Java™ programming system, may run in conjunction with the operating system and provides calls to the operating system from Java™ programs or applications executing on data processing system200 (Java™ is a trademark of Sun Microsystems, Inc. in the United States, other countries, or both).
As a server,data processing system200 may be, for example, an IBM eServer™ pSeries® computer system, running the Advanced Interactive Executive (AIX®) operating system or the LINUX operating system (eserver, pSeries and AIX are trademarks of International Business Machines Corporation in the United States, other countries, or both while LINUX is a trademark of Linus Torvalds in the United States, other countries, or both).Data processing system200 may be a symmetric multiprocessor (SMP) system including a plurality of processors inprocessing unit206. Alternatively, a single processor system may be employed. Instructions for the operating system, the object-oriented programming system, and applications or programs are located on storage devices, such asHDD226, and may be loaded intomain memory208 for execution by processingunit206. The processes for embodiments of the present invention are performed by processingunit206 using computer usable program code, which may be located in a memory such as, for example,main memory208,ROM224, or in one or moreperipheral devices226 and230.
Those of ordinary skill in the art will appreciate that the hardware inFIGS. 1-2 may vary depending on the implementation. Other internal hardware or peripheral devices, such as flash memory, equivalent non-volatile memory, or optical disk drives and the like, may be used in addition to or in place of the hardware depicted inFIGS. 1-2. Also, the processes of the present invention may be applied to a multiprocessor data processing system.
In some illustrative examples,data processing system200 may be a personal digital assistant (PDA), which is configured with flash memory to provide non-volitile memory for storing operating system files and/or user-generated data.
A bus system may be comprised of one or more buses, such asbus238 orbus240 as shown inFIG. 2. Of course, the bus system may be implemented using any type of communication fabric or architecture that provides for a transfer of data between different components or devices attached to the fabric or architecture. A communication unit may include one or more devices used to transmit and receive data, such asmodem222 ornetwork adapter212 as shown inFIG. 2. A memory may be, for example,main memory208,ROM224, or a cache such as found in NB/MCH202 as shown inFIG. 2. The depicted examples inFIGS. 1-2 and above-described examples are not meant to imply architectural limitations. For example,data processing system200 also may be a tablet computer, laptop computer, or telephone device in addition to taking the form of a PDA.
With reference now toFIG. 3, a block diagram illustrating the relationship of software components operating within a computer system that may implement the present invention. Java™-basedsystem300 contains platformspecific operating system302 that provides hardware and system support to software executing on a specific hardware platform. Java™ Virtual Machine (JVM™)304 is one software application that may execute in conjunction with the operating system.JVM™304 provides a Java™ run-time environment with the ability to execute Java™ application orapplet306, which is a program, servlet, or software component written in Java™ programming language. The computer system in whichJVM™304 operates may be similar todata processing system200 or networkdata processing system100, as described above. However,JVM™304 may be implemented in dedicated hardware on a so-called Java™ chip, Java™-on-silicon, or Java™ processor with an embedded pico Java™ Core.
At the center of a Java™ run-time environment is the JVM™, which supports all aspects of Java™'s environment, including its architecture, security features, mobility across networks, and platform independence. The JVM™ is a virtual computer component that actually executes Java™ programs. Java™ programs are not run directly by the central processor but instead by the JVM™, which is itself a piece of software running on the processor. The JVM™ allows Java™ programs to be executed on a different platform as opposed to only one platform for which the code was compiled. Java™ programs are compiled for the JVM™. In this manner, Java™ is able to support applications for many types of data processing systems, which may contain a variety of central processing units and operating system architectures.
To enable a Java™ application to execute on different types of data processing systems, a compiler typically generates an architecture-neutral file format—the compiled code is executable on many processors, given the presence of the Java™ n run-time system. The Java™ compiler generates bytecode instructions that are nonspecific to a particular computer architecture.
A bytecode is a machine independent code generated by the Java™ compiler and executed by a Java™ interpreter. A Java™ interpreter is part of the JVM™ that alternately decodes and interprets the bytecode or bytecodes. The bytecode instructions are designed to be easy to interpret on any computer and easily translated on the fly into native machine code. Bytecodes may be translated into native machine code by a just-in-time compiler or JIT.
The JVM™ loads the class files and executes the bytecodes within them. The class files are loaded by a class loader in the JVM™. The class loader loads class files from an application and the class files from the Java™ application programming interfaces (APIs) which are needed by the application. The execution engine that executes the bytecodes may vary across platforms and implementations.
One type of software-based execution engine is the just-in-time compiler. With this type of execution, the bytecodes of a method are compiled to native machine code upon successful fulfillment of some type of criteria for jitting a method. The native machine code for the method is then cached and reused upon the next invocation of the method. The execution engine may also be implemented in hardware and embedded on a chip so that the Java™ bytecodes are executed natively. JVM™s usually interprets bytecodes but JVM™s may also use other techniques, such as just-in-time compiling, to execute bytecodes.
When an application is executed on the JVM™ that is implemented in software on a platform-specific operating system, a Java™ application may interact with the host operating system by invoking native methods. A Java™ method is written in the Java™ language, compiled to bytecodes, and stored in class files. A native method is written in some other language and compiled to the native machine code of a particular processor. Native methods are stored in a dynamically linked library whose exact form is platform specific.
With reference now toFIG. 4, a block diagram of a JVM™ is depicted in accordance with an illustrative embodiment of the present invention.JVM™400 includesclass loader subsystem402, which is a mechanism for loading types, such as classes and interfaces given fully qualified names.JVM™400 also containsruntime data areas404,execution engine406,native method interface408, andmemory management410.Execution engine406 is a mechanism for executing instructions contained in methods of classes loaded byclass loader subsystem402.Execution engine406 may be, for example,Java™ interpreter412 orcompiler414.Native method interface408 allows access to resources in the underlying operating system.Native method interface408 may be, for example, the Java™ Native Interface (JNI).
Runtime data areas404 contain native method stacks416, Java™ stacks418, PC registers420,method area422, andheap424. These different data areas represent the organization of memory needed byJVM™400 to execute a program.
Java™ stacks418 are used to store the state of Java™ method invocations. When a new thread is launched, the JVM™ creates a new Java™ stack for the thread. The JVM™ performs only two operations directly on Java™ stacks: it pushes and pops frames. The thread's Java™ stack stores the state of Java™ method invocations for the thread. The state of the Java™ method invocation includes its local variables, the parameters with which it was invoked, its return value, if any, and intermediate calculations. Java™ stacks are composed of stack frames. A stack frame contains a state of a single Java™ method invocation. When a thread invokes a method, the JVM™ pushes a new frame onto the Java™ stack of the thread. When the method completes, the JVM™ pops the frame for that method and discards it. The JVM™ does not have any registers for holding intermediate values; any Java™ instruction that requires or produces an intermediate value uses the stack for holding the intermediate values. In this manner, the Java™ instruction set is well defined for a variety of platform architectures.
Program counter (PC) registers420 are used to indicate the next instruction to be executed. Each instantiated thread gets its own PC register and Java™ stack. If the thread is executing a JVM™ method, the value of the PC register indicates the next instruction to execute. If the thread is executing a native method, then the contents of the PC register are undefined. Native method stacks416 store the state of invocations of native methods. The state of native method invocations is stored in an implementation-dependent way in native method stacks, registers, or other implementation-dependent memory areas. In some JVM™ implementations, native method stacks416 and Java™ stacks418 are combined.
Method area422 contains class data whileheap424 contains all instantiated objects. The constant pool is located inmethod area422 in these examples. The JVM™ specification strictly defines data types and operations. Most JVM™s choose to have one method area and one heap, each of which are shared by all threads running inside the JVM™, such asJVM™400. WhenJVM™400 loads a class file, it parses information about a type from the binary data contained in the class file.JVM™400 places this type of information into the method area. Each time a class instance or array is created, the memory for the new object is allocated fromheap424.JVM™400 includes an instruction that allocates memory space within the memory forheap424 but does not include instructions for freeing space within the memory.Memory management410 in the depicted example manages memory space within the memory allocated toheap424.Memory management410 may include a garbage collector, which automatically reclaims memory used by objects that are not referenced. Additionally, a garbage collector also may move objects to reduce heap fragmentation.
Currently, a Java™ class loader inclass loader subsystem402loads class files426 in order to execute a Java™ program. The Java™ class loader identifies class files426 that are required by the Java™ program by traversingclasspath environment variable428.classpath environment variable428 tells the Java™ class loader inJVM™400 where to findclass files426, including user-defined class files to load. The Java™ class loader traverses the classpath environment variable and examines all of the directories or JAR files in the classpath environment variable to determine if the required class files are present. However, as the number of directories and JAR files to be examined increases in the classpath environment variable, class loading efficiency suffers because there are more and more directories or JAR files to search through. In addition, if the Java™ program is executed repeatedly, the time spent on searching for the required classes may affect performance of the program.
Aspects of the present invention provide a computer implemented method, data processing system, and computer usable program code for classpath optimization by generating a new classpath environment variable that only include elements containing classes that are required by a current Java™ program. These elements include directories and JAR files. In an illustrative embodiment, aspects of the present invention may be implemented in a Java™ class loader withinclass loader subsystem402 ofJVM™400 inFIG. 4.
Turning now toFIG. 5, a diagram illustrating a known script for setting the classpath environment variable is depicted. As shown inFIG. 5,script500 sets theclasspath environment variable501 to include every directory and JAR file that is in a user environment. This includes a library in /home/utexas/staff/mausolf/mycondor directory502, each file in each jar file of thelibrary504, and a rt.jar file in the /usr/java/j2sdkl.4.1—05/jre/lib directory506. Thus, everything in the user environment, including all of the directories and JAR files, are included in the classpath environment variable regardless of whether it is relevant to a current Java™ program.
Turning now toFIG. 6, a diagram illustrating a known classpath environment variable generated using a known script is depicted. Afterscript500 inFIG. 5 is run,classpath environment variable600 is generated.Classpath environment variable600 includes every directory and JAR file in the user environment including those that are not relevant to the current Java™ program.
To avoid examining all directories or JAR files identified in the classpath environment variable that are irrelevant to the program, the aspects of the present invention generate a new classpath environment variable by including only those directories or JAR files that contain classes that are required by the current Java™ program. The aspects of the present invention generate the new classpath environment variable by examining the import statements in the source code for the current Java™ program and removing directories or JAR files that do not contain classes identified in the import statements. An import statement includes the source code of another file into the source code of the current file.
The import statements in the source code specify classes that are required for the Java™ program. For example, import com.ibm.my_project.* statement indicates all class files within the directory com/ibm/my_project are required for the Java™ program. If dependencies exist in the required class files of the import statements, the aspects of the present invention examine the required class files themselves in a recursive manner until all required classes are identified. In order to identify directories or JAR files in the import statements to be removed, aspects of the present invention examine each of the import statements to identify the required class files.
Next, the aspects of the present invention search each directory identified in the classpath environment variable for the required class files. If none of the required class files are present in the directory, the fully qualified name of the JAR file in the directory is removed from the classpath environment variable. If the classpath environment variable includes JAR files, the aspects of the present invention searches each JAR file identified in the classpath environment variable for the required class files by utilizing a “jar-tvf” command. The “jar -tvf” command lists the content of a JAR file. If none of the required class files is present in the JAR file, the JAR file is removed from the classpath environment variable.
The aspects of the present invention continue to search for the required class files until the required class files are located. Once all of the required class files in the import statements are located, a new classpath environment variable is generated with the remaining directories or JAR files in the classpath environment variable. In other words, the new classpath environment variable only includes those directories or JAR files that contain the required classes. In this way, the time spent searching for the required classes in the classpath environment variable may be minimized and the Java™ class loader may locate and load the required classes faster.
Turning now toFIG. 7, a diagram illustrating a known classpath environment variable is depicted. As shown inFIG. 7,classpath environment variable700 identifies two directories where the JVM™ may find the class files. In this example, the two directories are /user/local/classes702 and /user/local/lib704. Within each of the two directories, there may be hundreds of class files and libraries. Thus, searching through the two directories for class files at runtime may be time consuming.
Turning now toFIG. 8, a diagram illustrating a known Java™ program source file is depicted. As shown inFIG. 8, Java™program source code800 includes two import statements, which indicate the class files that are required by the current Java™ program. The two import statements areimport statements802 and804.Import statement802 indicates that class file com.ibm.my_project.registration is required.Import statement804 indicates that class file com.ibm.my_project.utilities is required.
Currently, the Java™ class loader searches through hundreds of class files and libraries indirectories702 and704 inFIG. 7 to find the required class files as indicated inimport statements802 and804 inFIG. 8. In order to avoid searching through all of the files and libraries, aspects of the present invention generate a new classpath environment variable that includes only directories and JAR files that contain the required class files, such that the search time may be reduced.
In order to generate the new classpath environment variable, aspects of the present invention examine each ofimport statements802 and804 inFIG. 8 to identify required class files com.ibm.my_project.registration and com.ibm.my_project.utilities. Next, the aspects of the present invention search each directory identified in the classpath environment variable, /user/local/classes702 and /user/local/lib704 for the required class files. If none of the required class files are present in the directory, the fully qualified name of a JAR file in the directory is removed from the classpath environment variable. For example, if no com.ibm directory exists in /user/local/classes702, the fully qualified name of the JAR file in /user/local/classes directory702 is removed from the classpath environment variable, such that the directory would not be searched by the Java™ class loader.
On the other hand, /user/local/lib704 includes a plurality of JAR files, including my_project.jar. The aspects of the present invention searches each JAR file identified in /user/local/lib704 for the required class files by utilizing a jar -tvf command. The jar -tvf command lists the contents of a JAR file. For example, a “jar-tvf | grep com.ibm.my_project” command is used to list the contents of the my_project.jar.
Based on the list of contents, the aspects of the present invention determines if the required class files com.ibm.my_project.registration and com.ibm.my_project.utilities are present in my_project.jar. If the required class files are absent, the fully qualified path name of the jar file in the directory is removed from the classpath environment variable. However, if the required class files are present, a new classpath environment variable is generated with only the directories or JAR files that contain the required class files.
Turning now toFIG. 9, a diagram illustrating an exemplary classpath environment variable is depicted in accordance with an illustrative embodiment of the present invention. As shown inFIG. 9, newclasspath environment variable900 is generated by the aspects of the present invention after traversingclasspath environment variable700 and removing any directories or JAR files that do not contain the required class files for the currentJava™ program800, as described above.
Newclasspath environment variable900 includes only two JAR files: /user/local/lib/runtime.jar902 and /user/local/lib/my_project.jar904. In this example,runtime.jar902 is needed by the Java™ runtime environment.My_project.jar904 is a single JAR file that is required byprogram800. Instead of searching through hundreds of class files and libraries in /user/local/classes802 and each JAR file in /user/local/lib804 as shown inFIG. 8, the Java™ class loader may simply loadruntime.jar902 andmy_project.jar904 to executeprogram800. Once the newclasspath environment variable900 is generated, it may be stored in a classpath file with a name of the current Java™ program and a .classpath extension. For example, test.classpath.
In one illustrative embodiment, when the user invokes the program, such asprogram800, the program is manually invoked by using a “java -c” command with the parameter “new_classpath” program_name. For example, “java -c test.classpath test”. The “java -c” command specifies the classpath to be used by the Java™ class loader. In an alternative embodiment, when the user invokes the program, the Java™ runtime environment looks for a classpath file with the program name “test” followed by the .classpath extension. If the classpath file is found, the new classpath environment variable is set for the program and the program is invoked using the new classpath environment variable. However, if the classpath file is not found, the aspects of the present invention may generate a new classpath file automatically according to the steps as described above. The aspect of the present invention then uses the new classpath file to invoke the program and saves the classpath file with a .classpath extension for subsequent use. However, the automatic generation of optimized classpath environment variable is optional. For programs that are run infrequently, the user may not want the overhead of optimizing the classpath environment variable each time the program is run. On the other hand, if the program is invoked frequently and is time sensitive, the user may want to optimize the classpath environment variable such that each subsequent execution of the program will take less time and be more efficient. This efficiency is achieved because the optimized classpath environment variable is only generated once for the program, but is subsequently used each time the program is run.
Turning now toFIG. 10, a flowchart of an exemplary process for classpath environment variable optimization is depicted in accordance with an illustrative embodiment of the present invention. This process may be implemented within a Java™ class loader within a class loader subsystem, such asclass loader subsystem402 ofJVM™400 inFIG. 4.
As shown inFIG. 10, from the perspective of the Java™ class loader, the process begins when aspects of the present invention examines the import statements of the Java™ program to identify class files that are required by the program (step1000). Next, aspects of the present invention search the next directory that is identified in the classpath environment variable for the required class files (step1002). A determination is made by aspects of the present invention as to whether the directory contains the required class files (step1004). If the directory contains the required class files, the directory is included in the new classpath environment variable (step1006). If the directory does not contain the required class files, the aspects of the present invention remove the fully qualified name of the JAR file in the directory from the classpath environment variable (step1008).
A determination is then made as to whether additional directories are present in the classpath environment variable (step1010). If additional directories are present, the process returns to step1002 to search the next directory for the required class files. However, if additional directories are not present, the aspects of the present invention search the next JAR file identified in the classpath environment variable for the required class files using the “jar-tvf” command (step1012). The “jar -tvf” command lists the contents of the JAR file.
A determination is then made as to whether the contents of the JAR file contain the required class files (step1014). If the JAR file contains the required class files, the JAR file is included in the new classpath environment variable (step1016). If the JAR file does not contain the required class files, the aspects of the present invention remove the JAR file from the classpath environment variable (step1018).
Next, a determination made as to whether additional JAR files are present in the classpath environment variable (step1020). If additional directories are present, the process returns to step1012 to search the next JAR file for the required class files. However, if additional JAR files are not present, the aspects of the present invention generate the new classpath environment variable with the directories and JAR files that contain the required class files (step1022). The new classpath environment variable is then stored in a classpath file with the program name and .classpath extension (step1024).
Later, when the user invokes the Java™ program (step1025), in one embodiment, the Java™ runtime environment looks for a classapth file with a program name followed by a .classpath extension (step1026). A determination is made by the Java™ runtime environment as to whether the classpath file exists (step1028). If the classpath file exists, the program is invoked by using a “java -c “new_classpath” program_name” command (step1030). The new_classpath is the name of the classpath file stored with the new classpath environment variable instep1024. Thus, the process terminates thereafter. However, if the classpath file does not exist, the process may return to step1000 and repeat the steps to automatically generate a new classpath environment variable. Instead of having to search through all of the directories and JAR files in the classpath environment variable each time a repeatedly executed program is executed, the new classpath environment variable in the classpath file may be used repeatedly by the Java™ runtime environment.
It is noted that automatic generation of a new classpath environment variable is optional. For programs that are run infrequently, the user may not want the overhead of optimizing the classpath environment variable. On the other hand, if the program is invoked frequently and is time sensitive, the user may want to optimize the classpath environment variable such that each subsequent execution of the program will be more efficient. This efficiency is achieved because the optimized classpath environment variable is only generated once for the program, but is subsequently used each time the program is run. Thus, the optimized classpath environment variable does not have to be generated for a program that is run infrequently or that is not time-sensitive.
In addition to including only directories and JAR files that contain the required classes, other types of classpath optimizations may be performed. For example, the classpath environment variable may be reordered starting from directories or JAR files with the highest number of classes to directories or JAR files with the lowest number of classes. By reordering the classpath environment variable based on the number of classes the directories or JAR files contain, the likelihood that the required class files are found early in the search process is increased because the probability that the required classes are found is higher in a bigger set of directories and JAR files than a smaller set of directories and JAR files. This optimization decreases the search time of the Java™ class loader and the execution time of the program, hence, improves the program performance.
Turning now toFIG. 11, an exemplary implementation of classpath environment variable optimization is depicted in accordance with an illustrative embodiment of the present invention. This exemplary implementation utilizes the process as depicted inFIG. 10. As shown inFIG. 11, the optimization process begins when a user invokes an optimization command “javaoptCp” to optimize the classpath environment variable for a Java™ program named “my_program.java” (step1100). Java™ source file1102 includes two import statements, which specify two required class files: com.ibm.my_project.registration1104 and com.ibm.my_project.utilities1106.
Upon receiving the “javaOptcp” command, aspects of the present invention identify the required classes in the import section in source code of Java™ source file1102 (step1108). The aspects of the present invention then locate the required classes in the directories as specified by classpath environment variable1110 (step1112).Classpath environment variable1110 includes directories and JAR files that are required by different Java™ programs. Once the required classes are located, the aspects of the present invention remove the directories that do not contain required classes from the classpath environment variable (step1114), since these directories are not required for this particular program, my_program.java. Newclasspath environment variable1116 is generated by aspects of the present invention as a result, which includes only “/home/mausolf/dev”directory1116.
Aspects of the present invention then save the new classpath environment variable in an optimized classpath file with the same filename as the Java™ program but with a .classpath extension (step1118). In this example implementation, the optimized classpath file isMy_program.classpath1120. Once the new classpath environment variable is saved in the classpath file, the classpath optimization process completes.
Later, the user invokes the Java™ program by using a “javaopt” command (step1122). The “javaopt” command looks for a classpath file with a .classpath extension of the program name. If the classpath file is found, the command sets the program classpath to the new classpath environment variable in the classpath file by calling a “java -classpath” command. By using the “javaopt” command, search time by the Java™ class loader may be minimized. The program then executes and the program invocation process terminates.
It is noted thatclasspath optimization steps1100,1108,1112,1114, and1118 inFIG. 11 are executed only once to optimize the classpath, whileprogram invocation step1122 inFIG. 11 may be executed multiple times for a program that is executed repeatedly. It is also noted that the classpath file is unique to a program. Thus, each program is associated with a different classpath file.
In summary, the aspects of the present invention optimizes Java™ program performance, in particular repeatedly executed Java™ program performances, by dynamically filtering and reordering the classpath. The aspects of the present invention create an optimized classpath that is specific to a particular program in advance of program execution. Furthermore, the aspects of the present invention may be integrated with enterprise development tools, such as Eclipse, to optimize the performance of programs during their development. In this manner, the optimized classpath may be deployed all of the way from the development phase to production phase along with the application such that the optimization is separate from the time-sensitive processing of the production resources.
The invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment containing both hardware and software elements. In a preferred embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
Furthermore, the invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk-read only memory (CD-ROM), compact disk-read/write (CD-R/W), and digital video disc (DVD).
A data processing system suitable for storing and/or executing program code will include at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
Input/output or I/O devices (including but not limited to keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers.
Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modems, and Ethernet cards are just a few of the currently available types of network adapters.
The description of the present invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.