US5727167A - Thresholding support in performance monitoring - Google Patents

Thresholding support in performance monitoring

Publication number
US5727167A
Authority
US
United States
Prior art keywords
memory
counter
data
clock
cycle counter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
US08/654,068
Inventor
Harry Dwyer, III
Frank Eliot Levine
Edward Hugh Welbon
Charles Gordon Wright
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Priority to US08/654,068
Application granted
Publication of US5727167A
Anticipated expiration
Expired - Fee Related


Abstract

A thresholding mechanism and method for performance monitoring of memory array access distribution times is disclosed. A data request signal sent to the memory hierarchy activates a first counter, having a first count value. A clock coupled to the first counter increments the first count value with each clock cycle, while also decrementing a decrementer having a predetermined threshold value. The first counter is deactivated by a completion signal when the data request is completed. A second counter having a second count value is incremented when the first count value is greater than the threshold value by the time the data request is complete.

Description

This application is a continuation of application Ser. No. 08/422,363 filed on Apr. 14, 1995 now abandoned.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention generally relates to performance monitoring in a data processing system, and, more particularly, to an efficient architected mechanism that analyzes memory hierarchy performance only if a memory cache event exceeds a variable user-specified amount of time.
2. Description of the Prior Art
A performance monitor is a facility incorporated into a processor to monitor selected characteristics to help system developers to debug and analyze their systems by determining the state of the machine at a specific point in time.
Performance monitors produce information relating to the utilization of the processor's instruction execution and storage control functions; information that is of value to computer architects seeking to enhance the performance of an existing processor or develop data for the design of a new processor.
The optimization of software for execution on complex uniprocessors or multiprocessors involves the study of a program's memory access patterns and their interaction with a system's memory hierarchy. The memory hierarchy behavior must be understood in order to develop algorithms that schedule tasks (and perhaps partition them), and structure and distribute data optimally.
Currently there are two approaches to the acquisition of information about a system's memory hierarchy behavior while running an application: utilizing test instruments or incorporating software-accessible counters into a processor chip.
Utilizing the first approach, test instruments are attached to various test points in a processing system to monitor signals. However, there are several drawbacks to this approach. One is that it is difficult to associate a particular instruction with an activity observed on a bus external to a complex processor. Complex pipelines, instruction prefetching, and data buffering make it difficult to associate events external to a processor with an internal processor state.
Also, it is difficult to find appropriate test points and to attach instrumentation to them. The most significant drawback to this approach, however, is that advanced processors may incorporate one or more levels of cache (L1, L2, etc.) and memory hierarchy wholly within the processor chip. Access to these levels with test instruments is therefore not possible. In sum, this approach provides limited information with significant difficulty.
In an effort to resolve some of the problems with the first approach, designers have taken to incorporating software-accessible counters into the processor chip. These counters simply count the number of L1 and L2 cache hits/misses. A limitation of this approach is that the granularity of the information provided is large. In other words, a cache access may not take the same number of machine cycles each time it occurs; these access times in an advanced processor may vary over a wide range of values depending on other cache or processor activity.
In this second approach, a simple count of L1 or L2 hits from the software counter, producing such representative data as shown in FIG. 1 for example, does not reveal the shape of the access time distribution curve associated with L1 and L2 cache accesses. An access time distribution curve is a graph showing the number of occurrences of a certain event as a function of the access time in cycles. The distribution of access times experienced by an application to various levels of memory hierarchy, which is essential to optimizing software, is thus lost with the above approach.
In light of the foregoing, there exists a need for a mechanism and method to identify the distribution of access times for all levels of memory hierarchy.
SUMMARY OF THE INVENTION
The present invention is directed to an apparatus and method for providing detailed information on the distribution of access times while accessing all levels of a system's memory hierarchy during an application's execution, which substantially obviates one or more of the problems due to the limitations and disadvantages of the related art.
In general, a processor is able to associate the data returned from a memory hierarchy access with a particular load instruction or request. A processor must have this capability to forward the data to the appropriate destinations and to determine that the load instruction is completed.
The present invention uses this capability to start a counter (subsequently incremented by a processor or bus clock) when a particular load request is issued to the memory hierarchy. The count is halted when the data associated with the particular request is available. The counter now contains the access time in machine cycles for the selected access. If this access time exceeds a value that has been loaded into a special register, a second counter is incremented.
By accumulating counts of accesses that exceed decreasing threshold values, a histogram can be generated that reveals the access time distribution of memory requests. This distribution reflects the proportionate numbers of "hits" at all memory levels (e.g., L1, L2, and main memory) and will also reflect so-called smearing effects caused by unanticipated contention for buses or other system resources. These conflicts can cause the access time to a given memory level (e.g., L2) to vary over a range of values. This detailed distribution reveals system effects and access time information that are not captured with the prior art approaches.
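The step from threshold counts to a histogram can be written out explicitly. The symbols below ($t_i$ for the access time of the $i$-th monitored request, $N(X)$ for the event count obtained with threshold $X$, and $h(k)$ for the histogram bin at $k$ cycles) are introduced here for illustration only and do not appear in the patent text. The relation assumes that the same workload is repeated for each threshold setting, so that the access times are comparable from run to run.

```latex
N(X) = \bigl|\{\, i : t_i > X \,\}\bigr| ,
\qquad
h(k) = N(k-1) - N(k) = \bigl|\{\, i : t_i = k \,\}\bigr| ,
\quad k \ge 1 .
```

Under that assumption, each bin of the histogram is the difference between the counts obtained at two adjacent threshold values.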
To achieve these and other advantages and in accordance with the purpose of the invention, as embodied and broadly described, the invention provides for a thresholding mechanism for a performance monitoring system comprising a memory hierarchy for processing a data request signal; a first counter having a first count value that is activated by the data request signal and deactivated by a data completion signal; a timing means coupled to the first counter, the timing means incrementing the first count value with each clock cycle after the first counter is activated; a decrementer responsive to the timing means, the decrementer having a threshold value that is decremented with each clock cycle; and a second counter having a second count value that is incremented when the first count value is greater than the threshold value when the data completion signal is received.
In another aspect, the invention provides for a method of determining memory accesses as a function of elapsed clock cycles, the method comprising the steps of receiving a data request signal in a memory hierarchy; activating a first count value of a first counter upon receiving the data request signal; incrementing the first count value with each clock cycle; decrementing a threshold value of a decrementer with each clock cycle; deactivating the incrementing step when a data completion signal is received; comparing the first count value to the threshold value; and incrementing a second count value when the first count value is greater than the threshold value.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
The foregoing and other objects, aspects and advantages will be better understood from the following detailed description of a preferred embodiment of the invention with reference to the drawings, in which:
FIG. 1 is a bar graph of the number of L1, L2, and main memory hits gathered by a related art software-accessible counter incorporated into a processor chip;
FIG. 2 is a data processing system utilizing the method and system of the present invention;
FIG. 3 is a block diagram of a threshold mechanism for load or store instructions;
FIG. 4 illustrates a monitor mode control register (MMCR) utilized to manage a plurality of counters;
FIG. 5 is a sample timing diagram of the thresholding mechanism where the threshold is not exceeded;
FIG. 6 is a sample timing diagram of the thresholding mechanism where the threshold is exceeded; and
FIG. 7 is a graph providing an illustrative access time distribution curve produced in accordance with the present invention.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT OF THE INVENTION
Referring now to the drawings, and more particularly to FIG. 2, there is shown a data processing system employing the method and system of the present invention. As shown, the data processing system 1 may include one or more of a processing unit 2, display device 4, keyboard 6, and mouse 8, and various other components both integral and peripheral. As is well-known in the art, a user may input data to data processing system 1 utilizing keyboard 6 or mouse 8. Data processing system 1 outputs data to a user via display device 4. The processing unit 2 includes means for interfacing display device 4, keyboard 6, and mouse 8, to a central processor, such as for example, a PowerPC microprocessor (PowerPC is a trademark of IBM Corporation).
Referring to FIG. 3, there is depicted a block diagram of a thresholding mechanism for a performance monitor, designated generally as 10, for use by a data processing system such as that depicted in FIG. 2. The thresholding mechanism includes a memory hierarchy 11, and is shown to include, by way of example and not by limitation, an L1 cache 12, an L2 cache 14, and a main memory 16. It is understood that any number or combination of cache levels and main memory are considered within the scope of the term memory hierarchy.
In operation, a processor must be able to associate or tag data returned from a memory hierarchy access with a particular load instruction or request in order to forward the data to the appropriate destination and determine that the load instruction is completed.
Using this capability, when a memory request or data request signal 17 is issued to the memory hierarchy 11, the signal activates a counter 18 that is subsequently incremented, with each clock cycle, by a clock 20. The memory request 17 may either be a data load instruction or a data store instruction.
The clock 20 may be a conventional processor clock. Alternatively, a bus clock may be utilized instead of the processor clock as a base for the thresholding and counting functions. Bus clocks are often more difficult to support in the hardware due to synchronization problems, but better for software granularity. In another alternative, the thresholding count could represent an integral number of processor clock cycles.
At the same time that the cycle counter 18 is activated, a decrementer or thresholder 22 is initialized with a threshold value (X) that is variable and software settable. The purpose of incorporating a threshold value (X) is to be able to characterize certain data cache misses, for example, L1 or L2 data cache misses. Specifically, the system should be able to identify (via repeated runs and sampling) the time distribution required to satisfy L1 or L2 cache misses. By accumulating counts of accesses that exceed decreasing threshold values, designers would have a clearer picture of conflicts that cause the access time to a given memory level to vary over a range of values.
The software threshold initialization is accomplished using an implementation dependent number (2-4) of counters (i.e., PMC1, PMC2, PMC3, PMC4) that are used to count processor/storage related events. Monitor Mode Control Registers (MMCR) establish the function of the bit counters PMCn, with each MMCR controlling two bit counters. The counters and control registers reside on the processor chip and are accessible for read or write via mfspr (move from special purpose register) or mtspr (move to special purpose register) instructions.
The interaction and operation of the control registers and counters in a multi-microprocessor system are discussed in U.S. patent application, Ser. No. 08/353,006, filed Dec. 9, 1994 which is now U.S. Pat. No. 5,544,527, by Frank C. Gover et al, the disclosure of which is hereby incorporated by reference. Portions are reproduced in appropriate sections below for ease of discussion and reference.
In the disclosed embodiment, there are two 32-bit counters PMC1 (cycle counter 18), PMC2 (event counter 24), controlled by a 32-bit control register 23 (MMCR0) whose bits are partitioned into bit fields that allow for selection of events (signals) to be recorded (counted). Selection of allowable combinations of events causes the counters to operate concurrently. Smaller or larger bit counters and registers may be utilized to correspond to a particular processor and bus architecture, and design considerations such as the desired functionality of the performance monitor and the chip area within a processor. It is apparent that depending on the intended application, additional MMCR's and PMC's may be utilized and are considered to be within the scope of this invention.
With reference to FIG. 4, there is illustrated a representation of MMCR0 which controls the operation of counters PMC1 and PMC2. As illustrated, for example, MMCR0 is partitioned into a number of bit fields whose settings select events to be counted, enable performance monitor interrupts, specify the conditions under which counting is enabled, and set the threshold value (X).
In the illustrated example, bits 0-4 and 18 determine the scenarios under which PMC1 and PMC2 are enabled to count. Bits 5, 16, and 17 are utilized to control interrupt signals triggered by PMC1 and PMC2. Bits 19-25 and 26-31 are utilized to select the events monitored by PMC1 and PMC2 respectively. Bits 6-9 are utilized to control the time or event based transitions. Finally, the threshold value is variably set by bits 10-15.
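The bit-field layout just described can be illustrated with a small sketch. The helper names (`get_field`, `set_field`) and the field constants are hypothetical and are not part of the disclosure. The sketch also assumes that bit 0 is the most significant bit of the 32-bit register, which is the usual PowerPC numbering convention but is not stated in the text.

```python
# Hypothetical helpers; only the bit ranges come from the description of FIG. 4.
# ASSUMPTION: bit 0 is the most significant bit of the 32-bit MMCR0
# (the usual PowerPC numbering); the text does not state the bit order.

WIDTH = 32

# (first_bit, last_bit), inclusive, as listed in the description.
THRESHOLD = (10, 15)     # threshold value (X), 6 bits
PMC1_SELECT = (19, 25)   # event selected for PMC1, 7 bits
PMC2_SELECT = (26, 31)   # event selected for PMC2, 6 bits


def get_field(reg: int, first: int, last: int) -> int:
    """Return the value held in bits first..last of a 32-bit register."""
    length = last - first + 1
    shift = WIDTH - 1 - last          # distance of the field from the LSB
    return (reg >> shift) & ((1 << length) - 1)


def set_field(reg: int, first: int, last: int, value: int) -> int:
    """Return a copy of reg with bits first..last replaced by value."""
    length = last - first + 1
    if not 0 <= value < (1 << length):
        raise ValueError("value does not fit in the field")
    shift = WIDTH - 1 - last
    mask = ((1 << length) - 1) << shift
    return (reg & ~mask & 0xFFFFFFFF) | (value << shift)


# Example: program a threshold of 5 cycles into an otherwise empty register.
mmcr0 = set_field(0, *THRESHOLD, 5)
```

Because the threshold field is six bits wide, this layout can express threshold values from 0 to 63.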
It is apparent that depending on the particular features sought to be profiled, many different event counting and sampling scenarios may be utilized within the scope of this invention. One method of implementation, for example, would be to have the threshold value (X) represent the number of cycles for the item being counted in the first counter PMC1 to be executed and to accumulate values in the PMC2 counter only when the count exceeds the threshold value (X). For example, if the item being counted in PMC1 was data cache miss cycles, and the threshold value (X) was set to 5, then only L1 cache miss cycles that took greater than 5 cycles would be counted in PMC2. The second bit counter PMC2 could be used to count total number of cycles or time.
Referring again to the embodiment of FIG. 3, after the threshold value is initialized, the decrementer/thresholder 22 is decremented by one with each clock cycle. The threshold value (X) is said to be exceeded when the decrementer reaches zero before the data instruction completes. It is not exceeded if the data request sequence is completed before the decrementer reaches zero. The term "completed" has a different meaning depending on the type of data instruction. For the load instructions, "complete" indicates that the data associated with the same request was "received"; for the store instructions, "complete" indicates that the data was written or "stored" to the cache. A second counter PMC2, i.e., event counter 24, is incremented every time the threshold value (X) is exceeded.
FIG. 5 illustrates a sample timing diagram of the thresholding mechanism where the threshold is not exceeded. In this case the threshold value of (X) has been set to 3 clock cycles. When the tagged load instruction is sent, cycle counter 18 is incremented by one with each clock cycle. At the same time, the threshold decrementer begins decrementing by one from its threshold value (X) with each clock cycle. As shown in FIG. 5, since the tagged data is received after one clock cycle, the count value in cycle counter 18 is one and the decrementer 22 threshold value is two. Since the threshold has not been exceeded--the data was received before the decrementer reached zero--the count value is less than the decrementer (threshold) value and thus the event counter 24 would not be incremented.
The opposite condition, that is where the threshold value is exceeded, is shown in the sample timing diagram in FIG. 6. As before, the threshold value of (X) has been set to 3 clock cycles. When the tagged load instruction is sent, cycle counter 18 is again incremented by one with each clock cycle. At the same time, the threshold decrementer begins decrementing by one from its threshold value (X) with each clock cycle. Since the tagged data was received four clock cycles after activation (cycle 5), the count value is four. Since the decrementer 22 threshold value had already reached zero by cycle 4, the threshold was exceeded. In this case the count value exceeded the threshold value and thus the event counter 24 is incremented by one.
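The two timing diagrams can be mimicked with a short cycle-by-cycle simulation. This is only a sketch under a simplified timing model assumed here (one loop iteration per clock cycle, with the decrementer holding at zero once it gets there). The function name `simulate_access` is hypothetical, and real hardware timing may differ in detail.

```python
# Simplified software model of cycle counter 18 and decrementer 22.
# ASSUMPTIONS: one loop iteration per clock cycle; the decrementer stops
# at zero; "exceeded" means the decrementer was already zero while the
# request was still outstanding.

def simulate_access(latency_cycles: int, threshold: int):
    """Return (cycle_count, decrementer_value, exceeded) for one request."""
    cycle_counter = 0
    decrementer = threshold
    exceeded = False
    for _ in range(latency_cycles):
        if decrementer == 0:
            # The decrementer ran out before the data completion signal.
            exceeded = True
        cycle_counter += 1
        if decrementer > 0:
            decrementer -= 1
    return cycle_counter, decrementer, exceeded


event_counter = 0
for latency in (1, 4):                 # the FIG. 5 and FIG. 6 cases
    count, remaining, exceeded = simulate_access(latency, threshold=3)
    if exceeded:
        event_counter += 1             # models event counter 24
```

With a threshold of 3, a one-cycle access leaves a count of one and a decrementer value of two (the FIG. 5 case), while a four-cycle access leaves a count of four with the decrementer at zero and increments the event counter (the FIG. 6 case). In this model, "decrementer reached zero before completion" is equivalent to "count value greater than threshold".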
Through repeated runs with different threshold values, processor system developers can accumulate counts of accesses whose timings exceed decreasing threshold values, thereby generating a cache hit distribution chart, or histogram, as shown in FIG. 7, which reflects the proportionate number of "hits" at the various memory levels. In analyzing the example data in FIG. 7, it can be seen that main memory access times vary over a wide range--perhaps due to interleaved memory bank conflicts or system bus conflicts. In this case the processor designer would conduct further analysis to determine the exact cause of the wide distribution of access times, and hopefully, design around it. The distribution of L1 and L2 access times reveals how well their structures support the application on which the example data is based.
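As a rough illustration of how such a histogram might be assembled in software, the sketch below replays one list of access times against a series of decreasing thresholds and differences the resulting event counts. The function names and the sample latencies are invented for illustration and are not data from the patent. The sketch also assumes that the workload behaves identically on every run and that every access takes at least one cycle.

```python
# Hypothetical post-processing of event counts gathered over repeated runs.
# ASSUMPTIONS: each run sees the same access times; every access takes
# at least one cycle; the latencies below are made-up sample data.

def run_with_threshold(latencies, threshold):
    """Model one run: the event count is the number of accesses whose
    cycle count is greater than the threshold."""
    return sum(1 for t in latencies if t > threshold)


def histogram_from_runs(latencies, max_threshold):
    """Build {cycles: occurrences} from runs at thresholds max..0."""
    counts = {}
    for x in range(max_threshold, -1, -1):      # decreasing thresholds
        counts[x] = run_with_threshold(latencies, x)
    # Accesses taking exactly k cycles exceed threshold k-1 but not k.
    hist = {k: counts[k - 1] - counts[k] for k in range(1, max_threshold + 1)}
    beyond_range = counts[max_threshold]        # longer than any threshold
    return hist, beyond_range


sample_latencies = [1, 1, 1, 2, 8, 9, 8, 40, 45, 63, 70]   # invented values
hist, beyond_range = histogram_from_runs(sample_latencies, max_threshold=63)
nonzero_bins = {k: v for k, v in hist.items() if v}
```

Accesses longer than the largest programmable threshold cannot be resolved into individual bins and are reported together in `beyond_range`.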
Rather than causing a performance interrupt at each instance where the threshold value is exceeded, the counters may be programmed to overflow, for example, by setting initial values for the counters to cause an interrupt to be triggered on the 100th data cache miss that exceeds the specified threshold value. In this way, samples are taken only of those items that are counted.
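The preset arithmetic behind this sampling scheme can be sketched as follows. The text does not define the exact overflow condition, so the sketch treats it as a parameter (`overflow_at`) and assumes by default that a 32-bit counter overflows when it would wrap past its maximum value. The function names are hypothetical.

```python
# Hypothetical sketch of choosing a counter preset so that the overflow
# interrupt fires on the n-th counted event.
# ASSUMPTION: overflow happens when the count reaches `overflow_at`
# (2**32 by default, i.e. a 32-bit wrap); actual hardware may define
# the overflow point differently.

COUNTER_BITS = 32


def preset_for_nth_event(n: int, overflow_at: int = 1 << COUNTER_BITS) -> int:
    """Initial counter value that overflows on the n-th increment."""
    if not 1 <= n <= overflow_at:
        raise ValueError("n is outside the range the counter can express")
    return overflow_at - n


def events_until_interrupt(preset: int, overflow_at: int = 1 << COUNTER_BITS) -> int:
    """Count increments from `preset` until the overflow point is reached."""
    counter = preset
    events = 0
    while counter < overflow_at:
        counter += 1
        events += 1
    return events


# Interrupt on the 100th data cache miss that exceeds the threshold.
preset = preset_for_nth_event(100)
```

Reloading the same preset inside the interrupt handler would yield one sample for every 100 qualifying events.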
In multiple processor hardware systems such as PowerPC processors, where cache coherency is supported, one can identify when hits are occurring in other processor caches. For example, counts can be made of L2 lateral cache hits in another processor's L2 cache. This information can be profiled by programming the counters to give interrupts when these conditions occur. L2 cache hits in other caches are very expensive time-wise, and by identifying these "hot spots", there is an opportunity to modify the code to avoid this condition.
The interaction and operation of the performance monitor interrupt with interrupt masking for a multiprocessor system (MP) are discussed in U.S. patent application, Ser. No. 08/358,220, (Attorney Docket No. AT9-94-181) filed Dec. 16, 1994 by R. L. Arndt et al, the disclosure of which is hereby incorporated by reference.
In operation, the performance monitor interrupt for each processor in a multiprocessor system contains a series of associated special purpose registers (SPR). To ensure that there is no loss of data due to interrupt masking, when the interrupt condition is detected (via either a time base transition or a counter overflow), the processors will capture the effective instruction and operand (if any) addresses of an instruction in execution and present an interrupt to the interrupt resolution logic. The instruction and operand addresses are saved in two SPRs, a Saved Instruction Address (SIA) and Saved Data Address (SDA), respectively, designated for this purpose at the time of the system wide interrupt signalling. When the interrupt is actually serviced, the content of the SDA and SIA will reflect the processor state at the time of the system wide signalling. The monitoring facility also captures the state, or count values, of the counters, PMC1 and PMC2. The resulting information can be utilized by designers to optimize system performance.
While the invention has been described in terms of the embodiments described above, those skilled in the art will recognize that the invention can be practiced with modification within the spirit and scope of the appended claims.

Claims (10)

Having thus described the invention, what is claimed as new and desire to secure by Letters Patent is as follows:
1. A thresholding mechanism for a performance monitor system in a data processing system comprising:
a memory hierarchy including at least one level of cache memory and a main memory for processing a memory request signal and outputting a data completion signal when a memory request signal has been processed;
a cycle counter activated by the memory request signal to said memory hierarchy and deactivated by the data completion signal from said memory hierarchy;
a clock coupled to the cycle counter, said clock incrementing the cycle counter with each clock cycle after the cycle counter is activated;
a monitor mode control register storing a threshold value, the threshold value being variable and software settable;
a thresholder connected to said cycle counter to receive an output count value when the cycle counter is deactivated by a data completion signal, said thresholder comparing the threshold value stored in said monitor mode control register with the output count value from the cycle counter, said thresholder generating an output when the count value exceeds the threshold value;
an event counter incremented by the output from said thresholder; and
output means connected to said event counter for generating an output for performance analysis, said output means accumulating counts of memory accesses for said at least one level of cache and said main memory that exceed decreasing threshold values set in said monitor mode control register and generating a histogram of access time distributions of memory requests for each of said one level of cache and said main memory.
2. The thresholding mechanism for a performance monitor system recited in claim 1, wherein the memory request signal is selected from the group consisting of a data load instruction and a data store instruction.
3. The thresholding mechanism for a performance monitor system recited in claim 1, wherein the cycle and event counters are 32-bit software implemented counters.
4. The thresholding mechanism for a performance monitor system recited in claim 1, wherein the clock comprises a processor clock.
5. The thresholding mechanism for a performance monitor system recited in claim 1, wherein the clock comprises a bus clock.
6. A method of determining a memory access distribution as a function of elapsed clock cycles, the method comprising the steps of:
receiving a memory request signal in a memory hierarchy including at least one level of cache memory and a main memory;
generating a data completion signal when a memory request signal has been processed by said memory hierarchy;
activating a cycle counter upon receiving the memory request signal;
incrementing the cycle counter with each clock cycle of a clock while the cycle counter is activated;
storing a threshold value in a monitor mode control register;
setting variable and decreasing threshold values in said monitor mode control register;
deactivating said cycle counter when a data completion signal is received from said memory hierarchy;
comparing a count value accumulated by the cycle counter to the threshold value stored in said monitor mode control register when the cycle counter is deactivated by a data completion signal;
incrementing an event counter when the count value accumulated by the cycle counter is greater than said threshold value;
accumulating counts from said event counter of memory accesses for said at least one level of cache and main memory that exceed decreasing threshold values set in said monitor mode control register; and
generating an output for performance analysis, said output including a histogram of access time distributions of memory requests for each of said one level of cache and said main memory.
7. The method of determining a memory access distribution as a function of elapsed clock cycles recited in claim 6, wherein the activating step commences when one of a data load instruction and a data store instruction is received in the memory hierarchy.
8. The method of determining a memory access distribution as a function of elapsed clock cycles recited in claim 6, wherein the deactivating step occurs when data associated with the memory request signal is received from the memory hierarchy.
9. The method of determining a memory access distribution as a function of elapsed clock cycles recited in claim 6, wherein the deactivating step occurs when data associated with the memory request signal is stored in the memory hierarchy.
10. The method of determining a memory access distribution as a function of elapsed clock cycles recited in claim 6, wherein the clock cycles are selected from the group consisting of processor clock cycles and bus clock cycles.
US08/654,068 | 1995-04-14 | 1996-06-11 | Thresholding support in performance monitoring | Expired - Fee Related | US5727167A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
US08/654,068 (US5727167A (en)) | 1995-04-14 | 1996-06-11 | Thresholding support in performance monitoring

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
US42236395A | 1995-04-14 | 1995-04-14 |
US08/654,068 (US5727167A (en)) | 1995-04-14 | 1996-06-11 | Thresholding support in performance monitoring

Related Parent Applications (1)

Application Number | Title | Priority Date | Filing Date
US42236395A | Continuation | 1995-04-14 | 1995-04-14

Publications (1)

Publication Number | Publication Date
US5727167A (en) | 1998-03-10

Family

ID=23674572

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
US08/654,068 (Expired - Fee Related; US5727167A (en)) | Thresholding support in performance monitoring | 1995-04-14 | 1996-06-11

Country Status (1)

Country | Link
US (1) | US5727167A (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5937437A (en) * | 1996-10-28 | 1999-08-10 | International Business Machines Corporation | Method and apparatus for monitoring address translation performance
US6009481A (en) * | 1996-09-30 | 1999-12-28 | Emc Corporation | Mass storage system using internal system-level mirroring
US6055650A (en) * | 1998-04-06 | 2000-04-25 | Advanced Micro Devices, Inc. | Processor configured to detect program phase changes and to adapt thereto
US6134710A (en) * | 1998-06-26 | 2000-10-17 | International Business Machines Corp. | Adaptive method and system to minimize the effect of long cache misses
US6249906B1 (en) | 1998-06-26 | 2001-06-19 | International Business Machines Corp. | Adaptive method and system to minimize the effect of long table walks
US6446029B1 (en) | 1999-06-30 | 2002-09-03 | International Business Machines Corporation | Method and system for providing temporal threshold support during performance monitoring of a pipelined processor
US6499116B1 (en) | 1999-03-31 | 2002-12-24 | International Business Machines Corp. | Performance of data stream touch events
US6748558B1 (en) | 2000-05-10 | 2004-06-08 | Motorola, Inc. | Performance monitor system and method suitable for use in an integrated circuit
US20050050530A1 (en) * | 2003-08-27 | 2005-03-03 | Sun Microsystems, Inc. | Methods for execution control acquisition of a program and for executing an optimized version of a program
US20050060694A1 (en) * | 2003-09-16 | 2005-03-17 | Sun Microsystems, Inc. | Methods and hardware for safe memory allocation in arbitrary program environments
US20050071817A1 (en) * | 2003-09-30 | 2005-03-31 | International Business Machines Corporation | Method and apparatus for counting execution of specific instructions and accesses to specific data locations
US20050081107A1 (en) * | 2003-10-09 | 2005-04-14 | International Business Machines Corporation | Method and system for autonomic execution path selection in an application
US20060005083A1 (en) * | 2004-06-30 | 2006-01-05 | International Business Machines Corporation | Performance count tracing
US20060123167A1 (en) * | 2004-12-08 | 2006-06-08 | Jeppsen Roger C | Request conversion
GB2429552A (en) * | 2005-08-23 | 2007-02-28 | Hewlett Packard Development Co | Start/Stop Circuit for a Performance Counter
US20070073984A1 (en) * | 2005-09-29 | 2007-03-29 | International Business Machines Corporation | Apparatus and method for monitoring performance of a storage area network
US7225110B2 (en) | 2001-08-16 | 2007-05-29 | International Business Machines Corporation | Extending width of performance monitor counters
US8195858B1 (en) * | 2009-07-28 | 2012-06-05 | Nvidia Corporation | Managing conflicts on shared L2 bus
US8321618B1 (en) | 2009-07-28 | 2012-11-27 | Nvidia Corporation | Managing conflicts on shared L2 bus
US20130036185A1 (en) * | 2011-08-02 | 2013-02-07 | Cavium, Inc. | Method and apparatus for managing transport operations to a cluster within a processor
US8615619B2 (en) | 2004-01-14 | 2013-12-24 | International Business Machines Corporation | Qualifying collection of performance monitoring events by types of interrupt when interrupt occurs
US20140007114A1 (en) * | 2012-06-29 | 2014-01-02 | Ren Wang | Monitoring accesses of a thread to multiple memory controllers and selecting a thread processor for the thread based on the monitoring
US8689190B2 (en) | 2003-09-30 | 2014-04-01 | International Business Machines Corporation | Counting instruction execution and data accesses
US8782664B2 (en) | 2004-01-14 | 2014-07-15 | International Business Machines Corporation | Autonomic hardware assist for patching code
US9298651B2 (en) | 2013-06-24 | 2016-03-29 | International Business Machines Corporation | Continuous in-memory accumulation of hardware performance counter data
US9323655B1 (en) * | 2011-06-30 | 2016-04-26 | Emc Corporation | Location of data among storage tiers
US9600336B1 (en) | 2015-08-28 | 2017-03-21 | International Business Machines Corporation | Storing service level agreement compliance data
US9965381B1 (en) * | 2011-06-30 | 2018-05-08 | EMC IP Holding Company LLC | Indentifying data for placement in a storage system
US10002063B2 (en) | 2015-10-20 | 2018-06-19 | International Business Machines Corporation | Monitoring performance of multithreaded workloads

Citations (27)

* Cited by examiner, † Cited by third party
| Publication number | Priority date | Publication date | Assignee | Title |
| US3906454A (en) * | 1973-05-18 | 1975-09-16 | Bell Telephone Labor Inc | Computer monitoring system |
| US4034353A (en) * | 1975-09-15 | 1977-07-05 | Burroughs Corporation | Computer system performance indicator |
| US4438490A (en) * | 1981-10-01 | 1984-03-20 | Honeywell Information Systems Inc. | Clock control of a central processing unit from a monitor interface unit |
| US4459656A (en) * | 1981-10-01 | 1984-07-10 | Honeywell Information Systems Inc. | Clocking system for a computer performance monitoring device |
| US4636967A (en) * | 1983-10-24 | 1987-01-13 | Honeywell Inc. | Monitor circuit |
| US4811347A (en) * | 1986-01-30 | 1989-03-07 | U.S. Philips Corporation | Apparatus and method for monitoring memory accesses and detecting memory errors |
| US4821178A (en) * | 1986-08-15 | 1989-04-11 | International Business Machines Corporation | Internal performance monitoring by event sampling |
| US4849879A (en) * | 1986-09-02 | 1989-07-18 | Digital Equipment Corp | Data processor performance advisor |
| US4905171A (en) * | 1987-11-09 | 1990-02-27 | International Business Machines Corporation | Workstation controller performance monitor |
| US4969088A (en) * | 1988-04-26 | 1990-11-06 | International Business Machines Corporation | Hardware mechanism for automatically detecting hot-spot references and diverting same from memory traffic in a multiprocessor computer system |
| US5038278A (en) * | 1988-04-01 | 1991-08-06 | Digital Equipment Corporation | Cache with at least two fill rates |
| US5043885A (en) * | 1989-08-08 | 1991-08-27 | International Business Machines Corporation | Data cache using dynamic frequency based replacement and boundary criteria |
| US5062055A (en) * | 1986-09-02 | 1991-10-29 | Digital Equipment Corporation | Data processor performance advisor |
| US5074144A (en) * | 1989-02-27 | 1991-12-24 | Gary F. Krofchalk | Vehicle performance monitoring system |
| US5088058A (en) * | 1988-08-26 | 1992-02-11 | Unisys Corporation | Apparatus and method for evaluating and predicting computer I/O performance using I/O workload snapshots for model input |
| US5097412A (en) * | 1987-04-24 | 1992-03-17 | Hitachi, Ltd. | Method for simulating the operation of programs in a distributed processing system |
| US5103394A (en) * | 1984-04-30 | 1992-04-07 | Hewlett-Packard Company | Software performance analyzer |
| US5142670A (en) * | 1988-07-26 | 1992-08-25 | International Business Machines Corporation | Method and apparatus for calculating disk-access footprints for use in selecting a storage management method |
| US5151982A (en) * | 1988-03-30 | 1992-09-29 | Kabushiki Kaisha Toshiba | Data processing system |
| US5151981A (en) * | 1990-07-13 | 1992-09-29 | International Business Machines Corporation | Instruction sampling instrumentation |
| US5168563A (en) * | 1989-03-29 | 1992-12-01 | Hewlett-Packard Company | Various possible execution paths measurement and analysis system for evaluating before writing source codes the efficiency performance of software designs |
| US5193179A (en) * | 1988-08-09 | 1993-03-09 | Harris Corporation | Activity monitor system non-obtrusive statistical monitoring of operations on a shared bus of a multiprocessor system |
| US5223827A (en) * | 1991-05-23 | 1993-06-29 | International Business Machines Corporation | Process and apparatus for managing network event counters |
| US5388242A (en) * | 1988-12-09 | 1995-02-07 | Tandem Computers Incorporated | Multiprocessor system with each processor executing the same instruction sequence and hierarchical memory providing on demand page swapping |
| US5423020A (en) * | 1990-04-25 | 1995-06-06 | International Business Machines Corporation | Apparatus and method for optimizing bus usage by varying the amount of data transferred on a DMA operation |
| US5426741A (en) * | 1991-02-20 | 1995-06-20 | Digital Equipment Corporation | Bus event monitor |
| US5488377A (en) * | 1995-03-28 | 1996-01-30 | Mcdonnell Douglas Corporation | Method and apparatus for controlling the false alarm rate of a receiver |

Cited By (44)

* Cited by examiner, † Cited by third party
| Publication number | Priority date | Publication date | Assignee | Title |
| US6009481A (en) * | 1996-09-30 | 1999-12-28 | Emc Corporation | Mass storage system using internal system-level mirroring |
| US5937437A (en) * | 1996-10-28 | 1999-08-10 | International Business Machines Corporation | Method and apparatus for monitoring address translation performance |
| US6055650A (en) * | 1998-04-06 | 2000-04-25 | Advanced Micro Devices, Inc. | Processor configured to detect program phase changes and to adapt thereto |
| US6249906B1 (en) | 1998-06-26 | 2001-06-19 | International Business Machines Corp. | Adaptive method and system to minimize the effect of long table walks |
| US6134710A (en) * | 1998-06-26 | 2000-10-17 | International Business Machines Corp. | Adaptive method and system to minimize the effect of long cache misses |
| US6499116B1 (en) | 1999-03-31 | 2002-12-24 | International Business Machines Corp. | Performance of data stream touch events |
| US6446029B1 (en) | 1999-06-30 | 2002-09-03 | International Business Machines Corporation | Method and system for providing temporal threshold support during performance monitoring of a pipelined processor |
| US6748558B1 (en) | 2000-05-10 | 2004-06-08 | Motorola, Inc. | Performance monitor system and method suitable for use in an integrated circuit |
| US7225110B2 (en) | 2001-08-16 | 2007-05-29 | International Business Machines Corporation | Extending width of performance monitor counters |
| US20050050530A1 (en) * | 2003-08-27 | 2005-03-03 | Sun Microsystems, Inc. | Methods for execution control acquisition of a program and for executing an optimized version of a program |
| US7318222B2 (en) * | 2003-08-27 | 2008-01-08 | Sun Microsystems, Inc. | Methods for execution control acquistion of a program and for executing an optimized version of a program |
| US20050060694A1 (en) * | 2003-09-16 | 2005-03-17 | Sun Microsystems, Inc. | Methods and hardware for safe memory allocation in arbitrary program environments |
| US7269830B2 (en) * | 2003-09-16 | 2007-09-11 | Sun Microsystems, Inc. | Methods and hardware for safe memory allocation in arbitrary program environments |
| US20050071817A1 (en) * | 2003-09-30 | 2005-03-31 | International Business Machines Corporation | Method and apparatus for counting execution of specific instructions and accesses to specific data locations |
| US7937691B2 (en) * | 2003-09-30 | 2011-05-03 | International Business Machines Corporation | Method and apparatus for counting execution of specific instructions and accesses to specific data locations |
| US8689190B2 (en) | 2003-09-30 | 2014-04-01 | International Business Machines Corporation | Counting instruction execution and data accesses |
| US8381037B2 (en) | 2003-10-09 | 2013-02-19 | International Business Machines Corporation | Method and system for autonomic execution path selection in an application |
| US20050081107A1 (en) * | 2003-10-09 | 2005-04-14 | International Business Machines Corporation | Method and system for autonomic execution path selection in an application |
| US8782664B2 (en) | 2004-01-14 | 2014-07-15 | International Business Machines Corporation | Autonomic hardware assist for patching code |
| US8615619B2 (en) | 2004-01-14 | 2013-12-24 | International Business Machines Corporation | Qualifying collection of performance monitoring events by types of interrupt when interrupt occurs |
| US20060005083A1 (en) * | 2004-06-30 | 2006-01-05 | International Business Machines Corporation | Performance count tracing |
| US20060123167A1 (en) * | 2004-12-08 | 2006-06-08 | Jeppsen Roger C | Request conversion |
| US7373565B2 (en) | 2005-08-23 | 2008-05-13 | Hewlett-Packard Development Company, L.P. | Start/stop circuit for performance counter |
| GB2429552B (en) * | 2005-08-23 | 2010-09-01 | Hewlett Packard Development Co | Start/Stop Circuit For Performance Counter |
| US20070168807A1 (en) * | 2005-08-23 | 2007-07-19 | Richard Adkisson | Start/stop circuit for performance counter |
| GB2429552A (en) * | 2005-08-23 | 2007-02-28 | Hewlett Packard Development Co | Start/Stop Circuit for a Performance Counter |
| US20080276054A1 (en) * | 2005-09-29 | 2008-11-06 | International Business Machines Corporation | Monitoring performance of a storage area network |
| US20070073984A1 (en) * | 2005-09-29 | 2007-03-29 | International Business Machines Corporation | Apparatus and method for monitoring performance of a storage area network |
| US8140750B2 (en) * | 2005-09-29 | 2012-03-20 | International Business Machines Corporation | Monitoring performance of a storage area network |
| US8244976B2 (en) | 2005-09-29 | 2012-08-14 | International Business Machines Corporation | Monitoring performance of a storage area network |
| US8195858B1 (en) * | 2009-07-28 | 2012-06-05 | Nvidia Corporation | Managing conflicts on shared L2 bus |
| US8321618B1 (en) | 2009-07-28 | 2012-11-27 | Nvidia Corporation | Managing conflicts on shared L2 bus |
| US9965381B1 (en) * | 2011-06-30 | 2018-05-08 | EMC IP Holding Company LLC | Indentifying data for placement in a storage system |
| US9323655B1 (en) * | 2011-06-30 | 2016-04-26 | Emc Corporation | Location of data among storage tiers |
| US9391892B2 (en) * | 2011-08-02 | 2016-07-12 | Cavium, Inc. | Method and apparatus for managing transport operations to a cluster within a processor |
| US9319316B2 (en) | 2011-08-02 | 2016-04-19 | Cavium, Inc. | Method and apparatus for managing transfer of transport operations from a cluster in a processor |
| US9525630B2 (en) | 2011-08-02 | 2016-12-20 | Cavium, Inc. | Method and apparatus for assigning resources used to manage transport operations between clusters within a processor |
| US9531690B2 (en) | 2011-08-02 | 2016-12-27 | Cavium, Inc. | Method and apparatus for managing processing thread migration between clusters within a processor |
| US20130036185A1 (en) * | 2011-08-02 | 2013-02-07 | Cavium, Inc. | Method and apparatus for managing transport operations to a cluster within a processor |
| US20140007114A1 (en) * | 2012-06-29 | 2014-01-02 | Ren Wang | Monitoring accesses of a thread to multiple memory controllers and selecting a thread processor for the thread based on the monitoring |
| US9575806B2 (en) * | 2012-06-29 | 2017-02-21 | Intel Corporation | Monitoring accesses of a thread to multiple memory controllers and selecting a thread processor for the thread based on the monitoring |
| US9298651B2 (en) | 2013-06-24 | 2016-03-29 | International Business Machines Corporation | Continuous in-memory accumulation of hardware performance counter data |
| US9600336B1 (en) | 2015-08-28 | 2017-03-21 | International Business Machines Corporation | Storing service level agreement compliance data |
| US10002063B2 (en) | 2015-10-20 | 2018-06-19 | International Business Machines Corporation | Monitoring performance of multithreaded workloads |

Similar Documents

| Publication | Publication Date | Title |
| US5727167A (en) | | Thresholding support in performance monitoring |
| US5557548A (en) | | Method and system for performance monitoring within a data processing system |
| US5970439A (en) | | Performance monitoring in a data processing system |
| EP0582635B1 (en) | | Method for increasing the speed of data processing in a computer system |
| US5937437A (en) | | Method and apparatus for monitoring address translation performance |
| US7197586B2 (en) | | Method and system for recording events of an interrupt using pre-interrupt handler and post-interrupt handler |
| US6480966B1 (en) | | Performance monitor synchronization in a multiprocessor system |
| US7895382B2 (en) | | Method and apparatus for qualifying collection of performance monitoring events by types of interrupt when interrupt occurs |
| KR100390610B1 (en) | | Method and system for counting non-speculative events in a speculative processor |
| US5991708A (en) | | Performance monitor and method for performance monitoring within a data processing system |
| US5835702A (en) | | Performance monitor |
| US7421681B2 (en) | | Method and system for autonomic monitoring of semaphore operation in an application |
| US6360337B1 (en) | | System and method to perform histogrammic counting for performance evaluation |
| US5691920A (en) | | Method and system for performance monitoring of dispatch unit efficiency in a processing system |
| US7225309B2 (en) | | Method and system for autonomic performance improvements in an application via memory relocation |
| US7962314B2 (en) | | Mechanism for profiling program software running on a processor |
| US5193179A (en) | | Activity monitor system non-obtrusive statistical monitoring of operations on a shared bus of a multiprocessor system |
| US5675729A (en) | | Method and apparatus for performing on-chip measurement on a component |
| US5752062A (en) | | Method and system for performance monitoring through monitoring an order of processor events during execution in a processing system |
| US5751945A (en) | | Method and system for performance monitoring stalls to identify pipeline bottlenecks and stalls in a processing system |
| US6961930B1 (en) | | Efficient, transparent and flexible latency sampling |
| US8381037B2 (en) | | Method and system for autonomic execution path selection in an application |
| US8789028B2 (en) | | Memory access monitoring |
| US5802378A (en) | | Performance monitoring in multiprocessor system with interrupt masking |
| US5822758A (en) | | Method and system for high performance dynamic and user programmable cache arbitration |

Legal Events

| Date | Code | Title | Description |
| | FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
| | FPAY | Fee payment | Year of fee payment: 4 |
| | REMI | Maintenance fee reminder mailed | |
| | LAPS | Lapse for failure to pay maintenance fees | |
| | STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
| | FP | Lapsed due to failure to pay maintenance fee | Effective date: 20060310 |

