Movatterモバイル変換


[0]ホーム

URL:


skip to main content
10.5555/243846.243861acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
micro

Export Citations

    • Please download or close your previous search result export first before starting a new bulk export.
      Preview is not available.
      By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.

    Hot cold optimization of large Windows/NT applications

    Published:02 December 1996Publication History
    Metrics
    Total Citations25
    Total Downloads391
    Last 12 Months62
    Last 6 weeks11

    New Citation Alert added!

    This alert has been successfully added and will be sent to:

    You will be notified whenever a record that you have chosen has been cited.

    To manage your alert preferences, click on the button below.

    Manage my Alerts

    New Citation Alert!

    Abstract

    A dynamic instruction trace often contains many unnecessary instructions that are required only by the unexecuted portion of the program. Hot-cold optimization (HCO) is a technique that realizes this performance opportunity. HCO uses profile information to partition each routine into frequently executed (hot) and infrequently executed (cold) parts. Unnecessary operations in the hot portion are removed, and compensation code is added on transitions from hot to cold as needed. We evaluate HCO on a collection of large Windows NT applications. HCO is most effective on the programs that are call intensive and have flat profiles, providing a 3-8% reduction in path length beyond conventional optimization.

    References

    [1]
    D. Blickstein, et al, "The GEM optimizing compiler system," Digital Technical Journal, 4(4): 12 l- 136.
    [2]
    B. Calder and D. Grunwald, "Reducing branch costs via branch alignment," in ASPLOS Vi Proc., San Jose, CA, Nov. 1994
    [3]
    B. Calder, D. Grunwald, and A. Srivastava, "The predictability of branches in libraries," in Proc. of the 28th Annual Intl. Symp. on Microarchitecture, pp. 24-34, Ann Arbor, MI, Nov. 1995
    [4]
    Alpha NT Calling Standard. http://www.partner.digital.com/www-swdev/pages/Home/ TECH/documents/alpha_cookbook/biblio.htm
    [5]
    P.P. Chang, S.A. Mahlke, and W.W. Hwu, "Using profile information to assist classic code optimizations," Software Prac. and Exp., 21(12): 1301-1321, 1991
    [6]
    F.C. Chow, "Minimizing register usage penalty at procedure calls," in Proc. A CM SIGPLAN Conf. on Programming Language Design and Implementation '88, A CM pp. 85-94, Atlanta, GA, June 1988
    [7]
    A.V. Aho, R. Sethi, and J.D. Ullman. Compilers: Principles, Techniques, and Tools. Addison-Wesley, 1985.
    [8]
    D. Goodwin and K. Wilken, "Optimal and nearoptimal global register allocation using 0-1 integer programming," Software-Practice & Exp., To appear 1996.
    [9]
    R.E. Hank, W.W. Hwu, and B.R. Rau, "Region-based compilation," in Proc. of the 28th Annual Intl. Syrup. on Microarchitecture, pp. 158-168, Ann Arbor, MI, Nov. 1995
    [10]
    W.W. Hwu and P.P. Chang, "Achieving high instruction cache performance with an optimizing compiler," in Proc. 16th Annual Intl. Symp. on Computer Architecture, Jerusalem, Israel, June 1989
    [11]
    J. Knoop, O. Rfithing, B Steffen, "Partial dead code elimination," in Proc. A CM SIGPLAN Conf. on Programming Language Design and Implementation '94, pp. 147-158, Orlando, FL, June 1994
    [12]
    S.M. Kurlander and C.N. Fischer, "Minimum cost interprocedural register allocation," in The 23rd A CM SIGPLAN SIGA CT Symp. on Principles of Programming Languages, pp. 230-241, St. Petersburg, Florida, Jan., 1996.
    [13]
    J.R. Larus and E. Schnarr, "EEL: Machineindependent executable editing," in Proc. A CM SIGPLAN Conf. on Programming Language Design and Impl. '95, pp. 291-300, La Jolla, CA, June 1995
    [14]
    S.A. Mahlke, et al., "Effective compiler support for predicated execution using the hyperblock," in Proc. of the 25th Annual Intl. Symp. on Microarchitecture, pp. 45-54, Dec. 1992.
    [15]
    S. McFarling, "Program optimization for instruction caches," in ASPLOS Iii Proc., pp. 183-193, Boston, MA, April 1989.
    [16]
    K. Pettis and R.C. Hansen, "Profile Guided Code Positioning" in Proc. ACM SIGPLAN Conf. on Programming Language Design and Implementation '90, pp. 16-27, White Plains, NY, June 1990
    [17]
    V. Santhanam and D. Odnert, "Register allocation across procedure and module boundaries" in Proc. A CM SIGPLAN Conf. on Programming Language Design and Implementation '90, White Plains, NY, June 1990
    [18]
    R. L. Sites and S. Perl, "PatchWrx -- A dynamic execution tracing tool," http://www.research.digital.com /SRC/personal/Dick_Sites/patchwrx/PatchWrx.html.
    [19]
    A. Srivastava and D. Wall, "Link-time optimization of address calculation on a 64-bit architecture," in Proc. ACM SiGPLAN Conf. on Programming Language Design and Implementation '94, pp. 49-60, Orlando, FL, June 1994
    [20]
    A. Srivastava and A. Eustace. "ATOM: A system for building customized program analysis tools," in Proc. SIGPLAN 94 Conf. on Programming Language Design and Implementation, pp. 85-96, Orlando, Florida, June 1994
    [21]
    D.W. Wall, "Global register allocation at link time" in Proc. SIGPLAN 86 Symp. on Compiler Construction, pp. 264- 275, Palo Alto, CA, June 1986
    [22]
    L.S. Wilson, C.A. Neth, M.J. Rickabaugh, "Delivering binary object modification tools for program analysis and optimization," volume 8,1 of Digital Technical Journal, pp. 18-31, 1996

    Cited By

    View all
    • Ayers GNagendra NAugust DCho HKanev SKozyrakis CKrishnamurthy TLitz HMoseley TRanganathan PManne SHunter HAltman E(2019)AsmDBProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322234(462-473)Online publication date: 22-Jun-2019
    • Sun EKaeli DCohn RHollingsworth JKumar N(2009)A binary instrumentation tool for the Blackfin processorProceedings of the Workshop on Binary Instrumentation and Applications10.1145/1791194.1791201(43-51)Online publication date: 12-Dec-2009
    • Tallent NMellor-Crummey JFagan M(2009)Binary analysis for measurement and attribution of program performanceACM SIGPLAN Notices10.1145/1543135.154252644:6(441-452)Online publication date: 15-Jun-2009
    • Show More Cited By

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MICRO 29: Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
    December 1996
    359 pages
    ISBN:0818676418
    Copyright © Copyright (c) 1996 Institute of Electrical and Electronics Engineers, Inc. All rights reserved.

    Sponsors

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 02 December 1996

    Check for updates

    Author Tags

    1. NT
    2. optimization
    3. profile
    4. register allocation

    Qualifiers

    • Article

    Conference

    MICRO96
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 484 of 2,242 submissions, 22%

    Upcoming Conference

    MICRO 2025
    58th IEEE/ACM International Symposium on Microarchitecture
    October 18 - 22, 2025
    Seoul , Republic of Korea

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)62
    • Downloads (Last 6 weeks)11
    Reflects downloads up to 05 May 2025

    Other Metrics

    Citations

    Cited By

    View all
    • Ayers GNagendra NAugust DCho HKanev SKozyrakis CKrishnamurthy TLitz HMoseley TRanganathan PManne SHunter HAltman E(2019)AsmDBProceedings of the 46th International Symposium on Computer Architecture10.1145/3307650.3322234(462-473)Online publication date: 22-Jun-2019
    • Sun EKaeli DCohn RHollingsworth JKumar N(2009)A binary instrumentation tool for the Blackfin processorProceedings of the Workshop on Binary Instrumentation and Applications10.1145/1791194.1791201(43-51)Online publication date: 12-Dec-2009
    • Tallent NMellor-Crummey JFagan M(2009)Binary analysis for measurement and attribution of program performanceACM SIGPLAN Notices10.1145/1543135.154252644:6(441-452)Online publication date: 15-Jun-2009
    • Tallent NMellor-Crummey JFagan MHind MDiwan A(2009)Binary analysis for measurement and attribution of program performanceProceedings of the 30th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/1542476.1542526(441-452)Online publication date: 15-Jun-2009
    • Das ALu JChen HKim JYew PHsu WChen D(2005)Performance of Runtime Optimization on BLASTProceedings of the international symposium on Code generation and optimization10.1109/CGO.2005.25(86-96)Online publication date: 20-Mar-2005
    • Suganuma TYasue TNakatani TCytron RGupta R(2003)A region-based compilation technique for a Java just-in-time compilerProceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation10.1145/781131.781166(312-323)Online publication date: 9-Jun-2003
    • Sullivan GBruening DBaron IGarnett TAmarasinghe SGregg DErtl M(2003)Dynamic native optimization of interpretersProceedings of the 2003 workshop on Interpreters, virtual machines and emulators10.1145/858570.858576(50-57)Online publication date: 8-Jun-2003
    • Suganuma TYasue TNakatani T(2003)A region-based compilation technique for a Java just-in-time compilerACM SIGPLAN Notices10.1145/780822.78116638:5(312-323)Online publication date: 9-May-2003
    • Bruening DGarnett TAmarasinghe SJohnson RConte THwu W(2003)An infrastructure for adaptive dynamic optimizationProceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization10.5555/776261.776290(265-275)Online publication date: 23-Mar-2003
    • Haber GKlausner MEisenberg VMendelson BGurevich MJohnson RConte THwu W(2003)Optimization opportunities created by global data reorderingProceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization10.5555/776261.776286(228-237)Online publication date: 23-Mar-2003
    • Show More Cited By

    View Options

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Copied!

    Copying failed.

    Share on social media

    Affiliations

    RobertCohn
    Digital Equipment Corporation, Hudson, Massachusetts
    P. GeoffreyLowney
    Digital Equipment Corporation, Hudson, Massachusetts
    View Table of Conten
    Your Search Results Download Request

    We are preparing your search results for download ...

    We will inform you here when the file is ready.

    Download now!
    Your Search Results Download Request

    Your file of search results citations is now ready.

    Download now!
    Your Search Results Download Request

    Your search export query has expired. Please try again.


    [8]ページ先頭

    ©2009-2025 Movatter.jp