Movatterモバイル変換

565Accesses
14Citations
Explore all metrics

Abstract

In high-performance computing applications, a high-level I/O call will trigger activities on a multitude of hardware components. These are massively parallel systems supported by huge storage systems and internal software layers. Their complex interplay currently makes it impossible to identify the causes for and the locations of I/O bottlenecks. Existing tools indicate when a bottleneck occurs but provide little guidance in identifying the cause or improving the situation.

We have thus initiatedScalable I/O for Extreme Performance to find solutions for this problem. To achieve this goal inSIOX, we will build a system to record access information on all layers and components, to recognize access patterns, and to characterize the I/O system. The system will ultimately be able to recognize the causes of the I/O bottlenecks and propose optimizations for the I/O middleware that can improve I/O performance, such as throughput rate and latency. Furthermore, the SIOX system will be able to support decision making while planning new I/O systems.

In this paper, we introduce the SIOX system and describe its current status: We first outline our approach for collecting the required access information. We then provide the architectural concept, the methods for reconstructing the I/O path and an excerpt of the interface for data collection. This paper focuses especially on the architecture, which collects and combines the relevant access information along the I/O path, and which is responsible for the efficient transfer of this information. An abstract modelling approach allows us to better understand the complexity of the analysis of the I/O activities on parallel computing systems, and an abstract interface allows us to adapt the SIOX system to various HPC file systems.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Institutional subscriptions

A Methodology for Performance Analysis of Applications Using Multi-layer I/O

Upgrading a high performance computing environment for massive data processing

ArticleOpen access16 October 2019

PIOM-PX: A Framework for Modeling the I/O Behavior of Parallel Scientific Applications

Notes

References

Babu S, Borisov N, Uttamchandani S, Routray R, Singh A (2009) DIADS: addressing the “My-problem-or-yours” syndrome with integrated SAN and database diagnosis. In: FAST’09: proceedings of the 7th conference on file and storage technologies. USENIX Association, Berkeley, pp 57–70
Google Scholar
Barham P, Donnelly ARI, Mortier R (2004) Using magpie for request extraction and workload modelling. Microsoft Research
Chaarawi M, Gabriel E, Keller R, Graham RL, Dongarra JJ (2011) OMPIO: a modular software architecture for MPI I/O. Springer, Berlin/Heidelberg
Google Scholar
Geimer M, Wolf F, Wylie BJN, Becker D, Böhme D, Frings W, Hermanns MA, Mohr B, Szebenyi Z (2009) Recent developments in the scalasca toolset. In: Tools for high performance computing, proceedings of the 3rd international workshop on parallel tools. Springer, Berlin
Google Scholar
Hermanns MA, Geimer M, Wolf F, Wylie BJN (2009) Verifying causality between distant performance phenomena in large-scale MPI applications. In: Proceedings of the 17th Euromicro international conference on parallel, distributed, and network-based processing (PDP), Weimar, Germany. IEEE Computer Society Press, Los Alamitos, pp 78–84
Google Scholar
Knüpfer A, Nagel WE (2006) Compressible memory data structures for event-based trace analysis. Future Gener Comput Syst 22:359–368
Article Google Scholar
Knüpfer A, Brunst H, Doleschal J, Jurenz M, Lieber M, Mickler H, Müller MS, Nagel WE (2008) The Vampir performance analysis tool-set. In: Tools for high performance computing, proceedings of the 2nd international workshop on parallel tools. Springer, Berlin, pp 139–155
Chapter Google Scholar
Kunkel J (2011) HDTrace—a tracing and simulation environment of application and system interaction. Tech. Rep. 2, Deutsches Klimarechenzentrum GmbH, Bundesstraße 45a, 20146, Hamburg
Kunkel J, Ludwig T (2011) IOPm—modeling the I/O path with a functional representation of parallel file system and hardware architecture, to be published
Lofstead J, Zheng F, Klasky S, Schwan K (2009) Adaptable, metadata rich IO methods for portable high performance IO. IEEE Computer Society, Washington
Google Scholar
Minartz T, Molka D, Kunkel J, Knobloch M, Kuhn M, Ludwig T (2012) Handbook of energy-aware and green computing. Chapman and Hall/CRC Press Taylor and Francis Group LLC, Boca Raton, p 600
Google Scholar
Noeth M, Ratn P, Mueller F, Schulz M, de Supinski BR (2009) ScalaTrace: scalable compression and replay of communication traces for high performance computing. J Parallel Distrib Comput 69:696–710
Article Google Scholar
Shende SS, Malony AD (2006) The TAU parallel performance system. Int J High Perform Comput Appl 20(2):287–311
Article Google Scholar
Thakur R, Gropp W, Lusk E (1999) On implementing MPI-IO portably and with high performance. ACM Press, New York
Google Scholar
Thereska E, Salmon B, Salmon O, Strunk J, Wachs M, Abd-el-Malek M, Lopez J, Ganger GR (2006) Stardust: tracking activity in a distributed storage system. In: ACM SIGMETRICS conference on measurement and modeling of computer systems. ACM Press, New York, pp 3–14
Google Scholar

Download references

Author information

Authors and Affiliations

Bundesstraße 45a, 20146, Hamburg, Germany
Marc C. Wiedemann
Universität Hamburg—Deutsches Klimarechenzentrum GmbH, Hamburg, Germany
Marc C. Wiedemann, Julian M. Kunkel, Michaela Zimmer & Thomas Ludwig
High Performance Computing Center Stuttgart (HLRS), Universität Stuttgart, Stuttgart, Germany
Michael Resch, Thomas Bönisch, Xuan Wang & Andriy Chut
Zentrum für Informationsdienste und Hochleistungsrechnen, Technische Universität Dresden, Dresden, Germany
Alvaro Aguilera, Wolfgang E. Nagel, Michael Kluge & Holger Mickler

Authors

Marc C. Wiedemann
View author publications
You can also search for this author inPubMed Google Scholar
Julian M. Kunkel
View author publications
You can also search for this author inPubMed Google Scholar
Michaela Zimmer
View author publications
You can also search for this author inPubMed Google Scholar
Thomas Ludwig
View author publications
You can also search for this author inPubMed Google Scholar
Michael Resch
View author publications
You can also search for this author inPubMed Google Scholar
Thomas Bönisch
View author publications
You can also search for this author inPubMed Google Scholar
Xuan Wang
View author publications
You can also search for this author inPubMed Google Scholar
Andriy Chut
View author publications
You can also search for this author inPubMed Google Scholar
Alvaro Aguilera
View author publications
You can also search for this author inPubMed Google Scholar
Wolfgang E. Nagel
View author publications
You can also search for this author inPubMed Google Scholar
Michael Kluge
View author publications
You can also search for this author inPubMed Google Scholar
Holger Mickler
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toMarc C. Wiedemann.

Additional information

We want to express our gratitude to the “Deutsches Zentrum für Luft- und Raumfahrt e.V.” as responsible project agency and to the “Bundesministerium für Bildung und Forschung” for the financial support under grant01 IH 11008 A-C.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wiedemann, M.C., Kunkel, J.M., Zimmer, M.et al. Towards I/O analysis of HPC systems and a generic architecture to collect access patterns.Comput Sci Res Dev28, 241–251 (2013). https://doi.org/10.1007/s00450-012-0221-5

Download citation

Published:23 May 2012
Issue Date:May 2013
DOI:https://doi.org/10.1007/s00450-012-0221-5

Movatterモバイル変換

Towards I/O analysis of HPC systems and a generic architecture to collect access patterns

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Methodology for Performance Analysis of Applications Using Multi-layer I/O

Upgrading a high performance computing environment for massive data processing

PIOM-PX: A Framework for Modeling the I/O Behavior of Parallel Scientific Applications

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Access this article

Subscribe and save

Buy Now