Trade-Offs in Automatic Provenance Capture

  • Conference paper

Abstract

Automatic provenance capture from arbitrary applications is a challenging problem. Different approaches to tackle this problem have evolved, most notably (a) system-event trace analysis, (b) compile-time static instrumentation, and (c) taint flow analysis using dynamic binary instrumentation. Each of these approaches offers different trade-offs in terms of the granularity of captured provenance, integration requirements, and runtime overhead. While these aspects have been discussed separately, a systematic and detailed study, quantifying and elucidating them, is still lacking. To fill this gap, we begin to explore these trade-offs for representative examples of these approaches for automatic provenance capture by means of evaluation and measurement. We base our evaluation on UnixBench, a widely used benchmark suite within systems research. We believe this approach will make our results easier to compare with future studies.

The original version of this chapter was revised.

An erratum to this chapter can be found at http://dx.doi.org/10.1007/978-3-319-40593-3_29

Notes

  1. These are sometimes termed OS level provenance systems.

  3. OPM can then be easily converted to the W3C PROV recommendation [11].

  4. Byte ranges have a memberof: field pointing back to the file they come from.

References

  1. ProvBench: A Provenance Repository for Benchmarking (2013). https://github.com/provbench. Accessed Feb 2016

  2. Balakrishnan, N., Bytheway, T., Sohan, R., Hopper, A.: OPUS: a lightweight system for observational provenance in user space. In: Proceedings of USENIX TaPP 2013, Lombard, IL, USA, April 2013

  3. Bates, A., Tian, D., Butler, K.R.B., Moyer, T.: Trustworthy whole-system provenance for the Linux Kernel. In: Proceedings of USENIX SEC 2015, Washington, DC, USA, August 2015

  4. Braun, U., Garfinkel, S.L., Holland, D.A., Muniswamy-Reddy, K.-K., Seltzer, M.I.: Issues in automatic provenance collection. In: Moreau, L., Foster, I. (eds.) IPAW 2006. LNCS, vol. 4145, pp. 171–183. Springer, Heidelberg (2006)

  5. Carata, L., Akoush, S., Balakrishnan, N., Bytheway, T., Sohan, R., Seltzer, M., Hopper, A.: A primer on provenance. ACM Queue 12(3), 10:10–10:23 (2014)

  6. Chapman, A., Blaustein, B.T., Seligman, L., Allen, M.D.: PLUS: a provenance manager for integrated information. In: Proceedings of IEEE IRI 2011, Las Vegas, NV, USA, August 2011

  7. Firth, H., Missier, P.: ProvGen: generating synthetic PROV graphs with predictable structure. In: Ludaescher, B., Plale, B. (eds.) IPAW 2014. LNCS, vol. 8628, pp. 16–27. Springer, Heidelberg (2015)

  8. Frew, J., Metzger, D., Slaughter, P.: Automatic capture and reconstruction of computational provenance. Concurr. Comput.: Pract. Exp. 20(5), 485–496 (2008)

  9. Gehani, A., Tariq, D.: SPADE: Support for Provenance Auditing in Distributed Environments. In: Narasimhan, P., Triantafillou, P. (eds.) Middleware 2012. LNCS, vol. 7662, pp. 101–120. Springer, Heidelberg (2012)

  10. Glavic, B.: Big data provenance: challenges and implications for benchmarking. In: Rabl, T., Poess, M., Baru, C., Jacobsen, H.-A. (eds.) WBDB 2012. LNCS, vol. 8163, pp. 72–80. Springer, Heidelberg (2014)

  11. Groth, P., Moreau, L.: PROV-Overview. An Overview of the PROV Family of Documents. W3C Working Group Note NOTE-prov-overview-20130430, W3C. http://www.w3.org/TR/2013/NOTE-prov-overview-20130430/

  12. Holland, D.A., Seltzer, M.I., Braun, U., Muniswamy-Reddy, K.K.: PASSing the provenance challenge. Concurr. Comput.: Pract. Exp. 20(5), 531–540 (2008)

  13. Kemerlis, V.P., Portokalidis, G., Jee, K., Keromytis, A.D.: libdft: practical dynamic data flow tracking for commodity systems. In: Proceedings of VEE 2012, London, UK, March 2012

  14. Lattner, C., Adve, V.: LLVM: a compilation framework for lifelong program analysis & transformation. In: Proceedings of CGO 2004, Palo Alto, CA, USA (2004)

  15. Luk, C.K., et al.: Pin: building customized program analysis tools with dynamic instrumentation. In: Proceedings of PLDI 2005, Chicago, IL, USA, June 2005

  16. Ma, S., Zhang, X., Xu, D.: ProTracer: towards practical provenance tracing by alternating between logging and tainting. In: Proceedings of NDSS 2016, San Diego, CA, USA, February 2016

  17. Moreau, L., et al.: The open provenance model core specification (v1.1). Future Gener. Comput. Syst. 27(6), 743–756 (2011)

  18. Pohly, D.J., McLaughlin, S., McDaniel, P., Butler, K.: Hi-Fi: collecting high-fidelity whole-system provenance. In: Proceedings of ACSAC 2012, Orlando, FL, USA, December 2012

  19. Simmhan, Y.L., Plale, B., Gannon, D.: A survey of data provenance in e-Science. SIGMOD Rec. 34(3), 31–36 (2005)

  20. Smith, B., Lucas, K., et al.: UnixBench: The original BYTE UNIX benchmark suite (2011). https://github.com/kdlucas/byte-unixbench. Accessed Feb 2016

  21. Stamatogiannakis, M., Groth, P., Bos, H.: Looking inside the black-box: capturing data provenance using dynamic instrumentation. In: Ludaescher, B., Plale, B. (eds.) IPAW 2014. LNCS, vol. 8628, pp. 155–167. Springer, Heidelberg (2015)

  22. Tariq, D., Ali, M., Gehani, A.: Towards automated collection of application-level data provenance. In: Proceedings of USENIX TaPP 2012, Boston, MA, USA (2012)

Acknowledgments

This material is based upon work supported by the National Science Foundation under Grant IIS-1116414. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Author information

Authors and Affiliations

  1. Computer Science Institute, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands

    Manolis Stamatogiannakis, Remco Vermeulen & Herbert Bos

  2. SRI International, Menlo Park, USA

    Hasanat Kazmi, Hashim Sharif & Ashish Gehani

  3. Elsevier Labs, Amsterdam, The Netherlands

    Paul Groth

Corresponding author

Correspondence to Manolis Stamatogiannakis.

Editor information

Editors and Affiliations

  1. COPPE/UFRJ, Rio de Janeiro, Brazil

    Marta Mattoso

  2. Illinois Institute of Technology, Chicago, Illinois, USA

    Boris Glavic

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Stamatogiannakis, M. et al. (2016). Trade-Offs in Automatic Provenance Capture. In: Mattoso, M., Glavic, B. (eds) Provenance and Annotation of Data and Processes. IPAW 2016. Lecture Notes in Computer Science, vol 9672. Springer, Cham. https://doi.org/10.1007/978-3-319-40593-3_3
