Embed presentation














![Manta: Unix for Big Data•Manta allows for an arbitrarily scalable variant ofMcIlroy’s solution to Bentley’s challenge:mfind -t o /bcantrill/public/v7/usr/man | mjob create -o -m "tr -cs A-Za-z 'n' | tr A-Z a-z | sort | uniq -c" -r "awk '{ x[$2] += $1 }END { for (w in x) { print x[w] " " w } }' | sort -rn | sed ${1}q"•This description not only terse, it is high performing: datais left at rest — with the “map” phase doing heavyreduction of the data stream•As such, Manta — like Unix — is not merely syntacticsugar; it converges compute and data in a new way](/image.pl?url=https%3a%2f%2fimage.slidesharecdn.com%2ffuturestack-131218013018-phpapp02%2f75%2fBringing-the-Unix-Philosophy-to-Big-Data-15-2048.jpg&f=jpg&w=240)




The document summarizes the Unix philosophy of building systems out of small, single-purpose programs and how this approach can be applied to big data problems. It describes how Joyent's Manta object storage system brings this philosophy to big data by combining ZFS for scalable storage with OS-level virtualization using zones to allow Unix tools and approaches to be used on large datasets. Manta allows computations to be run directly on stored data rather than requiring data movement, enabling terse Unix-style one-liners to solve problems like word counting on big data.
Introduction to Unix, its minimalist philosophy, and how it revolutionized systems thinking.
Comparison of Unix solutions and traditional programming approaches to word frequency challenges.
Connection of Big Data problems to earlier Unix challenges, highlighting the lack of Unix philosophy.
Challenges of scaling for Big Data, including multi-tenancy and the need to leverage Unix philosophies.
Overview of scalable storage protocols: block, file, and object, emphasizing their pros and cons.
Discussion on OS-level virtualization vs hardware virtualization, introducing lightweight containers.
Combining ZFS and Zones for an efficient object store that leverages Unix for Big Data applications.
Introduction of Manta, a scalable system utilizing Unix philosophy for processing large data efficiently.
Manta's design principles including consistency preferences, hierarchical storage, and SDK support.
Prospects of compute/data convergence and Manta's role as a pioneering system in future Big Data solutions.
Information on Manta, including product details, documentation, and community engagement opportunities.














![Manta: Unix for Big Data•Manta allows for an arbitrarily scalable variant ofMcIlroy’s solution to Bentley’s challenge:mfind -t o /bcantrill/public/v7/usr/man | mjob create -o -m "tr -cs A-Za-z 'n' | tr A-Z a-z | sort | uniq -c" -r "awk '{ x[$2] += $1 }END { for (w in x) { print x[w] " " w } }' | sort -rn | sed ${1}q"•This description not only terse, it is high performing: datais left at rest — with the “map” phase doing heavyreduction of the data stream•As such, Manta — like Unix — is not merely syntacticsugar; it converges compute and data in a new way](/image.pl?url=https%3a%2f%2fimage.slidesharecdn.com%2ffuturestack-131218013018-phpapp02%2f75%2fBringing-the-Unix-Philosophy-to-Big-Data-15-2048.jpg&f=jpg&w=240)



