Introduction to the Zig Programming Language

The past few months I took a break from working onGenesis Digital Audio Workstationto work, instead, on creating anew programming language.

I am nothing if not ambitious, and my goal is to create a new programminglanguage that ismore pragmatic than C. This is like to trying to bemore evil than the devil himself.

So, in order, these are the priorities of Zig:

Pragmatic: At the end of the day, all that really matters is whether the language helped you do what you were trying to do better than any other language.
Optimal: The most natural way to write a program should result in top-of-the-line runtime performance, equivalent to or better than C. In places where performance is king, the optimal code should be clearly expressible.
Safe: Optimality may be sitting in the driver's seat, but safety is sitting in the passenger's seat, wearing its seatbelt, and asking nicely for the other passengers to do the same.
Readable: Zig prioritizes reading code over writing it. Avoid complicated syntax. Generally there should be a canonical way to do everything.

Design Decisions

Widely Diverging Debug and Release Builds

Zig has the concept of adebug build vs arelease build. Here is a comparison of priorities for debug mode vs release mode:

	Debug Mode	Release Mode
Time Spent Compiling	Code must compile fast. Use all manner of caching, shared objects, multithreading, whatever must be done in order to produce a binary as soon as possible.	Making a release build could take orders of magnitude longer than a debug build and that is acceptable.
Runtime Performance	Could be order of magnitude slower than release build and that is acceptable.	Optimal performance. Aggressive optimizations. Take the time needed to produce a highly efficient runtime efficiency. No compromises here.
Undefined Behavior	Whatwould be undefined behavior in a release build, is defined behavior in a debug build, and that is for the runtime to trap. That is, crash. This includes things like array bounds checking, integer overflow, reaching unreachable code. Not all undefined behavior can be caught, but a comfortably large amount can.	Undefined behavior in release mode has unspecified consequences, and this lets the optimizer produce optimal code.

The build mode is available to the source code via the expression@import("builtin").mode.

Note: Since this blog post, Zig has gainedtwo more release modes:

Release Safe
Release Small

Complete C ABI Compatibility

Part of being pragmatic is recognizing C's existing success. Interopwith C is crucial. Zig embraces C like the mean older brother who you are a littleafraid of but you still want to like you and be your friend.

In Zig, functions look like this:

fn doSomething() {    // ...}

The compiler is free to inline this function, change its parameters,and otherwise do whatever it wants, since this is an internal function.However if you decide to export it:

export fn doSomething() {    // ...}

Now this function has the C ABI, and the name shows up in the symbol tableverbatim. Likewise, you can declare an external function prototype:

extern fn puts(s: [*]const u8) c_int;

In Zig, like in C, you typically do not create a "wrapper" or "bindings" toa library, you just use it. But if you had to type out or generate all theextern function prototypes, this would be a binding. That is why Zig has the abilityto parse .h files:

use @cImport({    @cInclude("stdio.h");});

This exposes all the symbols in stdio.h - including the#define statements -to the zig program, and then you can callputs orprintf just likeyou would in C.

One of Zig's use cases isslowly transitioning a large C project to Zig.Zig can produce simple .o files for linking against other .o files, and it canalso generate .h files based on what you export. So you could write part of yourapplication in C and part in Zig, link all the .o files together and everythingplays nicely with each other.

Optional Type Instead of Null Pointer

One area that Zig provides safety without compromising efficiency orreadability is with the optional type.

The question mark symbolizes the optional type. You can convert a type to an optionaltype by putting a question mark in front of it, like this:

// normal integerconst normal_int: i32 = 1234;// optional integerconst optional_int: ?i32 = 5678;

Now the variableoptional_int could be ani32, ornull.

Instead of integers, let's talk about pointers. Null references are the source of many runtimeexceptions, and even stand accused of beingthe worst mistake of computer science.

Zig does not have them.

Instead, you can use an optional pointer. This secretly compiles down to a normal pointer,since we know we can use 0 as the null value for the maybe type. But the compilercan check your work and make sure you don't assign null to something that can't be null.

Typically the downside of not having null is that it makes the code more verbose towrite. But, let's compare some equivalent C code and Zig code.

Task: call malloc, if the result is null, return null.

C code

// malloc prototype included for referencevoid *malloc(size_t size);struct Foo *do_a_thing(void) {    char *ptr = malloc(1234);    if (!ptr) return NULL;    // ...}

Zig code

// malloc prototype included for referenceextern fn malloc(size: size_t) ?[*]u8;fn doAThing() ?*Foo {    const ptr = malloc(1234) orelse return null;    // ...}

Here, Zig is at least as convenient, if not more, than C. And, the type of "ptr" is[*]u8not?[*]u8. Theorelse operator unwrapped the maybe type and thereforeptr is guaranteed to be non-null everywhere it is used in the function.

The other form of checking against NULL you might see looks like this:

void do_a_thing(struct Foo *foo) {    // do some stuff    if (foo) {        do_something_with_foo(foo);    }    // do some stuff}

In Zig you can accomplish the same thing:

fn doAThing(optional_foo: ?*Foo) {    // do some stuff    if (optional_foo) |foo| {      doSomethingWithFoo(foo);    }    // do some stuff}

Once again, the notable thing here is that inside the if block,foo is no longer an optional pointer, it is a pointer, whichcannot be null.

One benefit to this is that functions which take pointers as arguments canbe annotated with the "nonnull" attribute -__attribute__((nonnull)) inGCC.The optimizer can sometimes make better decisions knowing that pointer argumentscannot be null.

Note: when this blog post was written, Zig did not distinguish betweenSingle Item Pointers and Unknown Length Pointers. You canread about this in the documentation.

Errors

One of the distinguishing features of Zig is its exception handling strategy.

Zig introduces two primitive types:

Error Sets
Error Unions

An error set can be declared like this:

const FileOpenError = error {  FileNotFound,  OutOfMemory,  UnexpectedToken,};

An error set is a lot like an enum, except errors from different error setswhich share a name, are defined to have the same numerical value. So eacherror name has a globally unique integer associated with it. The integer value0 is reserved.

You can refer to these error values with field access syntax, such asFileOpenError.FileNotFound. There is syntactic sugar for creating an ad-hocerror set and referring to one of its errors:error.SomethingBroke. Thisis equivalent toerror{SomethingBroke}.SomethingBroke.

In the same way that pointers cannot be null, an error set value is always an error.

const err = error.FileNotFound;

Most of the time you will not find yourself using an error set type. Instead,likely you will be using the error union type. Error unions are created withthe binary operator!, with the error set on the left and any othertype on the right:ErrorSet!OtherType.

Here is a function to parse a string into a 64-bit integer:

const ParseError = error {    InvalidChar,    Overflow,};pub fn parseU64(buf: []const u8, radix: u8) ParseError!u64 {    var x: u64 = 0;    for (buf) |c| {        const digit = charToDigit(c);        if (digit >= radix) {            return error.InvalidChar;        }        // x *= radix        if (@mulWithOverflow(u64, x, radix, &x)) {            return error.Overflow;        }        // x += digit        if (@addWithOverflow(u64, x, digit, &x)) {            return error.Overflow;        }    }    return x;}

Notice the return type isParseError!u64. This means that the functioneither returns an unsigned 64 bit integer, or one of theParseError errors.

Within the function definition, you can see some return statements that returnan error set value, and at the bottom a return statement that returns au64.Both types implicitly cast toParseError!u64.

Note: this blog post was written before Zig had the concept ofError Sets vsanyerror, andbefore Zig hadError Set Inference.Most functions in Zig can rely on error set inference, which would make the prototype ofparseU64 look like this:

pub fn parseU64(buf: []const u8, radix: u8) !u64 {    ...

What it looks like to use this function varies depending on what you'retrying to do. One of the following:

You want to provide a default value if it returned an error.
If it returned an error then you want to return the same error.
You know with complete certainty it will not return an error, so want to unconditionally unwrap it.
You want to take a different action for each possible error.

If you want to provide a default value, you can use thecatch expression:

fn doAThing(str: []u8) void {    const number = parseU64(str, 10) catch 13;    // ...}

In this code,number will be equal to the successfully parsed string, ora default value of 13. The type of the right hand side of thecatch expression mustmatch the unwrapped error union type, or of typenoreturn.

Let's say you wanted to return the error if you got one, otherwise continue with thefunction logic:

fn doAThing(str: []u8) !void {    const number = parseU64(str, 10) catch |err| return err;    // ...}

There is a shortcut for this. Thetry expression:

fn doAThing(str: []u8) !void {    const number = try parseU64(str, 10);    // ...}

try evaluates an error union expression. If it is an error, it returnsfrom the current function with the same error. Otherwise, the expression results inthe unwrapped value.

Maybe you know with complete certainty that an expression will never be an error. In this case you can do this:

const number = parseU64("1234", 10) catch unreachable;

Here we know for sure that "1234" will parse successfully. So we put theunreachable keyword on the right hand side.unreachable generatesa panic in debug mode and undefined behavior in release mode. So, while we're debugging theapplication, if therewas a surprise error here, the application would crashappropriately.

There is no syntactic shortcut forcatch unreachable. This encourages programmersto think carefully before using it.

Finally, you may want to take a different action for every situation. For that, we haveif combined withswitch:

fn doAThing(str: []u8) {    if (parseU64(str, 10)) |number| {        doSomethingWithNumber(number);    } else |err| switch (err) {        error.Overflow => {            // handle overflow...        },        // we promise that InvalidChar won't happen (or crash in debug mode if it does)        error.InvalidChar => unreachable,    }}

The important thing to note here is that ifparseU64 is modified to return a differentset of errors, Zig will emit compile errors for handling impossible error codes, and for not handlingpossible error codes.

The other component to error handling is defer statements.In addition to an unconditionaldefer, Zig haserrdefer,which evaluates the deferred expression on block exit path if and only ifthe function returned with an error from the block.

Example:

fn createFoo(param: i32) !Foo {    const foo = try tryToAllocateFoo();    // now we have allocated foo. we need to free it if the function fails.    // but we want to return it if the function succeeds.    errdefer deallocateFoo(foo);    const tmp_buf = allocateTmpBuffer() orelse return error.OutOfMemory;    // tmp_buf is truly a temporary resource, and we for sure want to clean it up    // before this block leaves scope    defer deallocateTmpBuffer(tmp_buf);    if (param > 1337) return error.InvalidParam;    // here the errdefer will not run since we're returning success from the function.    // but the defer will run!    return foo;}

The neat thing about this is that you get robust error handling withoutthe verbosity and cognitive overhead of trying to make sure every exit pathis covered. The deallocation code is always directly following the allocation code.

A couple of other tidbits about error handling:

These primitives give enough expressiveness that it's completely practical that failing to check for an error is a compile error. If you really want to ignore the error, you can usecatch unreachable and get the added benefit of crashing in debug mode if your assumption was wrong.
Since Zig understands error types, it can pre-weight branches in favor of errors not occuring. Just a small optimization benefit that is not available in other languages.
There are no C++ style exceptions or stack unwinding or anything fancy like that. Zig simply makes it convenient to pass error codes around.

Alternate Standard Library

Part of the Zig project is providing an alternative to libc.

libc has a lot of useful stuff in it, but it also hascruft.Since we're starting fresh here, we can create a new API without someof the mistakes of the 70s still haunting us, and with our 20-20 hindsight.

Further, calling dynamically linked functions isslow. Zig's philosophy is that compilingagainst the standard library in source form is worth it. In C this would becalled Link Time Optimization - where you generate Intermediate Representationinstead of machine code and then do another compile step at link time. In Zig,we skip the middle man, and create a single compilation unit with everythingin it, then run the optimizations.

So, you can choose to link against libc and take advantage of it, or you canchoose to ignore it and use the Zig standard library instead. Note, however,that virtually every C library you depend on probably also depends on libc, whichdrags libc as a dependency into your project. Using libc is still a firstclass use case for Zig.

Alternatives to the Preprocessor

The C preprocessor is extremely powerful. Maybe a littletoo powerful.

The problem with the preprocessor is that it turns one language intotwo languages that don't know about each other.

Here are some examples of where the preprocessor messes things up:

The compiler cannot catch even simple syntax errors in code that is excluded via#ifdef.
IDEs cannot implement a function, variable, or field renaming feature that works correctly. Among other mistakes, it will miss renaming things that are in code excluded via#ifdef.
Preprocessor defines do not show up in debug symbols by default.
#include is the single biggest contributor to slow compile times in both C and C++.
Preprocessor defines are problematic for bindings generators for other languages.

Regardless of the flaws, C programmers find ourselves using the preprocessorbecause it provides necessary features, such as conditional compilation,a constant that can be used for array sizes, and generics.

Zig plans to provide better alternatives to solve these problems. For example,the constant expression evaluator of Zig allows you to do this:

const array_len = 10 * 2 + 1;const Foo = struct {    array: [array_len]i32,};

This is not an amazing concept, but it eliminates one use case for#define.

Next, conditional compilation. In Zig, compilation variables are availablevia@import("builtin").

The declarations available in this import evaluate to constant expressions.You can write normal code using these constants:

const builtin = @import("builtin");fn doSomething() {    if (builtin.mode == builtin.Mode.ReleaseFast) {        // do the release behavior    } else {        // do the debug behavior    }}

This isguaranteed to leave out the if statement when the code is generated.

One use case for conditional compilation is demonstrated inlibsoundio:

static const enum SoundIoBackend available_backends[] = {#ifdef SOUNDIO_HAVE_JACK    SoundIoBackendJack,#endif#ifdef SOUNDIO_HAVE_PULSEAUDIO    SoundIoBackendPulseAudio,#endif#ifdef SOUNDIO_HAVE_ALSA    SoundIoBackendAlsa,#endif#ifdef SOUNDIO_HAVE_COREAUDIO    SoundIoBackendCoreAudio,#endif#ifdef SOUNDIO_HAVE_WASAPI    SoundIoBackendWasapi,#endif    SoundIoBackendDummy,};

Here, we want a statically sized array to have different contents depending onwhether we have certain libraries present.

In Zig, it would look something like this:

const opts = @import("build_options");const available_backends =    (if (opts.have_jack)        []SoundIoBackend{SoundIoBackend.Jack}    else        []SoundIoBackend{})    ++    (if (opts.have_pulse_audio)        []SoundIoBackend{SoundIoBackend.PulseAudio}    else        []SoundIoBackend{})    ++    (if (opts.have_alsa)        []SoundIoBackend{SoundIoBackend.Alsa}    else        []SoundIoBackend{})    ++    (if (opts.have_core_audio)        []SoundIoBackend{SoundIoBackend.CoreAudio}    else        []SoundIoBackend{})    ++    (if (opts.have_wasapi)        []SoundIoBackend{SoundIoBackend.Wasapi}    else        []SoundIoBackend{})    ++    []SoundIoBackend{SoundIoBackend.Dummy};

Here we take advantage of the compile-time array concatenation operator,++.It's a bit more verbose than the C equivalent, but the important thing is that it'sone language, not two.

Finally, generics.Zig implements generics by allowing programmers to markparameters to functions as known at compile-time.

Milestone: Tetris Implemented in Zig

This past week I achieved a fun milestone: a fully playable Tetris cloneimplemented in Zig, with the help of libc,GLFW, andlibpng.

If you're using Linux on the x86_64 architecture, which is currently theonly supported target, you coulddownload a Zig buildand thenbuild this Tetris game.

Otherwise, here's a video of me demoing it:

Resources

If you are interested in the language, feel free to participate.

Home Page
Source code and issue tracker:https://github.com/ziglang/zig
IRC channel:#zig on Freenode
Financial Support:Become a sponsor
Official Documentation

Movatterモバイル変換