Bitmasks for nicer APIs

Written on 10 Dec 2020

Bitmasks is one of those things where the basic idea is simple to understand:it’s just0s and1s being toggled on and off. But actually “having it click”to the point where it’s easy to work with can be a bit trickier. At least, it is(or rather, was) for me 😅

With a bitmask you hide (or “mask”) certain bits of a number, which can beuseful for various things as we’ll see later on. There are two reasons one mightuse bitmasks: for efficiency or for nicer APIs. Efficiency is rarely an issueexcept for some embedded or specialized use cases, but everyone likes nice APIs,so this is about that.

A while ago I added colouring support to my littlezli library. Addingcolours to your terminal is not very hard as such, just print an escape code:

fmt.Println("\x1b[34mRed text!\x1b[0m")

But a library makes this a bit easier. There’s already a bunch of libraries outthere for Go specifically, the most popular being Fatih Arslan’scolor:

color.New(color.FgRed).Add(color.Bold).Add(color.BgCyan).Println("bold red")

This is stored as:

type (    Attributeint    Colorstruct { params  []Attribute })

I wanted a simple way to add some colouring, which looks a bit nicer than themethod chain in thecolor library, and eventually figured out you don’t need a[]int to store all the different attributes but that a singleuint64 will doas well:

zli.Colorf("bold red", zli.Red | zli.Bold | zli.Cyan.Bg())// Or alternatively, use Color.String():fmt.Printf("%sbold red%s\n", zli.Red|zli.Bold|zli.Cyan.Bg(), zli.Reset)

Which in my eyes looks a bit nicer than Fatih’s library, and also makes iteasier to add 256 and true colour support.

All of the below can be used in any language by the way, and little of this isspecific to Go. You will need Go 1.13 or newer for the binary literals to work.

Here’s howzli stores all of this in auint64:

                                   fg true, 256, 16 color mode ─┬──┐                                bg true, 256, 16 color mode ─┬─┐│  │                                                             │ ││  │┌── parsing error ┌───── bg color ────────────┐ ┌───── fg color ────────────┐ │ ││  ││┌─ term attr v                           v v                           v v vv  vvv         v 0000_0000 0000_0000 0000_0000 0000_0000 0000_0000 0000_0000 0000_0000 0000_0000 ^         ^         ^         ^         ^         ^         ^         ^64        56        48        40        32        24        16         8

I’ll go over it in detail later, but in short (from right to left):

The first 9 bits are flags for the basic terminal attributes such as bold,italic, etc.
The next bit is to signal a parsing error for true colour codes (e.g.#123123).
There are 3 flags for the foreground and background colour each to signal thata colour should be applied, and how it should be interpreted (there are 3different ways to set the colour: 16-colour, 256-colour, and 24-bit “truecolour”, which use different escape codes).
The colours for the foreground and background are stored separately, becauseyou can apply both a foreground and background. These are 24-bit numbers.
A value of 0 is reset.

With this, you can make any combination of the common text attributes, the aboveexample:

zli.Colorf("bold red", zli.Red | zli.Bold | zli.Cyan.Bg())

Would be the following in binary layout:

                                              fg 16 color mode ────┐                                           bg 16 color mode ───┐   │                                                               │   │        bold                bg color ─┬──┐                fg color ─┬──┐   │   │           │                          v  v                          v  v   v   v           v 0000_0000 0000_0000 0000_0110 0000_0000 0000_0000 0000_0001 0010_0100 0000_0001 ^         ^         ^         ^         ^         ^         ^         ^64        56        48        40        32        24        16         8

We need to go through several steps to actually do something meaningful withthis. First, we want to get all the flag values (the first 9 bits); a “flag” isa bit being set to true (1) or false (0).

const (    Bold         =0b0_0000_0001    Faint        =0b0_0000_0010    Italic       =0b0_0000_0100    Underline    =0b0_0000_1000    BlinkSlow    =0b0_0001_0000    BlinkRapid   =0b0_0010_0000    ReverseVideo =0b0_0100_0000    Concealed    =0b0_1000_0000    CrossedOut   =0b1_0000_0000)func applyColor(cuint64) {if c & Bold !=0 {// Write escape code for bold    }if c & Faint !=0 {// Write escape code for faint    }// etc.}

& is the bitwise AND operator. It works just as the more familiar&& exceptthat it operates on every individual bit where0 isfalse and1 istrue.The end result will be1 if both bits are “true” (1). An example with justfour bits:

0011 & 0101 = 0001

This can be thought of as four separate operations (from left to right):

0 AND 0 = 0      both false0 AND 1 = 0      first value is false, so the end result is false1 AND 0 = 0      second value is false1 AND 1 = 1      both true

So whatif c & Bold != 0 does is check if the “bold bit” is set:

Only bold set:  0 0000 0001& 0 0000 0001= ―――――――――――  0 0000 0001Underline bit set:  0 00001000& 0 0000 0001= ―――――――――――  0 0000 0000                0 since there are no cases of "1 AND 1"Bold and underline bits set:  0 00001001& 0 0000 0001= ―――――――――――  0 0000 0001                Only "bold AND bold" is "1 AND 1"

As you can see,c & Bold != 0 could also be written asc & Bold == Bold.

The colours themselves are stored as a regular number like any other, exceptthat they’re “offset” a number of bits. To get the actual number value we needto clear all the bits we don’t care about, and shift it all to the right:

const (    colorOffsetFg   =16    colorMode16Fg   =0b0000_0100_0000_0000    colorMode256Fg  =0b0000_1000_0000_0000    colorModeTrueFg =0b0001_0000_0000_0000    maskFg          =0b00000000_00000000_00000000_11111111_11111111_11111111_00000000_00000000)func getColor(cuint64) {if c & colorMode16Fg !=0  {        cc := (c & maskFg) >> colorOffsetFg// ..write escape code for this color..    }}

First we check if the “16 colour mode” flag is set using the same method as theterminal attributes, and then we AND it withmaskFg to clear all the bits wedon’t care about:

                                   fg true, 256, 16 color mode ─┬──┐                                bg true, 256, 16 color mode ─┬─┐│  │                                                             │ ││  │┌── parsing error ┌───── bg color ────────────┐ ┌───── fg color ────────────┐ │ ││  ││┌─ term attr v                           v v                           v v vv  vvv         v 0000_0000 0000_0000 0000_0110 0000_0000 0000_0000 0000_0001 0010_0100 0000_1001AND maskFg 0000_0000_0000_0000_0000_0000_1111_1111_1111_1111_1111_1111_0000_0000_0000_0000= 0000_0000 0000_0000 0000_0000 0000_0000 0000_0000 0000_0001 0000_0000 0000_0000 ^         ^         ^         ^         ^         ^         ^         ^64        56        48        40        32        24        16         8

After the AND operation we’re left with just the 24 bits we care about, andeverything else is set to0. To get a normal number from this we need to shiftthe bits to the right with>>:

1010 >> 1 = 0101    All bits shifted one position to the right.1010 >> 2 = 0010    Shift two, note that one bit gets discarded.

Instead of>> 16 you can also subtract65535 (a 16-bit number):(c &maskFg) - 65535. The end result is the same, but bit shifts are much easier toreason about in this context.

We repeat this for the background colour (except that we shift everything 40bits to the right). The background is actually a bit easier since we don’t needto AND anything to clear bits, as all the bits to the right will just bediscarded:

cc := c >> ColorOffsetBg

For 256 and “true” 24-bit colours we do the same, except that we need to senddifferent escape codes for them, which is a detail that doesn’t really matterfor this explainer about bitmasks.

To set the background colour we use theBg() function to transforms aforeground colour to a background one. This avoids having to defineBgCyanconstants like Fatih’s library, and makes working with 256 and true coloureasier.

const (    colorMode16Fg   =0b00000_0100_0000_0000    colorMode16Bg   =0b0010_0000_0000_0000    maskFg          =0b00000000_00000000_00000000_11111111_11111111_11111111_00000000_00000000)func Bg(cuint64)uint64 {if c & colorMode16Fg !=0 {        c = c ^ colorMode16Fg | colorMode16Bg    }return (c &^ maskFg) | (c & maskFg <<24)}

First we check if the foreground colour flags is set; if it is then move thatbit to the corresponding background flag.

| is the OR operator; this works like|| except on individual bits like inthe above example for&. Note that unlike|| it won’t stop if the firstcondition is false/0: if any of the two values are1 the end result will be1:

0 OR 0 = 0      both false0 OR 1 = 1      second value is true, so end result is true1 OR 0 = 1      first value is true1 OR 1 = 1      both true  0011| 0101= ――――  0111

^ is the “exclusive or”, or XOR, operator. It’s similar to OR except that itonly outputs1 if exactly one value is1, and not if both are:

0 XOR 0 = 0      both false0 XOR 1 = 1      second value is true, so end result is true1 XOR 0 = 1      first value is true1 XOR 1 = 0      both true, so result is 0  0011^ 0101= ――――  0110

Putting both together,c ^ colorMode16Fg clears the foreground flag and|colorMode16Bg sets the background flag.

The last line moves the bits from the foreground colour to the backgroundcolour:

return (c &^ maskFg) | (c & maskFg <<24)

&^ is “AND NOT”: these are two operations: first it will inverse the rightside (“NOT”) and then ANDs the result. So in our example themaskFg value isinversed:

 0000_0000_0000_0000_0000_0000_1111_1111_1111_1111_1111_1111_0000_0000_0000_0000NOT 1111_1111_1111_1111_1111_1111_0000_0000_0000_0000_0000_0000_1111_1111_1111_1111

We then used this inversedmaskFg value to clear the foreground colour,leaving everything else intact:

 1111_1111_1111_1111_1111_1111_0000_0000_0000_0000_0000_0000_1111_1111_1111_1111AND 0000_0000 0000_0000 0000_0110 0000_0000 0000_0000 0000_0001 0010_0100 0000_1001= 0000_0000 0000_0000 0000_0110 0000_0000 0000_0000 0000_0000 0010_0100 0000_1001 ^         ^         ^         ^         ^         ^         ^         ^64        56        48        40        32        24        16         8

C and most other languages don’t have this operator and have~ for NOT (whichGo doesn’t have), so the above would be(c & ~maskFg) in most other languages.

Finally, we set the background colour by clearing all bits that are not part ofthe foreground colour, shifting them to the correct place, and ORing this to getthe final result.

I skipped a number of implementation details in the above example for clarity,especially for people not familiar with Go.The full code is of courseavailable. Putting all ofthis together gives a fairly nice API IMHO in about 200 lines of code whichmostly avoids boilerplateism.

I only showed the 16-bit colours in the examples, in reality most of this isduplicated for 256 and true colours as well. It’s all the same logic, just withdifferent values. I also skipped over the details of terminal colour codes, asthis article isn’t really about that.

In many of the above examples I used binary literals for the constants, and thisseemed the best way to communicate how it all works for this article. This isn’tnecessarily the best or easiest way to write things in actual code, especiallynot for such large numbers. In the actual code it looks like:

const (    ColorOffsetFg =16    ColorOffsetBg =40)const (    maskFg Color = (256*256*256 -1) << ColorOffsetFg    maskBg Color = maskFg << (ColorOffsetBg - ColorOffsetFg))// Basic terminal attributes.const (    Reset Color =0    Bold  Color =1 << (iota -1)    Faint// ...)

Figuring out how this works is left as an exercise for the reader :-)

Another thing that might be useful is a little helper function to print a numberas binary; it helps visualise things if you’re confused:

func bin(cuint64) {    reBin := regexp.MustCompile(`([01])([01])([01])([01])([01])([01])([01])([01])`)    reverse :=func(sstring)string {        runes := []rune(s)for i, j :=0,len(runes)-1; i < j; i, j = i+1, j-1 {            runes[i], runes[j] = runes[j], runes[i]        }returnstring(runes)    }    fmt.Printf("%[2]s →%[1]d\n", c,        reverse(reBin.ReplaceAllString(reverse(fmt.Sprintf("%064b", c)),`$1$2$3${4}_$5$6$7$8 `)))}

I put a slighly more advanced version of this atzgo.at/zstd/zfmt.Binary.

You can also write a little wrapper to make things a bit easier:

type Bitflag64uint64uint64func (f Bitflag64) Has(flag Bitflag64)bool {return f&flag !=0 }func (f *Bitflag64) Set(flag Bitflag64)     { *f = *f | flag }func (f *Bitflag64) Clear(flag Bitflag64)   { *f = *f &^ flag }func (f *Bitflag64) Toggle(flag Bitflag64)  { *f = *f ^ flag }

If you need more than 64 bits then not all is lost; you can usetype thingy[2]uint64.

Here’s an example where I did it wrong:

type APITokenPermissionsstruct {    Countbool    Exportbool    SiteReadbool    SiteCreatebool    SiteUpdatebool}

This records the permissions for an API token the user creates. Looks nice, buthow do you check thatonlyCount is set?

if p.Count &&!p.Export &&!p.SiteRead &&!p.SiteCreate &&!p.SiteUpdate { .. }

Ugh; not very nice, and neither is checking if multiple permissions are set:

if perm.Export && perm.SiteRead && perm.SiteCreate && perm.SiteUpdate { .. }

Had I stored it as a bitmask instead, it would have been easier:

// Only Count is set.if perm &^ Count ==0 { .. }// Everything in permSomething is set.const permSomething = perm.Export | perm.SiteRead | perm.SiteCreate | perm.SiteUpdateif perm & permSomething ==0 { .. }

No one likes functions with these kind of signatures either:

f(false,false,true)f(true,false,true)

But with a bitmask things can look a lot nicer:

const (    AddWarpdrive   =0b001    AddTractorBeam =0b010    AddPhasers     =0b100)f(AddPhasers)f(AddWarpdrive | AddPhasers)

Other Programming posts

25 Apr 2020Storing files in .git
22 Feb 2021Go is not an easy language
7 Oct 2019On being the right size
29 Nov 2020Stupid light software
22 Mar 2019Easy means easy to debug
5 Dec 2019Good comments read well and are to the point
7 Feb 2016The downsides of JSON for config files
7 Jan 2019Testing isn’t everything
4 Sep 2016YAML: probably not so great after all
16 Nov 2024Against best practices
25 Nov 2020An API is a user interface

Other Go posts

10 Apr 2018Go testing style guide
28 Oct 2024Jia Tanning Go code
11 Apr 2020Statically compiling Go programs
21 Nov 2019Go’s features of last resort
5 Feb 2020Running Go CLI programs in the browser with WASM
22 Feb 2021Go is not an easy language
1 May 2019Using flags for configuration in Go

Movatterモバイル変換