Movatterモバイル変換

Posted at 2015-05-01 13:24 |RSS feed (Full text feed) |Blog Index
Next article:Friday Q&A 2015-05-29: Concurrent Memory Deallocation in the Objective-C Runtime
Previous article:Friday Q&A 2015-04-17: Let's Build Swift.Array
Tags:fridayqna fuzzing security

Friday Q&A 2015-05-01: Fuzzing with afl-fuzz

byMike Ash

With computer security high on everyone's minds these days, tools that help assess and improve the security of our code are extremely useful. Today I'm going to talk about one such tool,afl-fuzz, which has seen a lot of attention lately and produces some interesting results. I'll discuss how it works and how to use it on your own code.

Fuzzing
American Fuzzy Lop, orafl-fuzz, is a fuzzing tool. To understand what it does and how to use it, you must first understand the basic concept of fuzzing.

The idea is to stress-test programs by feeding them large quantities of computer-generated input. It's similar in concept to what you as a programmer might do when writing unit tests. You'd come up with what you think are some representative inputs and run them through the code being tested to make sure it behaves properly. With a fuzzer, the computer is generating the inputs and constantly trying your program with them. Rather than testing for correct output, it's searching for bugs that manifest in the form of crashes, hangs, or assertion failures.

There are different kinds of fuzzers, based on how they generate the input. It's possible to write a fuzzer that generates purely random inputs, but this won't produce useful results for many types of programs. An XML parser, for example, will return an error almost immediately when presented with pure random data, meaning that most of the code in the parser goes untested.

One way to improve on this is to give the fuzzer knowledge of what kind of structure the program expects. For the example of an XML parser, you might make a fuzzer that generates random legal XML documents and tests against that. Minor illegal variations on legal XML would help test error handling and edge cases. This can work extremely well, but requires a lot of domain-specific work to give the fuzzer knowledge of the program's inputs.

Another approach is to start with examples, and then produce test cases by mutating the examples by flipping bits, shuffling data around, inserting or removing data, or mixing data from multiple examples. Coming up with examples to give to the fuzzer is typically much less work than coding it to produce good examples on its own, but the fuzzer will be limited to things similar to the examples given.

afl-fuzz
afl-fuzz is an open source fuzzer that runs on various UNIX-like OSes, including Mac OS X. It can be found here:

http://lcamtuf.coredump.cx/afl/

It takes an interesting approach that combines certain aspects of the usual approaches. It uses example inputs and mutation to generate new inputs. However, it intelligently guides these mutations using the target's behavior to select interesting inputs for further mutation.

The target's behavior is monitored by injecting additional code at compile time.afl-fuzz provides a wrapper around eithergcc orclang. If you compile a program using the wrapper, the resulting binary has the necessary instrumentation injected. (afl-fuzz is also able to monitor uninstrumented binaries by running them in QEMU, but this currently doesn't work on the Mac.)

The instrumentation monitors branches taken by the program. Any given input to the program will produce a sequence of branches. Moving from one branch to another branch is a transition, and each transition is recorded. The fuzzer then favors inputs which generate new transitions, with the result that inputs are generated which explore new paths through the code. Unusual paths through the code are more likely to turn up bugs, and this helps generate those unusual paths.

In short,afl-fuzz searches for inputs that make your program behave differently. When it finds such inputs, it then alters them further to see if it can make your program behave even more differently. This lets it cover large chunks of your program's potential execution space without any special knowledge of the structure of the input it expects.

Building
The source code forafl-fuzz can be obtained from its web page above. It builds out of the box on Mac OS X by simply typingmake inside the source directory. It's a fairly small program and the build completes nearly instantaneously. The built binaries are placed in the source directory. You can install them wherever you want, or just leave them in place and run them using the appropriate path.

Creating Test Cases
You can't typically take an arbitrary program, compile it withafl-fuzz, and run it in the fuzzer. It won't know how your program expects to receive input. To settle this question,afl-fuzz passes inputs to the program by sending them on standard input. If your program doesn't already read from standard input, you'll need to write a small test harness that reads fromstdin and then invokes the code you want to fuzz.

If you're using Cocoa (and yes,afl-fuzz works fine with Objective-C), such a test harness could be written like this:

intmain(intargc,char**argv){@autoreleasepool{NSFileHandle*stdin=[NSFileHandlefileHandleWithStandardInput];NSData*data=[stdinreadDataToEndOfFile];// Do something interesting with data here}}

Just what you do with the data is up to you. If you're testing a UTF-8 parser, you'd pass the data to your parser. If you have an image decoder, treat the data like an image and try to decode it. If you're testing a data structure, you could treat each line of input as a piece of data to insert or delete. Note that while non-instrumented libraries like the ones provided by the system will happily run underafl-fuzz, the lack of instrumentation means that they're a black box to the fuzzer, and only the behavior of your own code informs its explorations. So don't try[NSImage imageWithData: data] and expect anything interesting to happen.

Of course, it's not necessary to useNSFileHandle like this. If your code deals with input byte-by-byte, callinggetchar() in a loop works nicely. If your code wants anNSStream, create one from/dev/stdin and pass it in.

Compiling Test Cases
To compile a test case, compile it as you normally would at the command line, but use theafl-clang tool rather than the normalclang command.afl-clang will compile your code withclang, inserting the necessary instrumentation as it goes along. For a simple test case that doesn't use anything besides the standard C library, you can compile it by just passing the name of the source file:

afl-clangtest-case.c

This creates an executable nameda.out which you can then run directly, or have the fuzzer work on.

For Objective-C programs, you need to also link against the Foundation framework, or the Cocoa framework if you use any of the UI parts:

afl-clang-frameworkFoundationtest-case.m

If your test case encompasses multiple files, you can compile them all together by passing them all into the command:

afl-clang-frameworkFoundationtest-case.msupport-file1.msupport-file2.m

If you prefer a name other thana.out, you can specify it with-o:

afl-clang-frameworkFoundationtest-case.m-otest-case

Running
afl-fuzz needs at least example test case to start from. You can provide more than one if you like. The test cases are plain files that live in a subdirectory that you pass to the fuzzer. We'll call ittestcases:

mkdirtestcasesecho"Test case here.">testcases/case1

Fuzzer results go in another subdirectory, which we'll callfindings. There's no need to create it, asafl-fuzz will create it when needed. We can then runafl-fuzz on the test case, giving it thetestcases andfindings directories:

afl-fuzz-itestcases-ofindings./a.out

This doesn't quite work properly on a standard Mac.afl-fuzz distinguishes between hangs and crashes by waiting for a timeout to expire. If the program is still running when the timeout fires, it's considered a hang. If the program crashes before that, it's a crash. Otherwise it's a successful run. The problem is that the OS X crash reporter triggers when the test case crashes, and generating the crash report takes substantially longer thanafl-fuzz's default timeout of 20 milliseconds. This causesafl-fuzz to report all crashes as hangs.

The best workaround for this would be to disable the system crash reporter while fuzzing. A simpler workaround is simply to increase the timeout, which can be done by passing the-t flag. I used 20000 milliseconds instead of the default. This will seriously hurt performance if there are hangs, but works well enough:

afl-fuzz-itestcases-ofindings-t20000./a.out

This will present you with a nice terminal interface:

americanfuzzylop1.67b(a.out)┌─processtiming─────────────────────────────────────┬─overallresults─────┐│runtime:0days,0hrs,37min,5sec│cyclesdone:2││lastnewpath:0days,0hrs,5min,13sec│totalpaths:112││lastuniqcrash:noneseenyet│uniqcrashes:0││lastuniqhang:noneseenyet│uniqhangs:0│├─cycleprogress────────────────────┬─mapcoverage─┴───────────────────────┤│nowprocessing:48*(42.86%)│mapdensity:108(0.16%)││pathstimedout:0(0.00%)│countcoverage:6.41bits/tuple│├─stageprogress────────────────────┼─findingsindepth────────────────────┤│nowtrying:splice17│favoredpaths:3(2.68%)││stageexecs:948/1000(94.80%)│newedgeson:6(5.36%)││totalexecs:875k│totalcrashes:0(0unique)││execspeed:641.7/sec│totalhangs:0(0unique)│├─fuzzingstrategyyields───────────┴───────────────┬─pathgeometry────────┤│bitflips:0/23.2k,0/23.2k,0/23.2k│levels:4││byteflips:0/2906,0/2019,0/2421│pending:87││arithmetics:0/88.7k,0/31.0k,0/11.7k│pendfav:0││knownints:0/9017,0/71.1k,0/116k│ownfinds:17││dictionary:0/0,0/0,0/0│imported:n/a││havoc:0/156k,17/309k│variable:0││trim:7.43%/1872,43.63%├────────────────────────┘└─────────────────────────────────────────────────────┘[cpu:39%]

There's a lot of info here, most of which I'll leave you to explore on your own. There are a few bits that are particularly interesting:

uniq crashes anduniq hangs show how many crashes and hangs have been discovered so far. The moment these say something other than zero, you've found something interesting!
last new path is how long it's been since the fuzzer found a new path through the program. This is an indication of how much progress it's making. If it's been a long time since the fuzzer found a new path, it may no longer be able to find any interesting inputs.
exec speed is how quickly it's able to run test cases. The higher this number, the more test cases the fuzzer can run through, and the faster it can discover interesting things.

As the fuzzer runs, it places information in thefindings directory. Inputs that cause crashes are placed in files in thecrashes subdirectory. Inputs that cause hangs go in thehangs subdirectory. Thequeue subdirectory contains the inputs that are interesting but haven't caused any crashes or hangs, and can be used to get an idea of the fuzzer's progress, and what kind of inputs it's working with.

Example
I wrote a small test case that quickly exercises the fuzzer and verifies that it's working as expected. This test case reads standard input and callsabort() if the input starts withdeadbeef:

intmain(intargc,char**argv){@autoreleasepool{NSFileHandle*stdin=[NSFileHandlefileHandleWithStandardInput];NSData*data=[stdinreadDataToEndOfFile];constchar*ptr=[databytes];if(ptr[0]=='d'){if(ptr[1]=='e'){if(ptr[2]=='a'){if(ptr[3]=='d'){if(ptr[4]=='b'){if(ptr[5]=='e'){if(ptr[6]=='e'){if(ptr[7]=='f'){abort();}}}}}}}}}}

The structure of this program is pretty weird. A singlememcmp call could be used instead of the massive nestedif statements. I wrote it this way to take advantage of the fact that the fuzzer instruments branches. With amemcmp call, the fuzzer is unlikely to ever discover the magicdeadbeef input needed to crash the program. With each character checked separately, the fuzzer's branch instrumentation can discover them one by one.

The program also doesn't bother to check the length of the input. As a result, if it receives fewer than eight bytes, it'll end up reading garbage. Since the intent of the program is to have problems for the fuzzer to find, this really is a feature, not a bug.

Let's compile it and make sure it works as intended:

$afl-clang-frameworkFoundationdeadbeef-test.m$echohello|./a.out$echodeadbeef|./a.outAborttrap:6

Perfect! Let's make a simple test case and get the fuzzer started:

$echo>testcases/foo$afl-fuzz-itestcases-ofindings-t20000./a.out

Eventually,uniq crashes goes to one; the fuzzer reports a crash. This took a couple of hours on my computer. We can see what input the fuzzer found to trigger a crash:

$lsfindings/crashes/README.txtid:000000,sig:06,src:000007,op:havoc,rep:2$catfindings/crashes/id\:000000\,sig\:06\,src\:000007\,op\:havoc\,rep\:2deadbeef

Exactly as expected.

It's interesting to look at the other interesting inputs the fuzzer generated along the way:

$forfinfindings/queue/*;doecho$f:;cat$f;echo;echo---;donefindings/queue/id:000000,orig:foo:---findings/queue/id:000001,src:000000,op:havoc,rep:64,+cov:d---findings/queue/id:000002,src:000001,op:havoc,rep:4,+cov:deSd---findings/queue/id:000003,src:000002,op:havoc,rep:2,+cov:deae---findings/queue/id:000004,src:000002,op:havoc,rep:1,+cov:dead---findings/queue/id:000005,src:000004,op:havoc,rep:2,+cov:deadbbdd---findings/queue/id:000006,src:000005,op:flip4,pos:5,+cov:deadbe�d---findings/queue/id:000007,src:000006,op:havoc,rep:2,+cov:deadbeea---

The progression shows up clearly, as the fuzzer works out the sequence byte by byte by seeing what triggers a new code path. There's some garbage at the end of some of the inputs, which is to be expected since it's a randomized process and surplus suffixes don't affect how the program operates.

This same principle works on more complex programs, with the fuzzer generating input that takes it down new and interesting paths in the code, and potentially finding bugs along the way.

Conclusion
There isn't much that's more entertaining than feeding crazy inputs to your programs and watching them crash.afl-fuzz automates the process and makes it much more likely that something amusing will occur. It can help you make your code more secure too, which is a nice bonus.

That's it for today. Come back next time for more computer amusements. Friday Q&A is driven by reader suggestions, so if you have an idea for a topic to cover next time or any other time, pleasesend it in!

Did you enjoy this article? I'm selling whole books full of them! Volumes II and III are now out! They're available as ePub, PDF, print, and on iBooks and Kindle.Click here for more information.

Comments:

Robert Abramovitzat2015-05-04 01:55:01:

When porting afl-1.74b the following warning is displayed:

WARNING: Fuzzing on MacOS X is slow because of the unusually high overhead of
fork() on this OS. Consider using Linux or *BSD. You can also use VirtualBox
(virtualbox.org) to put AFL inside a Linux or *BSD VM.

Should this advice be followed? Or is this just a symptom of the CrashReporter issue mentioned in the article.

mikeashat2015-05-06 16:58:37:

Apparently fork is considerably slower on OS X. I'm not sure why, but that's what they say. This will slow afl-fuzz down. It still works (as evidenced by the above!) but if you can run your code on another OS then it would presumably perform better. If you can't (for example, because you use lots of Cocoa stuff) then it still works on the Mac.

toasted_flakesat2015-05-13 12:50:00:

Note the fuzzing process is 2 order of magnitudes slower if you don't disable crash reporter.

launchctl unload -w /System/Library/LaunchAgents/com.apple.ReportCrash.plist
sudo launchctl unload -w /System/Library/LaunchDaemons/com.apple.ReportCrash.Root.plist

Comments RSS feed for this page

Add your thoughts, post a comment:

Spam and off-topic posts will be deleted without notice. Culprits may be publicly humiliated at my sole discretion.

Code syntax highlighting thanks toPygments.

Name:
The Answer to the Ultimate Question of Life, the Universe, and Everything?
Comment:
	Formatting:`<i> <b> <blockquote> <code>`.
	NOTE: Due to an increase in spam, URLs are forbidden! Please provide search terms or fragment your URLs so they don't look like URLs.