Dailydave mailing list archives

Fuzzing


From: Jared DeMott <demottja () msu edu>
Date: Mon, 15 May 2006 12:25:59 -0400

"We developed fuzz testing, the sending of unstructured random input to an application program." -- B.P. Miller

"A highly automated testing technique that covers numerous boundary cases using invalid data (from files, network protocols, API calls, and other targets) as application input to better ensure the absence of exploitable vulnerabilities. The name comes from modem applications’ tendency to fail due to random input caused by line noise on “fuzzy” telephone lines.
...
A complete fuzzer iteration, starting from generation: The fuzzer begins by getting semi-valid data via one of the two main methods for use in testing: generation or mutation. The fuzzer then submits the data and tracks whether the erroneous input causes the application to crash (in which case, it saves the data for later analysis). If not, the fuzzer automatically proceeds to the next iteration." -- Peter Oehlert

"Fuzzer: A fuzzer is a program that attempts to discover security vulnerabilities by sending random input to an application. If the program contains a vulnerability that can lead to an exception, crash or server error (in the case of web apps), it can be determined that a vulnerability has been discovered. Fuzzers are often termed fault injectors for this reason: they generate faults and send them to an application. Generally fuzzers are good at finding buffer overflow, DoS, SQL injection, XSS, and format string bugs. They do a poor job of finding vulnerabilities related to information disclosure, encryption flaws, and any other vulnerability that does not cause the program to crash." -- Jack Koziol

"Fuzzing is an automated software testing technique that generates and submits random or sequential data to various areas of an application in an attempt to uncover security vulnerabilities. For example, when searching for buffer overflows, a tester can simply generate data of various sizes and send it to one of the application entry points to observe how the application handles it." -- Sacha Faust

"For those who hid themselves successfully so far from the hype, the term fuzzing is, well, not defined. Everyone talks about it, but it means different things to different people. In general, it means throwing semi-valid data against a system to automate security testing. Many of today's issues in input parsers can be uncovered by constantly throwing data at them and watching them crash at some point in time. Supposedly, many people find their 0day bugs that way. The result is that more and more fuzzing tools appear, talks are held on conferences and people start to think that they can secure their products if they just sit long enough in the line-of-fire of a fuzzer before being shipped." -- FX

"Fault injection or fuzzing is not a completely independent technique. Fault injection is normally combined with automated runtime analysis tools in order to simulate the use of the targeted software. The word fuzzing comes from fuzz[3], the first fault injection tool dedicated to uncovering buffer overflows. This naive but efficient approach to finding buffer overflows is simply to supply long arguments or inputs to a program and see what happens. Fuzzers like Spike[2] and Peach[8] are both available for this task. Other tools like PROTOS[20] or Security Bug Catcher[22], much closer to fault injection than fuzzing, are more complex. Using a complete description of a protocol and an automated finite state machine of the program, they are able to detect whether sensitive states like authentication can be bypassed. Thus, if an authentication procedure is bypassed due to a buffer overflow or design error, this kind of tool can detect it. Unfortunately, these tools must have a complete description of the protocols and states of the audited software, which represents hard and long manual work.
...
Black box testing with fault injection and stress testing, i.e. fuzzing, is an approach whereby an auditor uses sets of scripts designed to feed a program various inputs of different sizes and structures. It is usually possible to specify how this input should be constructed and perhaps how the tool should change it according to the program's behavior.
...
The cardinality of the potential space of inputs defines the complexity of fault injectors: fuzzers basically substitute smaller, bigger, and malformed strings or values for variables. By using a random character string generator, Fuzz has an infinite potential space of inputs. ... Substituting variables with random values is *irrelevant*. Using a finite library of substituted strings or values drastically reduces the size of the potential space of inputs." -- Martin Vuagnoux
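Vuagnoux's finite-library approach might look like the following sketch; the attack values and the `{VAR}` placeholder convention are invented for illustration:

```python
# Instead of random strings, substitute each variable with entries from a
# small, finite library of known-interesting attack values.
ATTACK_LIBRARY = [
    b"A" * 1024,                  # long string, classic buffer-overflow probe
    b"%n%n%n%n",                  # format-string specifiers
    b"\x00",                      # embedded NUL byte
    b"-1", b"0", b"2147483648",   # integer boundary values
]

def test_cases(template: bytes, marker: bytes = b"{VAR}"):
    """Yield one test case per library entry by replacing the placeholder."""
    for value in ATTACK_LIBRARY:
        yield template.replace(marker, value)

cases = list(test_cases(b"USER {VAR}\r\n"))
# The input space per variable is now |ATTACK_LIBRARY|, not infinite.
```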

"Fuzzing in the security context is often used to find buffer overflows, format string bugs, and integer overflows. These sorts of application errors can typically be detected by monitoring application crashes." -- Dave Aitel

"Fuzz testing or fuzzing is a software testing technique. The basic idea is to attach the inputs of a program to a source of random data ("fuzz"). If the program fails (for example, by crashing, or by failing built-in code assertions), then there are defects to correct." -- Wikipedia

---------------------------------------------------------
Both Miller and Jack mention randomness. But Jack mentions security. Peter also mentions security, but not randomness; he says invalid data. Some use the terms fault injection or stress testing. Does the way we make data invalid HAVE to include randomness? Vuagnoux prefers a list of attacks rather than random data.

Hmm....so what is the definition of fuzzing? (This is important for deeper academic study. Is it also important in practice?) All definitions point to a search for bugs -- and all are essentially performing black box testing. The unstructured, ad hoc, random nature of this testing (relatively cheap) makes it different....sometimes. But then Peter & Mike come along, with access to source, design a more structured/targeted fuzzer, and make fuzzing part of the Microsoft development process (not ad hoc and not as cheap), and they still call this fuzzing. It's clear that pure randomness is only good "one level deep". We need to add some structure (protocol knowledge) to advance multiple layers into any non-trivial protocol. I think this is why B.P. Miller never had to face this issue: he was always fuzzing command-line args or mouse/key events, so there was never a notion of multiple layers.
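The "one level deep" problem can be illustrated with a sketch. Assume a hypothetical protocol whose outer layer checks a length prefix and a CRC32 trailer: purely random bytes die at that check, while a fuzzer with this small bit of protocol knowledge re-frames its mutated payload so it reaches the next parser:

```python
import random
import struct
import zlib

def frame(payload: bytes) -> bytes:
    """Wrap a payload in a hypothetical length+CRC32 frame so the outer
    parser accepts it and the (possibly malformed) payload goes deeper."""
    return (struct.pack(">I", len(payload)) + payload +
            struct.pack(">I", zlib.crc32(payload)))

def fuzz_payload(valid: bytes) -> bytes:
    """Corrupt one byte of an otherwise valid inner payload."""
    data = bytearray(valid)
    data[random.randrange(len(data))] ^= 0xFF
    return bytes(data)

# Random bytes would fail the length/CRC check and never get past the first
# layer; re-framing the mutated payload carries it into the second layer.
packet = frame(fuzz_payload(b"LOGIN admin"))
```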

Fuzzing is a testing technique (usually software) used to find bugs. That's about the only consensus currently. So my question is to all you fuzzers and software testers: what is the difference between fuzzing and software testing? (If someone knows of a good software testing list, please forward this on.) In theory, they seem very similar. In practice, the security focus of fuzzing, often applied by a second party, has made it effective in finding exploitable bugs. But more academically, what is the difference between the two (or the definition of each)?

The flip side of fuzzing (and testing) is determining when a fault has occurred. This seems to receive less attention than how input data is malformed. Here's an important question when defining fuzzing: What are the different (current and future) methods/trends for detecting when a failure/fault/bug has been found?
------------------------------------------------------------

"As in previous studies, our measure of reliability is simple and crude: the absence of a crash or hang. The command line tests run from scripts and check for the presence of a core file (crash) or non-responsiveness based on a time-out (hang). Of course, if the program completes without a crash or hang, but prints nonsensical results, we do not classify that as a failure. *Other types of testing* are better equipped for such failures. For GUI applications, the technique is similar. We run the applications under fuzz-aqua, checking for a system-generated crash log or timing-out if the program hangs." - B.P. Miller
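Miller's crash-or-hang oracle maps naturally onto a few lines of Python. This sketch assumes a POSIX target, where a negative return code means death by signal and a timeout counts as a hang; nonsensical-but-clean output is deliberately classified as "ok", matching his crude-by-design measure:

```python
import subprocess

def classify(target: list[str], case: bytes, timeout: float = 5.0) -> str:
    """Miller-style reliability oracle: only crashes and hangs count as
    failures; wrong-but-clean output is not our problem here."""
    try:
        proc = subprocess.run(target, input=case, timeout=timeout,
                              stdout=subprocess.DEVNULL,
                              stderr=subprocess.DEVNULL)
    except subprocess.TimeoutExpired:
        return "hang"
    # A negative return code on POSIX means the process was killed by a signal.
    return "crash" if proc.returncode < 0 else "ok"
```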

"...the Windows operating system uses exception handling to signal failure cases to an application and to other parts of the OS. A debugger can see these exceptions, so building one in to a fuzzer allows it to determine when an application crashes. Other ways to check application correctness during fuzzing include looking for spikes in the program’s memory usage or CPU utilization ... From these simple failure models, we can envision more complex and complete failure and success models that actually examine the system to ensure that it is working correctly after parsing malformed data. I recommend using an extensible pattern in the fuzzer to implement checks for the success or failure of applications parsing the malformed data. This is true particularly because what constitutes success and failure will change over the fuzzer’s lifetime." -- Peter Oehlert
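Oehlert's "extensible pattern" could be sketched as a registry of pluggable failure checks; the report fields and the memory threshold here are invented for illustration:

```python
# Failure checks are pluggable callables, so new success/failure models can
# be added over the fuzzer's lifetime without touching the core loop.
from typing import Callable, Optional

Check = Callable[[dict], Optional[str]]   # returns a failure label or None

def crashed(report: dict) -> Optional[str]:
    return "crash" if report.get("exception") else None

def memory_spike(report: dict) -> Optional[str]:
    return "memory-spike" if report.get("rss_mb", 0) > 512 else None

CHECKS: list[Check] = [crashed, memory_spike]   # extend as models evolve

def evaluate(report: dict) -> list[str]:
    """Run every registered check against one iteration's monitoring report."""
    return [label for chk in CHECKS if (label := chk(report))]
```

Registering a new check is a one-line append to `CHECKS`, which is the point of the pattern: what constitutes success and failure changes over the fuzzer's lifetime.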

Jared

