> Now, if the attacker has an off-by-one corruption with a small value (NUL or \x01 - \x07) that hits the lowest significant byte of a length (malloc_chunk->size), the attacker can only use that to cause the length to effectively shrink. This is because all heap chunks are at least 8 bytes under the covers. Shrinking a chunk's length means it will never match the prev_size stored at the end of that chunk. Even if the attacker deploys their one byte overflow multiple times, this new check should always catch them.
Is the LSB of the heap chunk size always >= 8?
What about a malloc_chunk->size that is a multiple of 256? (Or anything else with an LSB less than 7.) With a one-byte overflow of one of those, the attacker could make the allocator think the size is up to 7 bytes larger than the real chunk.
/* Get size, ignoring use bits */
#define chunksize(p) ((p)->size & ~(SIZE_BITS))
So you really can't increase the size by less than 8. However, I know what you're now thinking: an attacker with a 1-byte overflow can mess with the flags! That would be a topic for another blog post, but I'm not aware of any techniques where messing with the flags would permit a clean ASLR bypass.
The last 3 bits of this field contain flag information.
PREV_INUSE (P) – This bit is set when the previous chunk is allocated.
IS_MMAPPED (M) – This bit is set when the chunk is mmap'd.
NON_MAIN_ARENA (N) – This bit is set when the chunk belongs to a thread arena.
It certainly doesn't look like those could be used against ASLR.
No doubt. However, if the single byte off the end is reliably accessible, then programs may come to rely on it by accident. If a program is allocating n but using n+1, then a single-byte overflow would access n+2 and the problem repeats. Better to have that single byte off the end be reliably crashy to touch, but not exploitable.
You'd also incur substantial space overhead for small allocations in many cases. I'm not familiar with Linux's implementation, but on the Mac, for example, all allocations are a multiple of 16 bytes. It's common to allocate 16 or 32 bytes for small objects, so padding the allocation by one byte will bump you up to 32 and 48 bytes respectively.
One of the funniest things I've seen in code was in PHP core a decade ago: they had a buffer underflow where they would overwrite arr[-1] with some character. Their solution was to save the contents of arr[-1] before the loop, then restore it afterwards.
You could do that at the compiler level, and only for types that end in a flexible array member, or only when malloc's argument looks like "sizeof (T) + x". That would generally avoid the space overhead, and in the flexible array member case you could e.g. add a whole int (4 bytes) for an int-typed flexible array member. But I am not sure it's a good idea, for the other reason you mentioned.
This is true. But in the case where the malloc heap metadata is under attack, the attacker will usually just allocate exactly the right size to ensure that the off-by-one goes off the end of the chunk, instead of into slack space.
But Rust also must have an allocator under the hood that is unsafe, and Rust apps can call C libraries or the C kernel, so why do I see the Rust strike team complaining when something they use indirectly is improved?
There is a big difference between using a programming language where unsafe code is explicit and easy to track down, versus one where each line of code is a possible security exploit.
Also Rust isn't the only option to write more secure code, it was already possible before C was even created using Algol and PL/I variants.
Quote from Tony Hoare's ACM award article in 1981, regarding Algol use in the industry, a programming language almost 10 years older than C.
"A consequence of this principle is that every occurrence of every subscript of every subscripted variable was on every occasion checked at run time against both the upper and the lower declared bounds of the array. Many years later we asked our customers whether they wished us to provide an option to switch off these checks in the interests of efficiency on production runs. Unanimously, they urged us not to--they already knew how frequently subscript errors occur on production runs where failure to detect them could be disastrous. I note with fear and horror that even in 1980 language designers and users have not learned this lesson. In any respectable branch of engineering, failure to observe such elementary precautions would have long been against the law."
Yes, there are many languages that are safer, and even C++ collections can be used safely, but you don't see Java/C# devs popping up in a C/C++-related thread to mention their favorite language yet again.
Btw, there are also languages that are safer than Rust, and you don't see those people asking others not to use Rust; again, it's the better tool for the job (where in most cases the project is a huge one and already done).
Which means you missed all that BBS and USENET bashing fun.
No, bashing C has been a common practice among those of us on the memory-safe side of the fence since the early days.
Take the paper "A History of CLU"[0] describing how CLU was designed and implemented in 1975.
"I believe this is a better approach than providing a generally unsafe language like C, or a language with unsafe features, like Mesa [Mitchell, 1978], since it discourages programmers from using the unsafe features casually."
There are tons of other examples, all available in old papers, BBS and USENET archives.
Thanks, I will read it. So are you of the opinion that there is no job for which C is the best tool? Btw, I am not a C developer and I would never use C except if I were asked to work on a project that already uses C. I would use C++ with Qt for a GUI, though.
Exactly, C only became widely adopted by the industry thanks to AT&T only being allowed to charge a symbolic price for UNIX and making the source code available to universities.
Which 80's startups like Sun and SGI used as basis for their workstation OSes.
Bjarne created C++, because after having to use BCPL instead of Simula to finish his PhD, he never wanted to work like that ever again.
So C with Classes started as a tool for Bjarne to target C, while staying productive and able to write safer code.
Rust actually uses a different allocator, jemalloc, which doesn't store metadata inline like ptmalloc does. So while an overflow could overwrite other heap-stored data, it wouldn't overwrite heap metadata or result in a vulnerability from the allocator code.
Granted, if you link in or call code that uses ptmalloc (glibc's malloc) from Rust, it is still an issue, but unsafe code in Rust itself won't be vulnerable to this sort of attack.
No. Technically that's why "safe" languages were invented.
Rust is one of the worst examples of those, as you can hardly call Rust safe. Only Rust fanboys do so.
Pascal, Ada, ATS, Java, Go, D, Pony and all of the Lisp and functional languages are safe.
Do you see the problem now? You have a whole chapter about unsafe, with 4 major cases. Stdlib is full of unsafe.
And you don't even talk about unsafe threaded code, one of the biggest safety problems nowadays. Memory safety has been solved for decades with GC; concurrency safety, too, for a few years now.