33-Year-Old Unix Bug Fixed In OpenBSD

Please create an account to participate in the Slashdot moderation system

33-Year-Old Unix Bug Fixed In OpenBSD 162

Posted by kdawson on Tuesday July 08, 2008 @08:10PM from the yet-another-stack-overflow dept.

Ste sends along the cheery little story of Otto Moerbeek, one of the OpenBSD developers, who recently found and fixed a 33-year-old buffer overflow bug in Yacc. "But if the stack is at maximum size, this will overflow if an entry on the stack is larger than the 16 bytes leeway my malloc allows. In the case of of C++ it is 24 bytes, so a SEGV occurred. Funny thing is that I traced this back to Sixth Edition UNIX, released in 1975."

This discussion has been archived. No new comments can be posted.

33-Year-Old Unix Bug Fixed In OpenBSD

Search 162 Comments Log In/Create an Account

Comments Filter:

Yeah, it's probably you. (Score:3, Informative)

by Estanislao Mart�nez ( 203477 ) writes: on Tuesday July 08, 2008 @08:42PM (#24108943) Homepage

I bet you they're not talking about the system stack pointer. Remember, yacc is a parser generator; parsing algorithms always use some sort of stack data structure. So, the "stack pointer" in question is just a plain old pointer, pointing into a stack that yacc's generated code uses.

Parent Share
twitter facebook
Re:Great! (Score:5, Informative)

by Dadoo ( 899435 ) writes: on Tuesday July 08, 2008 @09:16PM (#24109285) Journal

While I'm sure you're trolling, I feel I should point out that, 1) I agree with you, and 2) this has apparently been fixed, on Linux:
http://agnimidhun.blogspot.com/2007/08/vi-editor-causes-brain-damage-ha-ha-ha.html [blogspot.com]

Parent Share
twitter facebook
Re:Other Unixes (Score:5, Informative)

by X0563511 ( 793323 ) writes: on Tuesday July 08, 2008 @09:19PM (#24109313) Homepage Journal

Yes. But OpenBSD fixed it, so they get credit for the fix. It's up to the maintainers of the other unix(ish) versions to implement the fix.

Parent Share
twitter facebook
Re:Yeah, it's probably you. (Score:1, Informative)

by Anonymous Coward writes: on Tuesday July 08, 2008 @09:21PM (#24109351)

Exactly. The code:yym = yylen[yyn]; yyval = yyvsp[1-yym];
This is one of the reasons that I hate C code (but I love it most of the time). If your stack was an object (preferably a STL vector), bugs like this wouldn't arise in a way that they could be exploited (your program would instead terminate with an uncaught exception that would point you exactly where your bug was).

Parent Share
twitter facebook
Re:Yeah, it's probably you. (Score:3, Informative)

by Skrapion ( 955066 ) writes: <skorpionNO@SPAMfirefang.com> on Tuesday July 08, 2008 @10:07PM (#24110061) Homepage

Actually, the [] operator of an STL vector doesn't throw any exceptions, and will happily allow you to reference an index which is out of bounds.
That's not a bad thing, because it's more efficient when you already know that your index is in rage. But if you don't know that, you're better off using the at() function.

Parent Share
twitter facebook
Re:Was it really a bug back then? (Score:3, Informative)

by jd ( 1658 ) writes: <imipak@yahoGINSBERGo.com minus poet> on Tuesday July 08, 2008 @10:09PM (#24110085) Homepage Journal

It would have been a bug, but not necessarily one that would have security implications, though that could be system-dependent. The summary mentions a specific malloc was used to get a segfault. Another malloc library may well not have faulted. That would only matter if it was possible via the buffer overflow to get yacc to do something (such as run your code) with privileges other than those you would ordinarily have had.
Now, looking at it just as a bug, if the yacc script overflowed the buffer, yacc can either stop cleanly or crash untidily. It has the same effect - nothing much happens - unless, for some weird reason, the kernel holds onto the memory. That would be a kernel bug, though, the yacc bug would merely be a catalyst for exposing it.

Parent Share
twitter facebook
Comment removed (Score:1, Informative)

by account_deleted ( 4530225 ) writes: on Tuesday July 08, 2008 @11:12PM (#24110957)

Comment removed based on user account deletion

Parent Share
twitter facebook
Re:Great! (Score:3, Informative)

by drinkypoo ( 153816 ) writes: <drink@hyperlogos.org> on Tuesday July 08, 2008 @11:15PM (#24110987) Homepage Journal

Instead of "ls a*"? Seriously? Hopefully, someone will mod you funny.
Unix has extremely low overhead spawning processes. If you prelink and have a little cache this is plenty fast :P
Seriously though, this is a serious annoyance in the way Unix does business. Shell globbing is very convenient for programmers, but not so convenient for users in an awful lot of situations.

Parent Share
twitter facebook
Re:You do realize.. (Score:5, Informative)

by QuantumG ( 50515 ) * writes: <qg@biodome.org> on Tuesday July 08, 2008 @11:59PM (#24111405) Homepage Journal

yacc is not a compiler,
Excuse me?
Yet Another Compiler Compiler most definitely is a compiler.

Parent Share
twitter facebook
Re:Great! (Score:5, Informative)

by Just Some Guy ( 3352 ) writes: <kirk+slashdot@strauser.com> on Wednesday July 09, 2008 @12:18AM (#24111601) Homepage Journal

if you want ls -l style output, "find -name 'a*' -exec ls -l {} \;"
Yeah, because nothing endears you with the greybeards like racing through the process table as fast as possible. Use something more sane like:
$ find -name 'a*' -print0 | xargs -0 ls -l

which only spawns a new process every few thousand entries or so.

Parent Share
twitter facebook
Re:Great! (Score:5, Informative)

by Jeffrey Baker ( 6191 ) writes: on Wednesday July 09, 2008 @12:19AM (#24111613)

It's both. The kernel is responsible for setting up the execution environment, and in the past it used a fixed 32 pages for the arguments. 32 pages on an ordinary PC is 128KiB, which is the old limit. The new limit is that any one argument can be up to 32 pages, and all the arguments taken together can be 0x7FFFFFFF bytes, which is ~2GiB.
Here's the diff: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commitdiff;h=b6a2fea39318e43fee84fa7b0b90d68bed92d2ba;hp=bdf4c48af20a3b0f01671799ace345e3d49576da [kernel.org]
After that, it was up to libc people to fix the globbing routines. Ulrich Drepper, taking some time off from his full-time job of being an asshole on mailing lists, managed to work this into glibc 2.8:
http://sourceware.org/ml/libc-alpha/2008-04/msg00050.html [sourceware.org]

Parent Share
twitter facebook
Re:You do realize.. (Score:4, Informative)

by wb8wsf ( 106309 ) writes: <steve@wb8wsf.org> on Wednesday July 09, 2008 @12:45AM (#24111815)

OpenBSD still uses GCC, version 3.3.5 on i386. I can't say which version is used on the other platforms.
You are talking of PCC, which is being worked on by some of the OpenBSD developers, but I think its a parallel project, see http://pcc.ludd.ltu.se/
for more information.
Jem Matzen talked of this too, see http://www.thejemreport.com/mambo/content/view/369/

Parent Share
twitter facebook
Re:Yeah, it's probably you. (Score:3, Informative)

by setagllib ( 753300 ) writes: on Wednesday July 09, 2008 @12:59AM (#24111949)

Best of all, even if you use assert() (or similar) for really explicit bounds checking, GCC will omit it from code paths where it's deemed to be unused. So if your accesses are being inlined (and if they're not, take a long hard look at your life) then the already-safe paths won't have the check overhead even in a debug build.
Yes, I've tested it. Yes, it's impressive.

Parent Share
twitter facebook
Re:Great! (Score:3, Informative)

by QuoteMstr ( 55051 ) writes: <dan.colascione@gmail.com> on Wednesday July 09, 2008 @03:09AM (#24113097)

On modern systems, find -name 'a*' -exec ls -l {} +
Personally, however, I prefer find -name a\* -exec ls -l {} +
Also, you probably want to add a -type f before the -exec, unless you also want to list directories.
Either that, or make the command ls -ld to not list the contents of directories.

Parent Share
twitter facebook
Re:Great! (Score:3, Informative)

by evilviper ( 135110 ) writes: on Wednesday July 09, 2008 @03:27AM (#24113221) Journal

I only want to delete files I'm sure are in the archive. How would I do that?
tar tf archive.tar | while read FILENAME ; do rm "$FILENAME" done

Parent Share
twitter facebook
Re:Yeah, it's probably you. (Score:5, Informative)

by tomhudson ( 43916 ) writes: <barbara,hudson&barbara-hudson,com> on Wednesday July 09, 2008 @07:46AM (#24114675) Journal

From the link you cited:
By 1971, our miniature computer center was beginning to have users. We all wanted to create interesting software more easily. Using assembler was dreary enough that B, despite its performance problems, had been supplemented by a small library of useful service routines and was being used for more and more new programs. Among the more notable results of this period was Steve Johnson's first version of the yacc parser-generator [Johnson 79a].

The code for yacc was certainly not originally written in c - c didn't exist at that time.
In 1978 Brian Kernighan and I published The C Programming Language [Kernighan 78]. Although it did not describe some additions that soon became common, this book served as the language reference until a formal standard was adopted more than ten years later.

The "archaic behaviour" was never part of that standard - it was a mistake in early implementations while they were still "working out the details" of the language, well before K & R, as Ritchie says:
After the TMG version of B was working, Thompson rewrote B in itself (a bootstrapping step). During development, he continually struggled against memory limitations: each language addition inflated the compiler so it could barely fit, but each rewrite taking advantage of the feature reduced its size. For example, B introduced generalized assignment operators, using x=+y to add y to x. The notation came from Algol 68 [Wijngaarden 75] via McIlroy, who had incorporated it into his version of TMG. (In B and early C, the operator was spelled =+ instead of += ; this mistake, repaired in 1976, was induced by a seductively easy way of handling the first form in B's lexical analyzer.)

It wasn't an archaism in c - it was an archaism from b that was removed during the development of what became c. Small difference, and for all practical purposes, it gives the same result - previously-working code that wasn't reviewed as the language evolved towards a standard ended up with "implementation-dependent behaviour" - bugs ... The worst part is that the buggy code is syntactically correct, so no compiler warnings. Of course, if your conforming compiler doesn't give a warning, you assume that the code written with the experimental versions is still valid.

Parent Share
twitter facebook
Comment removed (Score:2, Informative)

by account_deleted ( 4530225 ) writes: on Wednesday July 09, 2008 @06:51PM (#24126521)

Comment removed based on user account deletion

Parent Share
twitter facebook

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

33-Year-Old Unix Bug Fixed In OpenBSD 162

33-Year-Old Unix Bug Fixed In OpenBSD More Login

33-Year-Old Unix Bug Fixed In OpenBSD

Yeah, it's probably you. (Score:3, Informative)

Re:Great! (Score:5, Informative)

Re:Other Unixes (Score:5, Informative)

Re:Yeah, it's probably you. (Score:1, Informative)

Re:Yeah, it's probably you. (Score:3, Informative)

Re:Was it really a bug back then? (Score:3, Informative)

Comment removed (Score:1, Informative)

Re:Great! (Score:3, Informative)

Re:You do realize.. (Score:5, Informative)

Re:Great! (Score:5, Informative)

Re:Great! (Score:5, Informative)

Re:You do realize.. (Score:4, Informative)

Re:Yeah, it's probably you. (Score:3, Informative)

Re:Great! (Score:3, Informative)

Re:Great! (Score:3, Informative)

Re:Yeah, it's probably you. (Score:5, Informative)

Comment removed (Score:2, Informative)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot