Actions

icon Post
text/html Subscribe
text/html Unsubscribe

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [cxx-abi-dev] gcc unwind ABI change for forced unwind


  • To: "Dennis Handly" <dhandly@xxxxxxxxxx>
  • Subject: Re: [cxx-abi-dev] gcc unwind ABI change for forced unwind
  • From: Dave Butenhof <David.Butenhof@xxxxxx>
  • Date: Thu, 22 May 2003 10:25:38 -0400

Greetings.

I've been forwarded part of this discussion and asked to comment. For those who don't know me, the relevant context is probably that I know nothing about gcc internals, a little about its external manifestation, a moderate amount about C++ syntax and semantics without being by any means a C++ expert, and I know rather a lot about threads and POSIX. I'm the principal architect for the POSIX threads library on Tru64 UNIX and OpenVMS, where cancel and thread exit are exceptions implemented using the system libexc library -- also used by C++, Ada, and others for their exceptions. That said, I have both opinions and a fair amount of experience behind (and often in front of) those opinions. ;-)

>From: Cary Coutant <cary@xxxxxxxxxx>
Content-Transfer-Encoding: 7bit
Message-Id: <A34F2B63-8BD7-11D7-8E8F-003065589C02@xxxxxxxxxx>
X-Mailer: Apple Mail (2.552)
Subject: [cxx-abi-dev] gcc unwind ABI change for forced unwind
Status: RO

Many of you are probably aware of (and several of you participated in)
a discussion thread on the gcc-patches mailing list about a new unwind
API that Richard Henderson had to add to support forced unwinds
resulting from (among possibly other things) thread cancellation. I
thought it would be appropriate to bring this issue to this mailing list.

Courtesy of Jim Wilson (who posted a note to the libunwind mailing
list, which brought it to my attention), here are some pointers to the
discussion threads leading up to this.

http://gcc.gnu.org/ml/gcc-patches/2003-04/msg00008.html
http://gcc.gnu.org/ml/gcc-patches/2003-04/msg02246.html
http://gcc.gnu.org/ml/gcc-patches/2003-05/msg00473.html
http://gcc.gnu.org/ml/gcc-patches/2003-05/msg00160.html

As I understand the central issue, we would like to run C++ cleanups on
a thread cancellation, in addition to the cleanups registered through
the POSIX C bindings to the pthreads library. Cleanups resulting from
local automatic objects that need destruction are easy, but the problem
is what to do about catch(...) blocks. Richard's approach was to end
such blocks with a call to the new API, "_Unwind_Resume_or_Rethrow()",
if the block did not already end with a rethrow.

I think Jason Merrill hit the nail on the head when he said (on 4/30):

> The problem is that catch(...) is overloaded in C++.  It's used both for
> code that wants to write a cleanup inline and rethrow and for code that
> wants to trap all exceptions.

There was some discussion about whether catch(...) blocks should run at
all when doing a forced unwind, and whether forced unwinds should be
allowed to penetrate a function declared throw(). I think I saw a
consensus on the latter issue that thread cancellation and
longjmp_unwind are not really exceptions, and must be allowed to
proceed. On the former issue, however, there didn't seem to be a clear
resolution.

Personally, I dislike the attempt to separate "cleanup" from "finalization" (handle/catch). I don't like the idea of an exception that can't be finalized, because it reduces the application's ability to control behavior.

Our cancel exception is in every way a normal exception (though there doesn't happen to be a C++ *name* for this exception). By CONVENTION, we declare that it should normally be handled via "finally" clauses rather than "catch" clauses, because it expresses some component's desire to terminate the thread and that should usually be honored. Similarly, a longjmp_unwind type operation shouldn't usually be finalized until it propagates to the target frame.

But there are always, er, "exceptions".

For example, back in the early days of DCE and DCE threads, we see the RPC component servers running what amounts to a remote extension of the client's call stack. The client makes an RPC call, and a remote server application fires up the server side of that call. It does this inside a managed server thread. If the server were to raise an exception, such as cancel, it should propagate and clean up the subset of the call stack that is logically a part of the client's call... but the managed server logic must be able to finalize the exception and marshall it back to the client so it can be made aware of what happened. And there's no need to unwind/cleanup any further, because the managed server thread can live to serve again.

A catch(...) is a catch. A catch(...) that happens to end in a throw isn't fundamentally different. It does represent a sort of ambiguity in the language model, though. Really, "cleanup" in C++ is a destructor, whereas catch() is for finalization. But as most people who initially took up C++ had learned their exception model on another language, such as Ada or Modula-2, the idiom 'catch (...) {throw;}' looked a lot like 'finally{}' and provides a familiar hook for frame-based (rather than object-based) cleanup.

If someone's going to take this idiom more seriously, to the point of defining radically divergent behavior, you'd be far better off adding a true 'finally' keyword that makes it obvious. However, I'd prefer to see emphasis on the "pure C++" model that cleanup is done in local object destructors, while catch() is really for finalization. (That is, there may be reasons for 'catch(...) { if ( ... ) throw; else ... ; }' but 'catch(...) { ... ; throw; }' should be, at least, strongly discouraged.)

Ideally, one would take the position that good C++ code would
encapsulate any cleanups it needs into local automatic objects, so that
the compiler-generated cleanups would invoke the destructor. Real code,
however, doesn't seem to work that way -- we see catch(...) blocks
written with the intent to do cleanups. Given this real code, we should
try to run those cleanups. But what happens when we hit a catch(...) of
the other flavor -- the kind that just want to catch all exceptions?
Ideally, we wouldn't want to run them at all on a forced unwind, since
they're exception handlers, not cleanups. Without Richard's approach,
if we execute such a block on a forced unwind, and that block doesn't
end with a rethrow, the forced unwind doesn't resume (until, in the
case of thread cancellation, the thread next reaches a cancellation
point, and the process gets repeated). With Richard's new routine, a
forced unwind gets the opportunity to rethrow, while a normal exception
gets to resume execution.

There's nothing wrong with finalizing (catching) a cancel, thread exit, or even a longjmp_unwind... if that's what the application intended, and if the designers knew what they were doing. And if not... that's not a language issue, it's an application issue.

I realize that, pragmatically, there are complications when adding something like this after the fact. There will be applications busted when/if cancel becomes a true exception shared with C++, because someone will have catch(...) blocks that simply assume they'll never see an unexpected (new) exception. I call that bad design, but that's life.

There are those who have all along argued that cancel (and thread exit) should run C++ destructors but not be catch()-able. One (weird) option might be to make them catchable by "name", but not anonymously... they'd ignore any catch(...) clauses. I don't like it, but it'd solve the compatibility issue without preventing savvy code from finalizing a cancel or exit where it really makes sense in the context of the application. The same could be done for longjmp_unwind. It might be nice to have a standard way to catch all "forced_unwind" exceptions without needing to name each one.

Essentially, that twists the usage of "forced unwind" around a bit; they CAN be finalized, but only by code that at least explicitly states (correctly or not) that it knows what it's doing.

Throw specs add another interesting wrinkle. If cancel/exit/unwind are "exceptions", then it's illegal to propagate through an empty throw() spec, or any that doesn't identify them. Which means that everything calling a cancellation point needs to propagate the throw(cancel). (And don't forget thread exit! And what about longjmp_unwind?) Many think there's no need to worry about that if they're "forced unwinds" instead of "exceptions", but that depends a lot on the semantic intent of the throw() spec. Should it really be taken to be literally only "C++ exceptions", or should it be taken as a limitation on the reasons for abnormally unwinding that frame? That is, when I call a routine with a throw() spec, should I not be expecting that control will return to me only when the called function returns normally or when one of the listed throw conditions occurs? The "forced unwind" isn't transparent -- it'll run destructors, maybe at least some catch(...) clauses. There's not much use in throw() specifications if they can be so easily violated. (E.g., as someone pointed out, empty throw() clauses might provide an optimization opportunity to avoid generating unwind information... if unwinds can occur anyway you can't do that.)

The subject here is extending C++ to know about and somehow rationally deal with foreign exceptions (and/or unwind if you really want to make that distinction). The language semantics, or at least usage conventions, (and perhaps syntax), NOT just the runtime, needs to be changed to do that in a way that's consistent and useful.

--
/--------------------[ David.Butenhof@xxxxxx ]--------------------\
| Hewlett-Packard Company       Tru64 UNIX & VMS Thread Architect |
|     My book: http://www.awl.com/cseng/titles/0-201-63392-2/     |
\----[ http://homepage.mac.com/dbutenhof/Threads/Threads.html ]---/