Bug ID: JDK-4388912 single-step over 'new' doesn't work right

JDK-4388912 : single-step over 'new' doesn't work right

Type: Bug
Component: vm-legacy
Sub-Component: jvmdi
Affected Version: 1.4.0

Priority: P3
Status: Closed
Resolution: Fixed
OS: solaris_2.6
CPU: sparc

Submitted: 2000-11-14
Updated: 2002-08-30
Resolved: 2002-08-30

Versions (Unresolved/Resolved/Fixed)

The Version table provides details related to the release that this issue/RFE will be addressed.

Unresolved : Release in which this issue/RFE will be addressed.
Resolved: Release in which this issue/RFE has been resolved.
Fixed : Release in which this issue/RFE has been fixed. The release containing this fix may be available for download as an Early Access Release or a General Availability Release.

To download the current JDK release, click here.

Other	Other
1.3.1 rc1Fixed	1.4.0Fixed

Related Reports

Relates :	JDK-4419723 - Line number information for try-catch statement is not sufficient
Relates :	JDK-4421456 - Reg-test StepTest.java Failing

Description

I'm seeing a case where ClassLoader.checkPackageAccess is invoked
multiple times for a single new (it is "called out" of the VM) and 
this messes up single-stepping.

cd ~ivan/for/checkPackageAccess (or get the stuff from the bug attachments)
First check if you can issue:
	java -classpath ".:unix.jar:term.jar" TermApp
(to make sure that LD_LIBRARY_PATH (for JNI) and other stuff all works).

Then go through the scenario described in "in" using jdb.
It demonstrates how for a particular 'new' ClassLoader.checkPackageAccess is
called many many times. It isn't stuck in an infinite loop;
eventually progress is made. But eventually jdb ends up
getting confused as described in "in".

Now one expects that when you single step over an opcode that if it's
an invoke one ends up at the invoked function and otherwise at the next opcode,
_not_ in the system internals.

Now this sort of situation happens in native applications as well, for
example when dispatching through a PLT, thunks, or an objective C method
dispatcher, so having a conceptually simple operation take a circuituous
route to get to the destination is not a new thing however ...

- in the native situation all the dispatching is done at the 
  machine architecture level and there are no magical nested invokation
  from within the microcode into assembly inctructions.

- Often to get src-level single stepping to work right the runtime
  has to provide information to the debugger to help it skip over the
  dispatching code. For example, on Solaris, librtld_db.so has features
  that help dbx to skip over PLT's. But this kind of scaffolding isn't
  neccessary for assembly-level single-stepping, while, at least
  at first glance, it looks like some kind of feature will need to be added
  to JVMDI to help with this anomaly.

As it happens if a java opcode invokes only one callout things sort-of
work. Suppose the 'new' issued only one call to ClassLoader.checkPackageAccess.
In which case the debugger would detect a "call" into a "system" class
and set up a FRAME_POP event, which fires right before checkPackageAccess
is going to return. Then the debugger will turn on single stepping and take
one more step and be back after where the new was called. This
scenario is perhaps different because jdbx detects a call by seeing if
the PC offset is 0. I expect other debuggers might end up stepping for
a very long time or something like that. For this to work also requires that
a class can be classified as a system class but that is a very subjective
definition.

This scenario also works in the case of invocations, where I believe a callout
is made for loading a class or something. After doing the FRAME_POP thing,
the "one more step" gets up to the invoked destination.

But, I hope you agree, that this is all serendipity as opposed to design.
And the above luck won't hold if you have multiple callouts per opcode,
since the debugger will now have to decide how many callouts to ignore.

I"m also a bit worried that since there is a trend in the VM to move more semantics out of the native VM code and into java that the frequency
of such callouts will increase.

Right now this makes for very unreliable single-stepping. 
I doubt that there is a succinct way to characterise and describe to
a user under what conditions a callout will take place (which explains why my reproducible testcase is so complex). And even if
there was a way it would be pointless since debugger users step reflexively
and having to stop and think about exceptions when stepping doesn't work
will surely drive them to S y s t e m . o u t . p r i n t l n ( ).

   
Having written all of this I went to check whether the JVMDI spec says anything
about what a "single step" means, and it means "The VM has reached a new
location". According to this, there is no bug. However the fact that the 
spec provides no example or mention of the subtleties involved leads me to
believe that the defenition cannot be overgeneralized, never mind that 
there is a practical issue to be dealt with here.

According to that definition, the VM will reach a new location either by:
	- advancing the PC by one op
	- setting the PC according to a branch
	- setting the PC according to a "call"
	- setting the pc according to a thrown exception
	- setting the PC accoding to a "callout"
	- other surprises I haven't run into?
The first three jive with traditional understanding of single-stepping.
The exception case is treated properly it seems (but then I haven't
gotten pas simple stuff).

In the 5th item, debuggers usually require an event or an inference rule
defining when a "call" has happenned. And they require an algorithm for
capturing the return from the said "call". (Note the additional
complication that frame_pop by itself doesn't define this "return" and 
in turn (recursively) requires reliance on this IMO shaky definition of step)

In the case of callouts, 
a) they are indistinguishable from "call"s.
b) there is no concrete way to capture a return from them.
c) If the callout happens as a side-effect of a "call" then there is
   no return even.

Comments

CONVERTED DATA BugTraq+ Release Management Values COMMIT TO FIX: ladybird-rc1 merlin-beta FIXED IN: ladybird-rc1 merlin-beta INTEGRATED IN: ladybird-rc1 merlin-beta VERIFIED IN: ladybird-rc1 merlin-rc1

14-06-2004

EVALUATION mandy.chung@eng 2001-02-14 I can reproduce the scenario described in the attached in file using Ladybird JDK as well as Merlin JDK. It looks like a RFE. mandy.chung@eng 2001-02-16 (I) I saw there are some METHOD_ENTRY events posted but not handled by any event handlers. This is a bug in disableEvents() in eventHandler.c. disableEvents() is called by eventHandler_freeInternal() and becomeDefunc(). When we want to free a HandlerNode, it first calls disableEvents() to check if we need to disable an event notification. Then we clear the selector to delete JNI references and then remove the node from the Handlers chain. disableEvents() will disable an event notification if it determines that the event is not enabled globally and is not enabled for any other threads. The problem is that the event notification is only disabled when countThreadHandlers() == 0 (see code below). In fact, countThreadHandlers() never returns 0 when called from disableEvents() because this node has not yet been removed from the chain and the value returned by countThreadHandlers() here is always >=1. static jint disableEvents(HandlerNode *node) { jint kind = node->kind; jint error = JVMDI_ERROR_NONE; /* code not shown here */ ....... if (error == JVMDI_ERROR_NONE) { jthread thread = requestThread(node); ====> if (countThreadHandlers(kind, thread) == 0) { error = threadControl_setEventMode(JVMDI_DISABLE, kind, thread); } } return error; } (II) A single stepping event that should be posted but JVMDI treats it as a duplicate in the following scenario: 1) EVENT_SINGLE_STEPPING enabled and a method A is invoked (bci = 0). 2) post a single stepping event 3) disable EVENT_SINGLE_STEPPING enable EVENT_METHOD_ENTRY and also request for NotifyFramePop event for the current frame. 4) running for a while, several methods have been called and several method entry events are posted. 5) post a frame pop event requested in step 3. 6) enable EVENT_SINGLE_STEPPING 7) method A is invoked again.... Since from step 2 to 5, there were no single stepping event and no breakpoint events, the jvmdi_thread_state of the thread has not been updated since step 1. Since single_step event was posted in step 1 for method A bci 0, JVMDI mistakenly treats this single stepping event is a duplicate and skip not to post this event.

11-06-2004

SUGGESTED FIX mandy.chung@eng 2001-02-23 Fix for this bug: 1) Fix JPDA backend to deinsert the handler node before calling disableEvents(), which will disable an event notification when the chain is empty. 2) Fix hotspot JVMDI to reset thread location information when single stepping or breakpoint event is disabled either in per thread basis or globally (instead of when waiting for both single-stepping and breakpoint events are disabled globally). See attached webrev. mandy.chung@eng 2001-03-14 A refinement to the fix for this bug was made. See webrev attached in 4421456.

14-03-2001