Bug ID: JDK-8252372 Check if cloning is required to move loads out of loops in PhaseIdealLoop::split_if_with_blocks

JDK-8252372 : Check if cloning is required to move loads out of loops in PhaseIdealLoop::split_if_with_blocks_post()

Type: Enhancement
Component: hotspot
Sub-Component: compiler
Affected Version: 16,17

Priority: P3
Status: Resolved
Resolution: Fixed

Submitted: 2020-08-26
Updated: 2025-02-20
Resolved: 2021-05-26

Versions (Unresolved/Resolved/Fixed)

The Version table provides details related to the release that this issue/RFE will be addressed.

Unresolved : Release in which this issue/RFE will be addressed.
Resolved: Release in which this issue/RFE has been resolved.
Fixed : Release in which this issue/RFE has been fixed. The release containing this fix may be available for download as an Early Access Release or a General Availability Release.

To download the current JDK release, click here.

JDK 17
17 b24Fixed

Related Reports

Duplicate :	JDK-8229015 - Check removal of yank() in PhaseIdealLoop::split_if_with_blocks_post
Relates :	JDK-8269752 - C2: assert(false) failed: Bad graph detected in build_loop_late
Relates :	JDK-8340214 - C2 compilation asserts with "no node with a side effect" in PhaseIdealLoop::try_sink_out_of_loop
Relates :	JDK-8270296 - C2: Another assert(!had_error) failed: bad dominance
Relates :	JDK-8293941 - C2: assert(false) failed: Bad graph detected in build_loop_late
Relates :	JDK-8333393 - PhaseCFG::insert_anti_dependences can fail to raise LCAs and to add necessary anti-dependence edges
Relates :	JDK-8271600 - C2: CheckCastPP which should closely follow Allocate is sunk of a loop
Relates :	JDK-8276064 - CheckCastPP with raw oop input floats below a safepoint
Relates :	JDK-8280600 - C2: assert(!had_error) failed: bad dominance
Relates :	JDK-8273416 - C2: assert(false) failed: bad AD file after JDK-8252372 with UseSSE={0,1}
Relates :	JDK-8335753 - C2: assert(x_ctrl == get_late_ctrl_with_anti_dep(x->as_Load(), early_ctrl, x_ctrl)) failed: anti-dependences were already checked
Relates :	JDK-8336472 - C2: assert(!n->is_Store() && !n->is_LoadStore()) failed: no node with a side effect
Relates :	JDK-8260709 - C2: assert(false) failed: unscheduable graph
Relates :	JDK-8271954 - C2: assert(false) failed: Bad graph detected in build_loop_late
Relates :	JDK-8229483 - Sinking load out of loop may trigger: assert(found_sfpt) failed: no node in loop that's not input to safepoint
Relates :	JDK-8249607 - C2: assert(!had_error) failed: bad dominance
Relates :	JDK-8260420 - C2 compilation fails with assert(found_sfpt) failed: no node in loop that's not input to safepoint
Relates :	JDK-8272562 - C2: assert(false) failed: Bad graph detected in build_loop_late
Relates :	JDK-8274074 - SIGFPE with C2 compiled code with -XX:+StressGCM
Relates :	JDK-8276846 - JDK-8273416 is incomplete for UseSSE=1
Relates :	JDK-8308103 - Massive (up to ~30x) increase in C2 compilation time since JDK 17
Relates :	JDK-8315377 - C2: assert(u->find_out_with(Op_AddP) == nullptr) failed: more than 2 chained AddP nodes?
Relates :	JDK-8338100 - C2: assert(!n_loop->is_member(get_loop(lca))) failed: control must not be back in the loop
Relates :	JDK-8267988 - C2: assert(!addp->is_AddP() \|\| addp->in(AddPNode::Base)->is_top() \|\| addp->in(AddPNode::Base) == n->in(AddPNode::Base)) failed: Base pointers must match (addp 1301)
Relates :	JDK-8269088 - C2 fails with assert(!n->is_Store() && !n->is_LoadStore()) failed: no node with a side effect
Relates :	JDK-8280696 - C2 compilation hits assert(is_dominator(c, n_ctrl)) failed
Relates :	JDK-8286625 - C2 fails with assert(!n->is_Store() && !n->is_LoadStore()) failed: no node with a side effect
Relates :	JDK-8290850 - C2: create_new_if_for_predicate() does not clone pinned phi input nodes resulting in a broken graph
Relates :	JDK-8335709 - C2: assert(!loop->is_member(get_loop(useblock))) failed: must be outside loop
Relates :	JDK-8273115 - CountedLoopEndNode::stride_con crash in debug build with -XX:+TraceLoopOpts
Relates :	JDK-8268017 - C2: assert(phi_type->isa_int() \|\| phi_type->isa_ptr() \|\| phi_type->isa_long()) failed: bad phi type
Relates :	JDK-8269575 - C2: assert(false) failed: graph should be schedulable after JDK-8252372
Relates :	JDK-8270307 - C2: assert(false) failed: bad AD file after JDK-8267687
Relates :	JDK-8277529 - SIGSEGV in C2 CompilerThread Node::rematerialize() compiling Packet::readUnsignedTrint
Relates :	JDK-8313262 - C2: Sinking node may cause required cast to be dropped
Relates :	JDK-8269797 - C2: assert(!in->is_CFG()) failed: CFG Node with no controlling input after JDK-8252372

Description

While working on JDK-8249607 it was not clear if the code starting at L1399:

 // See if a shared loop-varying computation has no loop-varying uses.
  // Happens if something is only used for JVM state in uncommon trap exits,
  // like various versions of induction variable+offset.  Clone the
  // computation per usage to allow it to sink out of the loop.
  if (has_ctrl(n) && !n->in(0)) {// n not dead and has no control edge (can float about)

is really doing what it is supposed to do. There are several things to be checked and might to be reworked:
- Revisit the entire cloning optimization:
  - Is it really required or would other loop opts move loads out of the loop anyways if there are no uses inside?
  - What if there are some uses inside and outside, is it really beneficial to do the cloning and possibly ending up with multiple loads?
If optimization is required:
- We should rethink the yanking part as it does not prevent the nodes to be put back on the worklist and might block some optimizations. Is there another way to prevent nodes without control to float back into the loop? Maybe adding a cast node with an explicit control?
- Make sure to not pin loads inside the loop if x_ctrl turns out to be a node inside the loop at the end (if that is not already guaranteed)
- Make sure not to pin loads in an outer strip mined loop if they do not have a use there (i.e. think about late_load_ctrl, maybe have a look at idea in 
http://cr.openjdk.java.net/~chagedorn/8249607/webrev.01/ which tackles that problem)

Comments

Changeset: 9d305b9c Author: Roland Westrelin <roland@openjdk.org> Date: 2021-05-26 09:20:42 +0000 URL: https://git.openjdk.java.net/jdk/commit/9d305b9c0625d73c752724569dbb7f6c8e80931c
26-05-2021
JDK-8260709 revealed yet another problem with this code. Targeting this RFE to JDK 17 for now to keep it on the radar.
03-02-2021
For JDK-8260420, I've experimented with updating late_load_ctrl: @@ -1467,7 +1468,9 @@ void PhaseIdealLoop::split_if_with_blocks_post(Node *n) { // // Because we are setting the actual control input, factor in // the result from get_late_ctrl() so we respect any - // anti-dependences. (6233005). + // anti-dependences. (6233005). Re-compute late ctrl now that + // the load has been cloned and is less restricted by its users. + late_load_ctrl = get_late_ctrl(x, n_ctrl); x_ctrl = dom_lca(late_load_ctrl, x_ctrl); See option 3) described in the PR: https://git.openjdk.java.net/jdk/pull/2315 We would also need to adjust the initial check on late_load_ctrl: // If n is a load, and the late control is the same as the current // control, then the cloning of n is a pointless exercise, because // GVN will ensure that we end up where we started. if (!n->is_Load() \|\| late_load_ctrl != n_ctrl) {
29-01-2021
Roland wrote in his RFR for JDK-8229483: "Unrelated to this fix, I wonder if the code at: http://hg.openjdk.java.net/jdk/jdk/file/cb836bd08d58/src/hotspot/share/opto/loopopts.cpp#l1346 really does what the comment says it does for loads (and really does anything useful actually). Late control for the load is computed with: late_load_ctrl = get_late_ctrl(n, n_ctrl); to make sure anti dependences are taken into account. But if a load is in a loop, it's because it has uses outside [Tobias: he probably meant "inside"] the loop. So late control for the load is in the loop too. When sinking the load, the restriction is that clones should not float below late control, then clones are going to stay in the loop. And that code doesn't do anything for loads. Or am I missing something? https://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/2019-August/034846.html
28-01-2021