While I was running ATG app, I saw it hang after 1 days and 13 hours or so. I also found that the server process was hogging the system at the time of the hang. It is happening to Solaris 9 only, and I couldn't reproduce it using 2 other machines running Solaris 10 and 11. Since I couldn't keep the live process, I have restarted the test and hope I could reproduce it.
After Ramki looking at the pstack file from the gcore, it showed the vm thread was trying to safepoint the JVM while several threads appeared to be stuck looping in the monitor inflation code. We have given the heads-up to Dave, and he will look into the case.
Please see the pstack file attached.
Test related information:
hostname: jtg-blade2.sfbay.sun.com (please contact the submitter for the login)
Sun Blade 1000, 2 ways @ 900 MHZ with 2 G of RAM.
bash-2.05# cat /etc/*release*
Solaris 9 9/04 s9s_u7wos_09 SPARC
Copyright 2004 Sun Microsystems, Inc. All Rights Reserved.
Use is subject to license terms.
Assembled 29 June 2004
Log and core file location:
Please look at core.24669, which is the troubled server process that eventually hog the whole system.
core.24669 <- The server process core file
core.9232 <- The client process core file