VJtop is a JVM monitoring tool to provide a dynamic real-time view of the busiest ten threads, which plays the similar role of the "top" command for viewing the host operation system.
VJtop allows to display process summary information of current CPU/Memory intensive threads within JVM. Using the information collected from /proc 、PerfData and JMX, VJtop is originally forked from the jvmtop project but added many new features by exploiting nice properties of the SJK project. The usage of VJtop offers a highly smooth user experience.
VJtop is built as NON stop-the-world and is considered ready for production diagnostics.
Download vjtop-1.0.4.zip(from Maven Central)
Run the following command under the same user who started the target process.
// showing threads consuming the most cpu
./vjtop.sh <PID>
Process data are retrieved
- from /proc/PID/*
- from /tmp/hsperfxxxx, where stats are written by JDK every other second
- from JMX of the targeted VM. (If JMX isn't started at the time VJtop will try to attach to the process to start JMX).
[Note] If the same items appear in both PerfData and JMX, the one from PerfData is perferred. Item in JMX is used instead when PerfData is unavailable.
With ThreadMxBean:
- getAllThreadIds() is called to collect Thread Ids
- getThreadCpuTime(tids) is called to get all thread cpu time as well as sys cpu time and memory allocation.
- getThreadInfo(tids) is called, top 10 threads are shown. StackTrace is not fetched thus the application will not halt.
// ranks threads by their cpu time, by default, the top 10 are shown and refreshed in every 10 secs
./vjtop.sh <PID>
// ranks threads by total cpu time since startup, differentiated from cpu time within the interval
./vjtop.sh --totalcpu <PID>
// ranks threads by sys cpu
./vjtop.sh --syscpu <PID>
// ranks threads by total sys cpu
./vjtop.sh --totalsyscpu <PID>
VJTop 1.0.0 - 11:38:02, UPTIME: 3d01h
PID: 127197, JVM: 1.7.0_79, USER: even.liang
PROCESS: 0.99% cpu ( 0.04% of 24 core), 2491m rss, 0m swap
IO: 24k rchar, 1k wchar, 0 read_bytes, 0 write_bytes
THREAD: 97 active, 89 daemon, 99 peak, 461 created, CLASS: 12243 loaded, 0 unloaded
HEAP: 160m/819m eden, 0m/102m sur, 43m/1024m old
NON-HEAP: 55m/256m cms perm gen, 8m/96m codeCache
OFF-HEAP: 0m/0m direct, 0m/0m map
GC: 0/0ms ygc, 0/0ms fgc, SAFE-POINT: 6 count, 1ms time, 1ms syncTime
THREADS-CPU: 1.01% (user= 0.31%, sys= 0.70%)
TID NAME STATE CPU SYSCPU TOTAL TOLSYS
43 metrics-mercury-metric-logger-1-thread-1 TIMED_WAIT 0.38% 0.28% 25.48% 9.13%
110 metrics-mercury-metric-logger-2-thread-1 TIMED_WAIT 0.38% 0.18% 25.43% 9.10%
496 RMI TCP Connection(365)-192.168.200.87 RUNNABLE 0.05% 0.05% 0.00% 0.00%
82 Proxy-Worker-5-10 RUNNABLE 0.01% 0.01% 0.93% 0.30%
120 threadDeathWatcher-6-1 TIMED_WAIT 0.00% 0.00% 0.26% 0.09%
98 Proxy-Worker-5-16 RUNNABLE 0.00% 0.00% 0.80% 0.26%
99 Proxy-Worker-5-17 RUNNABLE 0.00% 0.00% 0.92% 0.31%
63 Proxy-Worker-5-2 RUNNABLE 0.00% 0.00% 1.07% 0.37%
70 Proxy-Worker-5-5 RUNNABLE 0.00% 0.00% 0.78% 0.26%
102 Proxy-Worker-5-20 RUNNABLE 0.00% 0.00% 0.80% 0.27%
Note: Only top 10 threads (according cpu load) are shown!
Cost time: 46ms, CPU time: 60ms
Process Region Explained:
rss
:Resident Set Size
, size of all the pages, fetched from /proc/<pid>/status, for definition see proc filesystemswap
: Size of pages that are swapped out, fetched from /proc/<pid>/status, for definition see proc filesystemrchar/wchar
: Number of bytes read/written with system calls, fetched from /proc/<pid>/io, for definition see proc filesystemread_bytes/write_bytes
: Bytes read from/written to the actual storage layer, fetched from/proc/<pid>/io, for definition see proc filesystemcodeCache
: Cache size holding binaries as result of JIT compilation. JIT Compilation will cease when code cache is fully occupied.direct
: Off-heap memory usage. Note that off-heap usage will not be recorded for recent Netty versions, which bypass the JDK API for memory allocation.SAFE-POINT
: JVM real stop counts and stop time, collected only when PerfData is available.
Thread Region Explained:
CPU
: cpu time by percentage within the output interval (100% per core).SYSCPU
: sys cpu time by percentage within the output interval (100% per core).TOTAL
: thread/process total cpu time by percentage since startup.TOLSYS
: total sys cpu time by percentage since startup.
Bottom Region Explained :
Cost time
: time cost for data gathering & outputting for this watch.CPU time
: cpu time cost for data gathering & outputting for this watch.
// ranks threads by memory allocation rates, by default, the top 10 are shown and refreshed in every 10 secs
./vjtop.sh --memory <PID>
// ranks Threads by total memory allcoation rates since startup (instead of by output interval)
./vjtop.sh --totalmemory <PID>
(headers omitted)
THREADS-MEMORY: 30k/s allocation rate
TID NAME STATE MEMORY TOTAL-ALLOCATED
47636 RMI TCP Connection(583)-127.0.0.1 RUNNABLE 27k/s(88.76%) 17m( 0.00%)
1 main RUNNABLE 2k/s( 8.44%) 370g(83.16%)
47845 JMX server connection timeout 47845 TIMED_WAIT 251/s( 0.80%) 21k( 0.00%)
46607 Worker-501 TIMED_WAIT 60/s( 0.19%) 934m( 0.20%)
46609 Worker-502 TIMED_WAIT 60/s( 0.19%) 822m( 0.18%)
46610 Worker-503 TIMED_WAIT 60/s( 0.19%) 737m( 0.16%)
46763 Worker-504 TIMED_WAIT 60/s( 0.19%) 696m( 0.15%)
46764 Worker-505 TIMED_WAIT 60/s( 0.19%) 743m( 0.16%)
47149 Worker-506 TIMED_WAIT 60/s( 0.19%) 288m( 0.06%)
46551 Worker-500 TIMED_WAIT 60/s( 0.19%) 757m( 0.17%)
Process Region Explained:
allocation rate
: Summed memory allocation speed of all threads.
Bottom Region Explained:
STATE
: Current thread stateMEMORY
: Instant memory allocation rate by the second (Instant memory allocation by this thread per second, divided by total allocation)TOTAL-ALLOCATED
: Accumulated memory allocations since startup, including those recycled (Accumulated memory allocation by this thread, divided by total memory allocation)
// prints other options
./vjtop.sh -h
// outputs to file
./vjtop.sh <PID> > /tmp/vjtop.log
// refreshes in every 5 secs (default is 10 secs)
./vjtop.sh -d 5 <PID>
// shows the top 20 threads (default is top 10)
./vjtop.sh -l 20 <PID>
// prints by width of 120 characters(default is 100)
./vjtop.sh -w 120 <PID> > /tmp/vjtop.log
// quits after 20 output interations
./vjtop.sh -n 20 <PID>
- Added Display: thread memory allocation rankings (from SJK).
- Added Display: thread sys cpu time rankings, total cpu time since startup rankings (from SJK).
- Added Display: thread physical memory, swapness, IO stats.
- Added Display: per generation memory and GC information display, CodeCache and off-heap memory display.
- Added Config Arg: printing interval, the number of threads to display.
- Performance boost: massively reduced time cost by means of fetching thread cpu time in batches (inspired by SJK).
- Remove profile page that causes STW from jvmtop.
- Eliminate the steps to fetch all java threads in jvmtop. Remove Overview page, in which results are sometimes underterministic.
- Default output interval set to 10 secs.
- Print cost incurred by the monitoring tool itself.