-
Notifications
You must be signed in to change notification settings - Fork 6
/
Copy pathindex.html
648 lines (573 loc) · 34.8 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
<!DOCTYPE html>
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=windows-1252">
<title>ruby-prof</title>
<link rel="stylesheet" type="text/css" href="assets/styles.css">
</head>
<body>
<div class="header"><img src="./assets/ruby-prof-logo-white.svg"></div>
<div class="grid-container">
<nav role="navigation">
<ul>
<li><a href="#overview">Overview</a></li>
<li><a href="#why-ruby-prof">Why ruby-prof?</a></li>
<li><a href="#installation">Installation</a></li>
<li><a href="#getting-started">Getting Started</a></li>
<li><a href="#advanced-usage">Advanced Usage</a></li>
<li><a href="#reports">Reports</a></li>
<li><a href="#performance">Performance</a></li>
<li><a href="#profiling-rails">Profiling Rails</a></li>
<li><a href="#architecture">Architecture</a></li>
<li><a href="#version-1.0">Version 1.0</a></li>
<li><a href="#api-documentation">API Documentation</a></li>
<li><a href="#license"></a><a href="#license">License</a></li>
<li><a href="#development">Development</a></li>
</ul>
</nav>
<div class="main">
<h1>ruby-prof<span> </span></h1>
<p><a href="https://travis-ci.org/ruby-prof/ruby-prof"><img
src="https://travis-ci.org/ruby-prof/ruby-prof.png?branch=master"
alt="Build Status"></a></p>
<h2><a id="overview">Overview</a></h2>
<p>ruby-prof is a profiler for MRI Ruby. Its features include:</p>
<ul>
<li>
<p>Speed - it is a C extension and therefore many times faster than
the standard Ruby profiler.</p>
</li>
<li>
<p>Measurement Modes - ruby-prof can measure program wall time,
process time, object allocations and memory usage</p>
</li>
<li>
<p>Reports - ruby-prof can generate a variety of text and
cross-referenced html reports</p>
<ul>
</ul>
</li>
<li>
<p>Threads - supports profiling multiple threads simultaneously</p>
</li>
</ul>
<h2><a id="why-ruby-prof">Why ruby-prof?</a></h2>
<p>ruby-prof is helpful if your program is slow and you don't know why.
It can help you track down methods that are either slow, allocate a
large number of objects or allocate objects with high memory usage.
Often times the results will be surprising - when profiling what you
think you know almost always turns out to be wrong.</p>
<p>Since ruby-prof is built using ruby's C <a href="https://ruby-doc.org/core-2.6.1/TracePoint.html">tracepoint
</a>api, it knows a lot about your program. However, using ruby-prof
also comes with two caveats:</p>
<ul>
<li>
<p>To use ruby-prof you generally need to include a few lines of
extra code in your program (although see <a href="#command-line"
title="Command Line"> command line usage</a>)</p>
</li>
<li>
<p>Using ruby-prof will cause your program to run slower (see <a href="#performance">
Performance</a> section) </p>
</li>
</ul>
<p>Most of the time, these two caveats are acceptable. But if you need
to determine why a program running in production is slow or hung, a
sampling profiler will be a better choice. Excellent choices include <a
href="https://github.com/tmm1/stackprof">stackprof </a>or <a
href="https://rbspy.github.io/">rbspy</a>.</p>
<p>If you are just interested in memory usage, you may also want to
checkout the <a href="https://github.com/SamSaffron/memory_profiler">memory_profiler</a>
gem (although ruby-prof provides similar information).</p>
<h2><a id="installation">Installation</a></h2>
<p>The easiest way to install ruby-prof is by using Ruby Gems. To
install:</p>
<pre class="ruby"><span class="ruby-identifier">gem</span> <span class="ruby-identifier">install</span> <span
class="ruby-identifier">ruby</span><span class="ruby-operator">-</span><span class="ruby-identifier">prof</span>
</pre>
<p>If you are running Linux or Unix you'll need to have a C compiler
installed so the extension can be built when it is installed. If you
are running Windows, then you should install the Windows specific gem
or install install <a href="https://rubyinstaller.org/add-ons/devkit.html">devkit</a>.</p>
<p>ruby-prof requires Ruby 2.7.0 or higher.</p>
<p><strong>!!!WARNING!!!</strong></p>
<p>Ruby versions 2.7.0 and 2.7.1 have a <a href="https://bugs.ruby-lang.org/issues/17152">bug</a>
that can cause ruby-prof to return incorrect results.</p>
<p><br>
</p>
<h2><a id="getting-started">Getting Started</a></h2>
<p>There are three ways to use ruby-prof:</p>
<ul>
<li>command line</li>
<li>convenience API</li>
<li>core API</li>
</ul>
<!-- Command Line -->
<h3><a id="command-line">Command Line </a></h3>
<p>The easiest way to use ruby-prof is via the command line, which
requires no modifications to your program. The basic usage is:</p>
<pre>ruby-prof [options] <script.rb> [--] [script-options]<span
style="color:#808080;font-weight:bold;font-style:italic;">"</span></pre>
<p> Where script.rb is the program you want to profile.</p>
<p>For a full list of options, see <a href="doc/RubyProf/Cmd.html">RubyProf::Cmd</a>
documentation or execute the following command:</p>
<pre>ruby-prof -h</pre>
<!-- Convenience API -->
<h3>Convenience API<span></span></h3>
<p>The second way to use ruby-prof is via its convenience API. This
requires small modifications to the program you want to profile:</p>
<pre class="ruby"><span class="ruby-identifier">require</span> <span class="ruby-string">'ruby-prof'</span>
<span class="ruby-identifier">profile</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">Profile</span>.<span
class="ruby-identifier">new</span>
<span class="ruby-comment"># profile the code</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">start</span>
<span class="ruby-comment"># ... code to profile ...</span>
<span class="ruby-identifier">result</span> = <span class="ruby-object">profile</span>.<span
class="ruby-identifier">stop</span>
<span class="ruby-comment"># print a flat profile to text</span>
<span class="ruby-identifier">printer</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">FlatPrinter</span>.<span
class="ruby-identifier">new</span>(<span class="ruby-object">result</span>)
<span class="ruby-object">printer</span>.<span class="ruby-identifier">print</span>(<span
class="ruby-constant">STDOUT</span>)
</pre>
<p>Alternatively, you can use a block to tell ruby-prof what to profile:</p>
<pre class="ruby"><span class="ruby-identifier">require</span> <span class="ruby-string">'ruby-prof'</span>
<span class="ruby-comment"># profile the code</span>
<span class="ruby-identifier">result</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">Profile</span>.<span
class="ruby-identifier">profile</span> <span class="ruby-keyword">do</span>
<span class="ruby-comment"># ... code to profile ...</span>
<span class="ruby-keyword">end</span>
<span class="ruby-comment"># print a graph profile to text</span>
<span class="ruby-identifier">printer</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">GraphPrinter</span>.<span
class="ruby-identifier">new</span>(<span class="ruby-object">result</span>)
<span class="ruby-object">printer</span>.<span class="ruby-identifier">print</span>(<span
class="ruby-constant">STDOUT</span>, {})
</pre>
ruby-prof also supports pausing and resuming profiling runs.
<pre class="ruby"><span class="ruby-identifier">require</span> <span class="ruby-string">'ruby-prof'</span>
<span class="ruby-identifier">profile</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">Profile</span>.<span
class="ruby-identifier">new</span>
<span class="ruby-comment"># profile the code</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">start</span>
<span class="ruby-comment"># ... code to profile ...</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">pause</span>
<span class="ruby-comment"># ... other code ...</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">resume</span>
<span class="ruby-comment"># ... code to profile ...</span>
<span class="ruby-identifier">result</span> = <span class="ruby-object">profile</span>.<span
class="ruby-identifier">stop</span>
</pre>
<p>Note that resume will only work if start has been called previously.
In addition, resume can also take a block:</p>
<pre class="ruby"><span class="ruby-identifier">require</span> <span class="ruby-string">'ruby-prof'</span>
<span class="ruby-identifier">profile</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">Profile</span>.<span
class="ruby-identifier">new</span>
<span class="ruby-comment"># profile the code</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">start</span>
<span class="ruby-comment"># ... code to profile ...</span>
<span class="ruby-object">profile</span>.<span class="ruby-identifier">pause</span>
<span class="ruby-comment"># ... other code ...</span>
<span class="ruby-object">profile</span>.<span.<span class="ruby-identifier">resume</span> <span
class="ruby-keyword">do</span>
<span class="ruby-comment"># ... code to profile...</span>
<span class="ruby-keyword">end</span>
<span class="ruby-identifier">result</span> = <span class="ruby-object">profile</span>.<span
class="ruby-identifier">stop</span>
</pre>
<p>With this usage, resume will automatically call pause at the end of
the block.</p>
<p>The <code>RubyProf.profile</code> method can take various options,
which are described in the <a href="#profiling-options">Profiling
Options</a> below. </p>
<!-- Core API -->
<h3>Core API</h3>
<p>The convenience API is a wrapper around the the <code>Ruby::Profile</code>
class. Using the Profile class directly provides addition
functionality, such as <a href="#method-exclusion">method exclusion</a>.</p>
<p>To create a new profile:</p>
<pre>require 'ruby-prof'
profile = RubyProf::Profile.new(options)
result = profile.profile do
...
end</pre>
Once a profile is completed, you can either generate a <a href="#reports">report
</a>via a printer or <a href="#saving-results">save</a> the results
for later analysis. For a list of profiling options, please see the <a
href="#profiling-options">Profiling Options</a> section.
<h2><a id="advanced-usage">Advanced Usage</a></h2>
<p>This section describes advanced usage of ruby-prof. Additional
documentation for every class is also <a href="#api-documentation">available</a>.</p>
<h3><a id="profiling-options">Profiling Options</a></h3>
<p>ruby-prof understands the following options when profiling code:</p>
<dl>
<dt> measure_mode</dt>
<dd>What ruby-prof should measure. For more information see the <a href="#measurements">Measurement
Mode</a> section.
</dd>
<dt>track_allocations</dt>
<dd> Tracks each object location, including the object class, memory
size and source file location. For more information see the <a href="#allocation-tracking">Allocation
Tracking</a> section.
</dd>
<dt> exclude_threads</dt>
<dd>Array of threads which should not be profiled. For more
information see the <a href="#thread-inclusion-exclusion">Thread
Inclusion/Exclusion</a> section.
</dd>
<dt> include_threads</dt>
<dd>Array of threads which should be profiled. All other threads will
be ignored. For more information see the <a href="#thread-inclusion-exclusion">Thread
Inclusion/Exclusion</a> section.
</dd>
</dl>
<h3><a id="measurements">Measurement Mode<br <="" a="">
</a></h3>
<p>The measurement mode determines what ruby-prof measures when
profiling code. Supported measurements are:</p>
<dl class="rdoc-list note-list">
<dt>RubyProf::WALL_TIME</dt>
<dd>
<p>Wall time measures the real-world time elapsed between any two
moments in seconds. If there are other processes concurrently
running on the system that use significant CPU or disk time during
a profiling run then the reported results will be larger than
expected. On Windows, wall time is measured using <code>GetTickCount()</code>,
on MacOS by <code>mach_absolute_time</code>, on Linux by <code>clock_gettime</code>
and otherwise by <code>gettimeofday</code>.</p>
</dd>
<dt>RubyProf::PROCESS_TIME</dt>
<dd>
<p>Process time measures the time used by a process between any two
moments in seconds. It is unaffected by other processes
concurrently running on the system. Remember with process time
that calls to methods like sleep will not be included in profiling
results. On Windows, process time is measured using <code>GetProcessTimes</code>
and on other platforms by <code>clock_gettime</code>.</p>
</dd>
<dt>RubyProf::ALLOCATIONS</dt>
<dd>
<p>Object allocations measures show how many objects each method in
a program allocates. Measurements are done via Ruby's <code>GC.stat</code>
api.</p>
</dd>
<dt>RubyProf::MEMORY</dt>
<dd>
<p>Memory measures how much memory each method in a program uses in
bytes. Measurements are done via Ruby's<span style="font-family: monospace;">
TracePoint</span><code></code> api.</p>
</dd>
</dl>
<p>To set the measurement mode:</p>
<pre class="ruby"><span class="ruby-constant">RubyProf</span>.<span class="ruby-identifier">measure_mode</span> = <span
class="ruby-constant">RubyProf</span><span class="ruby-operator">::</span><span
class="ruby-constant">WALL_TIME</span>
<span class="ruby-constant">RubyProf</span>.<span class="ruby-identifier">measure_mode</span> = <span
class="ruby-constant">RubyProf</span><span class="ruby-operator">::</span><span
class="ruby-constant">PROCESS_TIME</span>
<span class="ruby-constant">RubyProf</span>.<span class="ruby-identifier">measure_mode</span> = <span
class="ruby-constant">RubyProf</span><span class="ruby-operator">::</span><span
class="ruby-constant">ALLOCATIONS</span>
<span class="ruby-constant">RubyProf</span>.<span class="ruby-identifier">measure_mode</span> = <span
class="ruby-constant">RubyProf</span><span class="ruby-operator">::</span><span
class="ruby-constant">MEMORY</span>
</pre>
<p>The default value is <code>RubyProf::WALL_TIME</code>. You may also
specify the measure mode by using the <code>RUBY_PROF_MEASURE_MODE</code>
environment variable: </p>
<pre class="ruby"><span class="ruby-identifier">export</span> <span
class="ruby-constant">RUBY_PROF_MEASURE_MODE</span>=<span
class="ruby-identifier">wall</span>
<span class="ruby-identifier">export</span> <span class="ruby-constant">RUBY_PROF_MEASURE_MODE</span>=<span
class="ruby-identifier">process</span>
<span class="ruby-identifier">export</span> <span class="ruby-constant">RUBY_PROF_MEASURE_MODE</span>=<span
class="ruby-identifier">allocations</span>
<span class="ruby-identifier">export</span> <span class="ruby-constant">RUBY_PROF_MEASURE_MODE</span>=<span
class="ruby-identifier">memory</span>
</pre>
<h3><a id="allocation-tracking">Allocation Tracking</a></h3>
<p>ruby-prof also has the ability to track object allocations. This
functionality can be turned on via the track_allocations option:</p>
<pre>require 'ruby-prof'
RubyProf.profile(:track_allocations => true) do
...
end</pre>
<p>Note the <code>RubyProf::ALLOCATIONS</code> measure mode is slightly
different than tracking allocations. The measurement mode provides
high level information about the number of allocations performed in
each method. In contrast, tracking allocations provides detailed
information about the type, number, memory usage and location of each
allocation. Currently, to see allocations results you must use the <a
href="doc/RubyProf/GraphHtmlPrinter.html"><code>RubyProf::GraphHtmlPrinter</code></a>.</p>
<a id="thread-inclusion-exclusion">
<h3>Thread Inclusion/Exclusion</h3>
<p>ruby-prof can profile multiple threads. Sometimes this can be
overwhelming. For example, assume you want to determine why your
tests are running slowly. If you are using minitest, it will
run your tests in parallel by spawning tens of worker threads (note
to tell minitest to use a single thread set the N environmental
variable like this ENV = 0). Thus, ruby-prof provides two
options to specify which threads should be profiled:</p>
<pre>exclude_threads::
Array of threads which should not be profiled.
include_threads::
Array of threads which should be profiled. All other threads will
be ignored.</pre>
</a>
<h3><a id="method-exclusion">Method Exclusion</a></h3>
<p>ruby-prof supports excluding specific methods and threads from
profiling results. This is useful for reducing connectivity in the
call graph, making it easier to identify the source of performance
problems when using a graph printer. For example, consider <code>Integer#times</code>:
it's hardly ever useful to know how much time is spent in the method
itself. We are more interested in how much the passed in block
contributes to the time spent in the method which contains the <code>Integer#times</code>
call. The effect on collected metrics are identical to eliminating
methods from the profiling result in a post process step.</p>
<pre>profile = RubyProf::Profile.new(...)
profile.exclude_methods!(Integer, :times, ...)
profile.start</pre>
<p>A convenience method is provided to exclude a large number of methods
which usually clutter up profiles:</p>
<pre class="ruby"><span class="ruby-identifier">profile</span>.<span class="ruby-identifier">exclude_common_methods!</span>
</pre>
<p>However, this is a somewhat opinionated method collection. It's
usually better to view it as an inspiration instead of using it
directly (see <a
href="https://github.com/ruby-prof/ruby-prof/blob/e087b7d7ca11eecf1717d95a5c5fea1e36ea3136/lib/ruby-prof/profile/exclude_common_methods.rb">github.com/ruby-prof/ruby-prof/blob/e087b7d7ca11eecf1717d95a5c5fea1e36ea3136/lib/ruby-prof/profile/exclude_common_methods.rb</a>).
</p>
<h3><a id="merge-threads">Merging Threads and Fibers</a></h3>
<p>A common design pattern is to have a main thread delegate work
to background threads or fibers (the rest of this section will just refer to worker
threads to keep the text simpler). Examples include web servers, such as
Puma and Falcon.</p>
<p>Understanding profiling results for worker threads can be very difficult
because there may be hundreds or thousands of them. To help with this,
ruby-prof includes the ability to merge results for worker threads that start
with the same parent method. In the best case, this can collapse
results into two threads - one for the parent thread and one for all worker threads.</p>
<p>Note the collapsed threads show the sum of times for all merged threads. For example,
assume there are 10 worker threads that each took 5 seconds to run. The single merged
thread will show a total time of 50 seconds. </p>
<p>To merge threads:</p>
<pre>profile = RubyProf.profile do
...
end
profile.merge!</pre>
<h3><a id="saving-results">Saving Results</a></h3>
<p>It can be helpful to save the results of a profiling run for later
analysis. Results can be saved using Ruby's <a href="https://ruby-doc.org/core-2.6.3/Marshal.html">marshal</a>
library.</p>
<pre>profile_1 = RubyProf.profile do
...
end
<span class="ruby-comment"># Save the results<span style="color: white;"></span></span>
<span class="ruby-constant"></span>data = Marshal.dump(profile_1)<br><br><span class="ruby-comment"># Sometime later load the results</span>
<span class="ruby-constant"></span>profile_2 = Marshal.load(data)
</pre>
<strong>!!!WARNING!!!</strong> - Only load ruby-prof profiles that you
know are safe. Demarshaling data can lead to arbitrary code execution
and thus can be <a
href="https://ruby-doc.org/core-2.6.3/Marshal.html#module-Marshal-label-Security+considerations">dangerous</a>.
<h2><a id="reports">Reports</a></h2>
<p>Once you have completed a profiling run, ruby-prof provides a number
of reports that you can use to analyze the results. Reports are
created via the use of printers:</p>
<pre class="ruby"><span class="ruby-identifier">profile </span>= <span class="ruby-constant">RubyProf</span>.profile do <br> ...<br> end<br><span
class="ruby-identifier">printer</span> = <span class="ruby-constant">RubyProf</span><span
class="ruby-operator">::</span><span class="ruby-constant">GraphPrinter</span>.<span
class="ruby-identifier">new</span>(profile)
<span class="ruby-identifier">printer</span>.<span class="ruby-identifier">print</span>(<span
class="ruby-constant">STDOUT</span>, <span class="ruby-value">:min_percent</span> <span
class="ruby-operator">=></span> <span class="ruby-value">2</span>)
</pre>
<p>The first parameter is any writable IO object such as <code>STDOUT</code>
or a file. The second parameter specifies the minimum percentage a
method must take to be printed. Percentages should be specified as
integers in the range 0 to 100. For more information please see the
documentation for the different printers.</p>
<p>The different types of reports, and their associated printers, are:</p>
<dl>
<dt>Flat Report (<a href="doc/RubyProf/FlatPrinter.html">RubyProf::FlatPrinter</a>)</dt>
<dd>The flat report show the overall time spent in each method. Its is
a good way of quickly identifying which methods take the most time.
An example of a flat report can be found in <a href="./examples/primes.flat.txt">examples/primes.flat.txt</a>.
</dd>
<dt>Graph Reports (<a href="doc/RubyProf/GraphPrinter.html">RubyProf::GraphPrinter</a>)</dt>
<dd>The graph report shows the overall time spent in each method. In
addition, it also show which methods call the current method and
which methods its calls. Thus they are good for understanding how
methods gets called and provide insight into the flow of your
program. An example graph report is located at <a href="./examples/primes.graph.txt">examples/primes.graph.txt</a>.
<br>
</dd>
<dt>HTML Graph Reports (<a href="doc/RubyProf/GraphHtmlPrinter.html">RubyProf::GraphHtmlPrinter</a>)</dt>
<dd>HTML Graph profiles are the same as graph reports, except output
is generated in hyper-linked HTML. Since graph reports can be quite
large, the embedded links make it much easier to navigate the
results. An example html graph profile is located at <a href="./examples/primes.graph.html">examples/primes.graph.html</a>.
</dd>
<dt>Call stack reports (<a href="doc/RubyProf/CallStackPrinter.html">RubyProf::CallStackPrinter</a>)</dt>
<dd>Call stack reports produce a HTML visualization of the time spent
in each execution path of the profiled code. An example can be found
at <a href="examples/primes.stack.html">examples/primes.stack.html</a>.
</dd>
<dt>Call graphs (<a href="doc/RubyProf/CallTreePrinter.html">RubyProf::CallTreePrinter</a>)</dt>
<dd>Call graphs output results in the calltree profile format which is
used by KCachegrind. More information about the format can be found
at the <a href="https://kcachegrind.github.io/html/Home.html">KCachegrind</a>
site. An example can be found at <a href="examples/primes.callgrind.out.20124">examples/primes.callgrind.out.20124</a>.
</dd>
<dt>Graphviz reports (<a href="doc/RubyProf/DotPrinter.html">RubyProf::DotPrinter</a>)</dt>
<dd>The graphviz report is designed to be opened by <a href="https://www.graphviz.org/">Graphviz
</a>to create visualization of profile results. An example can be
found at <a href="./examples/primes.dot">examples/primes.dot</a>.
</dd>
<dt>Multiple reports (<a href="doc/RubyProf/MultiPrinter.html">RubyProf::MultiPrinter</a>)</dt>
<dd>MultiPrinter which can generate several reports in one profiling
run. <code>MultiPrinter</code><span style="font-family: monospace;">
</span>requires a directory path and a profile basename for the
files they produce
<pre class="ruby"><span class="ruby-identifier">printer</span> = <span
class="ruby-constant">RubyProf</span><span class="ruby-operator">::</span><span
class="ruby-constant">MultiPrinter</span>.<span class="ruby-identifier">new</span>(<span
class="ruby-identifier">result</span>)
<span class="ruby-identifier">printer</span>.<span class="ruby-identifier">print</span>(<span
class="ruby-value">:path</span> <span class="ruby-operator">=></span> <span
class="ruby-string">"."</span>, <span
class="ruby-value">:profile</span> <span class="ruby-operator">=></span> <span
class="ruby-string">"profile"</span>)
</pre>
</dd>
</dl>
<h2><a id="performance">Performance</a></h2>
<p>Significant effort has been put into reducing ruby-prof's overhead.
Our tests show that the overhead associated with profiling code varies
considerably with the code being profiled. Most programs will run
approximately twice as slow while highly recursive programs (like the
fibonacci series test) will run up to five times slower.</p>
<h2><a id="profiling-rails"> Profiling Rails</a></h2>
<p>To profile a Rails application it is vital to run it using production
like settings (cache classes, cache view lookups, etc.). Otherwise,
Rail's dependency loading code will overwhelm any time spent in the
application itself (our tests show that Rails dependency loading
causes a roughly 6x slowdown). The best way to do this is create a new
Rails environment, profile.rb.</p>
<p>To profile Rails:</p>
<ol>
<li>
<p>Create a new profile.rb environment. Make sure to turn on <code>cache_classes</code>
and <code>cache_template_loading</code>. Otherwise your profiling
results will be overwhelmed by the time Rails spends loading
required files. You should likely turn off caching.</p>
</li>
<li>
<p>Add the ruby-prof to your gemfile:</p>
<pre class="ruby"><span class="ruby-identifier">group</span> <span
class="ruby-value">:profile</span> <span
class="ruby-keyword">do</span>
<span class="ruby-identifier">gem</span> <span class="ruby-string">'ruby-prof'</span>
<span class="ruby-keyword">end</span>
</pre>
</li>
<li>
<p>Add the ruby prof rack adapter to your middleware stack. The
Rails <a href="https://guides.rubyonrails.org/configuring.html#configuring-middleware"
title="Configure Middleware">documentation</a> describes several
ways to do this. One way is to add the following code to <code>application.rb</code>:</p>
<pre class="ruby"><span class="ruby-keyword">if</span> <span class="ruby-constant">Rails</span>.<span
class="ruby-identifier">env</span>.<span class="ruby-identifier">profile?</span>
<code class="ruby plain">config.middleware.use</code> <span class="ruby-constant">Rack</span><span
class="ruby-operator">::</span><span class="ruby-constant">RubyProf</span>, <span
class="ruby-value">:path</span> <span class="ruby-operator">=></span> <span
class="ruby-string">'./tmp/profile'</span>
<span class="ruby-keyword">end</span>
</pre>
<p>The path is where you want profiling results to be stored. By
default the rack adapter will generate a html call graph report
and flat text report.</p>
</li>
<li>
<p>Now make a request to your running server. New profiling
information will be generated for each request. Note that each
request will overwrite the profiling reports created by the
previous request!</p>
</li>
</ol>
<h2><a id="architecture">Architecture</a></h2>
<p>To learn more about how ruby-prof works internally please see the <a
href="./architecture.html" title="Architecture">architecture</a>
page.</p>
<p></p>
<ol>
</ol>
<h2><a id="version-1.0">Version 1.0</a></h2>
<p>ruby-prof's development stretches all the way back to 2005. Fourteen
years later, it seems time for version 1.0! Version 1.0 is a
significant update that greatly improves the internals of ruby-prof
and adds a number of improvements:</p>
<ul>
<li>
<p>Profiling is significantly faster (5x in some cases)</p>
</li>
<li>Recursive profiles are finally handled correctly. Yeah!!!</li>
<li>Redesigned <a href="./examples/primes.graph.html">reports</a>.</li>
<li>
<p>The ability to measure allocations and memory usage using a
standard (unpatched) version of ruby</p>
</li>
<li>
<p>The ability to save and reload profiling results for later
analysis</p>
</li>
<li>
<p>The ability track object allocations</p>
</li>
<li>New documentation <a href="https://ruby-prof.github.io">website</a>.</li>
</ul>
<p>At the same time, it was also time to remove old code and deprecated
methods. These changes include:</p>
<ul>
<li>
<p>Drop support for old versions of ruby. Currently 2.4 and up are
required.</p>
</li>
<li>
<p>Remove support for method elimination (instead use <a href="#method-exclusion">method
exclusion</a>).</p>
</li>
<li>
<p>Remove the CPU_TIME measurement mode since it duplicates the
PROCESS_TIME mode and required inline assembly code</p>
</li>
<li>
<p>Remove the GC_TIME and GC_RUNS measurement modes since they
required patched ruby versions. In modern ruby it is possible to
reimplement this functionality, but it does not seem of that much
value (shout out if you want it back).</p>
</li>
<li>Merge the <code>RubyProf::FlatPrinterWithLineNumbers</code>
printer into <code>RubyProf::FlatPrinter</code>.
</li>
<li>Removed support for RUBY_PROF_EDITOR_URI environment variable that
specified the link scheme. This features was more confusing then
helpful.
</li>
</ul>
<h2><a id="api-documentation">API Documentation</a></h2>
<p>API documenation for each class is <a href="doc/index.html">available</a>.</p>
<h2><a id="license">License</a></h2>
<p>See <a href="./LICENSE">LICENSE</a> for license information.</p>
<h2><a id="development">Development</a></h2>
<p>Code is located at <a href="https://github.com/ruby-prof/ruby-prof">github.com/ruby-prof/ruby-prof</a></p>
<ul>
</ul>
</div>
</div>
</body>
</html>