1 # Welcome to the JDK!
2
3 For build instructions please see the
4 [online documentation](https://openjdk.org/groups/build/doc/building.html),
5 or either of these files:
6
7 - [doc/building.html](doc/building.html) (html version)
8 - [doc/building.md](doc/building.md) (markdown version)
9
10 See <https://openjdk.org/> for more information about the OpenJDK
11 Community and the JDK and see <https://bugs.openjdk.org> for JDK issue
12 tracking.
|
1 # Welcome to the Leyden Prototype Repository
2
3 The purpose of the Leyden repository is to prototype improvements to
4 the startup time, time to peak performance, and footprint of Java programs, as a part of
5 [Project Leyden](https://openjdk.org/projects/leyden). We solicit feedback from
6 the Java community, with the hope that some of these improvements can eventually be
7 incorporated into future JDK releases.
8
9 ## 0. Disclaimers
10
11 - *This repository contains experimental and unstable code. It is not intended to be used
12 in a production environment.*
13 - *This repository is intended for developers of the JDK, and advanced Java developers who
14 are familiar with building the JDK.*
15 - *The experimental features in this repository may be changed or removed without notice.
16 Command line flags and workflows will change.*
17 - *The benchmarks results reported on this page are for illustrative purposes only. Your
18 applications may get better or worse results.*
19
20 To try out the Leyden prototype without building it from source code, please download the
21 Leyden EA Release from [https://jdk.java.net/leyden/](https://jdk.java.net/leyden/).
22
23 ## 1. Overview
24
25 As of JDK 25, the Leyden Project has successfully delivered ahead-of-time (AOT)
26 optimizations JEPs:
27
28 - [JEP 483 - Ahead-of-Time Class Loading & Linking](https://openjdk.org/jeps/483)
29 - [JEP 514 - Ahead-of-Time Command-Line Ergonomics](https://openjdk.org/jeps/514)
30 - [JEP 515 - Ahead-of-Time Method Profiling](https://openjdk.org/jeps/515)
31
32 Please refer to the above JEPs for a detailed discussion of AOT optimizations.
33
34 The Leyden "[premain](https://github.com/openjdk/leyden/blob/premain/)" prototype
35 includes new experimental AOT optimizations that are not yet integrated into the JDK mainline:
36
37 - **[Ahead-of-Time Code Compilation (JEP draft 8335368)](https://openjdk.org/jeps/8335368)**: Methods that are frequently used during the training run can be
38 compiled and stored along with the AOT cache. As a result, as soon as the application starts up
39 in the production run, its methods can be natively executed.
40 - This feature is enabled by default when you create an AOT cache. It can be disabled with the diagnostic
41 flag `-XX:-AOTCodeCaching`.
42
43 - **Ahead-of-time generation of [Dynamic Proxies](https://docs.oracle.com/en/java/javase/22/docs/api/java.base/java/lang/reflect/Proxy.html)**:
44 Dynamic proxies are frequently used by popular application frameworks. We can improve start-up time by generating these proxies ahead of time.
45 - This feature is enabled by default when you create an AOT cache. It can be disabled with the diagnostic
46 flag `-XX:-ArchiveDynamicProxies`.
47
48 - **Ahead-of-time generation of reflection data**: Reflection data (such as instances of
49 `java.lang.reflect.Method`) are generated by the JVM to support `java.lang.reflect` operations. We can
50 generate these ahead of time to improve start-up.
51 - This feature is enabled by default when you create an AOT cache. It can be disabled with the diagnostic
52 flag `-XX:-ArchiveReflectionData`.
53
54 - **Class Not Found Cache**: Sometimes application frameworks repeatedly try to load classes that do not exist. This optimization allows such failing lookups to be done quickly without repeatedly scanning the class path.
55 - This feature is enabled by default when you create an AOT cache. It can be disabled with the diagnostic
56 flag `-XX:-ArchiveLoaderLookupCache`.
57
58 ## 2. Building the Leyden Repository
59
60 The Leyden Repository can be built in the same way as the main-line JDK repository.
61 Please use the "premain" branch. I.e., [https://github.com/openjdk/leyden/tree/premain](https://github.com/openjdk/leyden/tree/premain).
62
63 For build instructions please see the
64 [online documentation](https://openjdk.org/groups/build/doc/building.html),
65 or either of these files:
66
67 - [doc/building.html](doc/building.html) (html version)
68 - [doc/building.md](doc/building.md) (markdown version)
69
70 See <https://openjdk.org/> for more information about the OpenJDK
71 Community and the JDK and see <https://bugs.openjdk.org> for JDK issue
72 tracking.
73
74 ## 3. Trying out Leyden Features
75
76 The easiest way to try out the Leyden optimizations is to build a JVM from the Leyden repository, and use it with your application with the `-XX:AOTCache` flag.
77
78
79 Here's a small benchmark that uses the JDK's built-in
80 [`JavaCompiler`](https://docs.oracle.com/en/java/javase/21/docs/api/java.compiler/javax/tools/JavaCompiler.html)
81 class to compile some Java source files. This benchmark spends a significant amount of start-up time
82 setting up the classes used by `JavaCompiler`, so it will benefit from the Leyden features.
83
84 First, download [JavacBenchApp.java](test/setup_aot/JavacBenchApp.java) and compile it into a JAR file.
85
86 (Remember to use the `java` program that you built from the Leyden repository.)
87
88 ```
89 $ javac JavacBenchApp.java
90 $ jar cvf JavacBenchApp.jar JavacBenchApp*.class
91 added manifest
92 adding: JavacBenchApp$ClassFile.class(in = 1608) (out= 787)(deflated 51%)
93 adding: JavacBenchApp$FileManager.class(in = 2090) (out= 979)(deflated 53%)
94 adding: JavacBenchApp$SourceFile.class(in = 1351) (out= 671)(deflated 50%)
95 adding: JavacBenchApp.class(in = 7571) (out= 3302)(deflated 56%)
96 ```
97
98 We can run this benchmark without any AOT optimizations. It takes 893 ms:
99
100 ```
101 $ java -cp JavacBenchApp.jar JavacBenchApp 50
102 Generated source code for 51 classes and compiled them in 893 ms
103 ```
104
105 To use AOT optimizations for JavacBenchApp, we should first perform a _training run_ and
106 capture the profiling information into `JavacBenchApp.aotconfig`
107
108 ```
109 $ java -XX:AOTMode=record -XX:AOTConfiguration=JavacBenchApp.aotconfig \
110 -cp JavacBenchApp.jar JavacBenchApp 50
111 $ ls -l JavacBenchApp.aotconfig
112 -rw-rw-r-- 1 iklam iklam 27652096 Mar 3 16:23 JavacBenchApp.aotconfig
113 ```
114
115 With the `JavacBenchApp.aotconfig` file, we can create the AOT cache. This is called the _assembly phase_:
116
117 ```
118 $ java -XX:AOTMode=create -XX:AOTConfiguration=JavacBenchApp.aotconfig \
119 -cp JavacBenchApp.jar -XX:AOTCache=JavacBenchApp.aot
120 $ ls -l JavacBenchApp.aot
121 -r--r--r-- 1 iklam iklam 42332160 Mar 3 16:58 JavacBenchApp.aot
122 ```
123
124 Alternatively, you can also combine the training run and assembly phase with a single command:
125
126 ```
127 $ java -XX:AOTCacheOutput=JavacBenchApp.aot \
128 -cp JavacBenchApp.jar JavacBenchApp 50
129 $ ls -l JavacBenchApp.aot
130 -r--r--r-- 1 iklam iklam 42332160 Mar 3 16:58 JavacBenchApp.aot
131 ```
132
133 Now, we can make a _production run_ of the program using the AOT cache `JavacBenchApp.aot`. It finishes in 423 ms, or more than twice as fast as
134 before.
135
136 ```
137 $ java -XX:AOTCache=JavacBenchApp.aot -cp JavacBenchApp.jar JavacBenchApp 50
138 Generated source code for 51 classes and compiled them in 423 ms
139 ```
140
141 ### Ending the Training Run Early
142
143 By default, training runs end when the application terminates. You have two other options to end training runs:
144
145 - `-XX:AOTEndTrainingOnMethodEntry=<method1,method2,...>[,count=100]`
146 - `jcmd <pid> AOT.end_training`
147
148 Note that `-XX:AOTEndTrainingOnMethodEntry` uses the same format as `-XX:CompileOnly` and the default count is 1.
149
150 See [EndTrainingOnMethodEntry.java](test/hotspot/jtreg/runtime/cds/appcds/leyden/EndTrainingOnMethodEntry.java) for a test case.
151
152 ### Diagnosing Potential Performance Issues
153
154 As mentioned below, parts or all of the AOT cache may be disabled under certain circumstances. This may lead
155 to lower performance than expected. To diagnose potential performance issues, you can add `-Xlog:aot*` to the
156 command line to see detailed information about what parts of the AOT cache are being utilized. For example, if the
157 the AOT-compiled code cannot be loaded, you will see a log message like this:
158
159 ```
160 [0.008s][info][aot,codecache,init] Mapped 652184 bytes at address 0x00007f491005f028 from AOT Code Cache
161 [0.008s][info][aot,codecache,init] Loaded 439 AOT code entries from AOT Code Cache
162 [0.008s][info][aot,codecache,init] Unable to use AOT Code Cache.
163 ```
164
165 ### Diagnostic VM Flags
166
167 By default, all of the optimizations described
168 in the [Overview](#1-overview) section above are enabled by default. This ensures that you can get all the optimizations
169 without specifying them individually.
170
171 For diagnostic purposes, you can selectively disable some of the options:
172
173 - The `-XX:+AOTCodeCaching` flag affects only the assembly phase and the production run.
174 - The `-XX:+AOTRecordTraining` flag affects only the training run and the assembly phase.
175 - The `-XX:+AOTReplayTraining` flag affects only the production run.
176 - All other options affect only the assembly phase.
177
178 For example, you can disable the loading of AOT-compiled methods during the production run. Notice that the benchmark now
179 starts more slowly than it did when AOT-compiled methods was loaded.
180
181 ```
182 $ java -XX:AOTCache=JavacBenchApp.aot -Xlog:cds=error -XX:-AOTCodeCaching \
183 -cp JavacBenchApp.jar JavacBenchApp 50
184 Generated source code for 51 classes and compiled them in 647 ms
185 ```
186
187 You can also disable AOT compilation in the assembly phase. Note that the size of the AOT
188 cache is smaller because it no longer has AOT-compiled methods.
189
190 ```
191 $ java -XX:AOTMode=create -XX:AOTConfiguration=JavacBenchApp.aotconfig \
192 -cp JavacBenchApp.jar \
193 -XX:AOTCache=JavacBenchApp.aot -XX:-AOTCodeCaching
194 $ ls -l JavacBenchApp.aot
195 -r--r--r-- 1 iklam iklam 29990912 Mar 3 16:34 JavacBenchApp.aot
196 ```
197
198
199 ## 4. Limitations of the Leyden Prototype
200
201 When trying out the Leyden prototype, please pay attention to the following limitations.
202
203 ### The Same CPU Must be Used between Training and Production Runs
204
205 The AOT-compiled code will be only used if the production run is on a machine with the same type of CPU
206 as used in the training run and assembly phase. If this is not the case (for example, the production run is on
207 a machine that has different AVX capabilities), the AOT-compiled code will be ignored.
208
209
210 ### The Same Garbage Collector Must be Used between Training and Production Runs
211
212 The AOT cache generated by the Leyden prototype includes machine instructions that are specific to
213 the garbage collector. We recommend that you explicitly specify the same collector during both
214 training and production runs. For example, if you prefer to use the SerialGC:
215
216 ```
217 # assembly phase.
218 $ java -XX:AOTMode=create -XX:AOTConfiguration=JavacBenchApp.aotconfig \
219 -cp JavacBenchApp.jar \
220 -XX:AOTCache=JavacBenchApp.aot -XX:+UseSerialGC
221
222 # production run
223 $ java -XX:AOTCache=JavacBenchApp.aot -XX:+UseSerialGC -cp JavacBenchApp.jar \
224 JavacBenchApp 50
225 ```
226
227 Otherwise, the AOT cache may not be usable for the production run, leading to suboptimal performance.
228 For example, sometimes you may perform the assembly phase run on a large development host, and then use
229 a container to run the application in a small production node. In the following scenario, as the collector
230 is not explicitly specified, the VM will automatically pick G1 for the assembly phase, and SerialGC for the
231 production run (due to its limited amount of memory):
232
233 ```
234 # Assembly phase (uses G1 by default)
235 $ java -XX:AOTMode=create -XX:AOTConfiguration=JavacBenchApp.aotconfig \
236 -cp JavacBenchApp.jar -XX:AOTCache=JavacBenchApp.aot
237
238 # Production run (uses SerialGC)
239 $ docker run --rm -v /repos/leyden/build/linux-x64/images/jdk:/jdk -v $(pwd):/test \
240 --memory=1024m \
241 container-registry.oracle.com/java/openjdk \
242 bash -c 'cd /test; ' \
243 '/jdk/bin/java -XX:AOTCache=JavacBenchApp.aot ' \
244 ' -cp JavacBenchApp.jar JavacBenchApp 50'
245 [0.001s][error][aot] AOT cache has aot-linked classes. It cannot be used because
246 GC used during dump time (G1) is not the same as runtime (Serial)
247 [0.001s][error][aot] An error has occurred while processing the AOT cache.
248 [0.001s][error][aot] Unable to map shared spaces
249 Error occurred during initialization of VM
250 Unable to use AOT cache.
251 ```
252
253 ### Only G1GC, SerialGC, ParallelGC, EpsilonGC, ShenandoahGC are Supported
254
255 Currently, if you use any other garbage collector in combination with `-XX:AOTMode` or `-XX:AOTCache`, the VM will
256 exit with an error.
257
258 ```
259 $ java -XX:AOTMode=record -XX:AOTConfiguration=JavacBenchApp.aotconfig \
260 -cp JavacBenchApp.jar -XX:+UseZGC JavacBenchApp 50
261 Error occurred during initialization of VM
262 Cannot create the AOT configuration file: UseCompressedClassPointers must be enabled,
263 and collector must be G1, Parallel, Serial, Epsilon, or Shenandoah
264 ```
265
266 ## 5. Benchmarking
267
268 We use a small set of benchmarks to demonstrate the performance of the optimizations in the Leyden repo.
269
270 | Benchmark | Source |
271 | ------------- | ------------- |
272 |[helidon-quickstart-se](test/hotspot/jtreg/premain/helidon-quickstart-se) | https://helidon.io/docs/v4/se/guides/quickstart|
273 |[javac-bench](test/hotspot/jtreg/premain/javac_bench) | Using Javac to compile 50 source files |
274 |[micronaut-first-app](test/hotspot/jtreg/premain/micronaut-first-app) | https://guides.micronaut.io/latest/creating-your-first-micronaut-app-maven-java.html|
275 |[quarkus-getting-started](test/hotspot/jtreg/premain/quarkus-getting-started) | https://quarkus.io/guides/getting-started|
276 |[spring-boot-getting-started](test/hotspot/jtreg/premain/spring-boot-getting-started) | https://spring.io/guides/gs/spring-boot|
277 |[spring-petclinic](test/hotspot/jtreg/premain/spring-petclinic) | https://github.com/spring-projects/spring-petclinic|
278
279 ### Benchmarking Against JDK Main-line
280
281 To can compare the performance of Leyden vs the main-line JDK, you need:
282
283 - An official build of JDK 21
284 - An up-to-date build of the JDK main-line
285 - The latest Leyden build
286 - Maven (ideally 3.8 or later, as required by some of the demos). Note: if you are behind
287 a firewall, you may need to [set up proxies for Maven](https://maven.apache.org/guides/mini/guide-proxies.html)
288
289 The same steps are used for benchmarking all of the above demos. For example:
290
291 ```
292 $ cd test/hotspot/jtreg/premain/helidon-quickstart-se
293 $ make PREMAIN_HOME=/repos/leyden/build/linux-x64/images/jdk \
294 MAINLINE_HOME=/repos/jdk/build/linux-x64/images/jdk \
295 BLDJDK_HOME=/usr/local/jdk21 \
296 bench
297 run,mainline default,mainline custom static cds,mainline aot cache,premain aot cache
298 1,456,229,156,117
299 2,453,227,157,117
300 3,455,232,155,116
301 4,448,230,154,114
302 5,440,228,156,114
303 6,446,228,156,114
304 7,448,232,156,114
305 8,465,261,159,114
306 9,448,226,157,113
307 10,442,233,154,114
308 Geomean,450.05,232.41,155.99,114.69
309 Stdev,6.98,9.72,1.41,1.35
310 Markdown snippets in mainline_vs_premain.md
311 ```
312
313 The above command runs each configuration 10 times, in an interleaving order. This way
314 the noise of the system (background processes, thermo throttling, etc) is more likely to
315 be spread across the different runs.
316
317 As is typical for benchmarking start-up performance, the numbers are not very steady.
318 It is best to plot
319 the results (as saved in the file `mainline_vs_premain.csv`) in a spreadsheet to check for
320 noise and other artifacts.
321
322 The "make bench" target also generates GitHub markdown snippets (in the file `mainline_vs_premain.md`) for creating the
323 graphs below.
324
325 ### Benchmarking Between Two Leyden Builds
326
327 This is useful for Leyden developers to measure the benefits of a particular optimization.
328 The steps are similar to above, but we use the "make compare_premain_builds" target:
329
330 ```
331 $ cd helidon-quickstart-se
332 $ make PM_OLD=/repos/leyden_old/build/linux-x64/images/jdk \
333 PM_NEW=/repos/leyden_new/build/linux-x64/images/jdk \
334 BLDJDK_HOME=/usr/local/jdk21 \
335 compare_premain_builds
336 Old build = /repos/leyden_old/build/linux-x64/images/jdk with options
337 New build = /repos/leyden_new/build/linux-x64/images/jdk with options
338 Run,Old CDS + AOT,New CDS + AOT
339 1,110,109
340 2,131,111
341 3,118,115
342 4,110,108
343 5,117,110
344 6,114,109
345 7,110,109
346 8,118,110
347 9,110,110
348 10,113,114
349 Geomean,114.94,110.48
350 Stdev,6.19,2.16
351 Markdown snippets in compare_premain_builds.md
352 ```
353
354 Please see [test/hotspot/jtreg/premain/lib/Bench.gmk](test/hotspot/jtreg/premain/lib/Bench.gmk) for more details.
355
356 Note: due to the variability of start-up time, the benefit of minor improvements may
357 be difficult to measure.
358
359 ### Preliminary Benchmark Results
360
361 The following charts show the relative start-up performance of the Leyden/Premain branch vs
362 the JDK main-line.
363
364 For example, a number of "premain aot cache: 255" indicates that if the application takes
365 1000 ms to start-up with the JDK main-line, it takes only 255 ms to start up when all the
366 current set of Leyden optimizations are enabled.
367
368 The benchmark results are collected with `make bench` in the following directories under [test/hotspot/jtreg/premain](test/hotspot/jtreg/premain):
369
370 - `helidon-quickstart-se`
371 - `javac-bench`
372 - `micronaut-first-app`
373 - `quarkus-getting-started`
374 - `spring-boot-getting-started`
375 - `spring-petclinic`
376
377 The meaning of the four rows in the following the charts:
378
379 | Row | Meaning |
380 | ------------- | ------------- |
381 | **mainline default** |Run benchmark with no optimizations|
382 | **mainline custom static cds** |Run benchmark with a custom static CDS archive|
383 | **mainline aot cache** |Run benchmark with a custom AOT cache (JDK mainline)|
384 | **premain aot cache** |Run benchmark with a custom AOT cache (Leyden Premain Prototype)|
385
386 These JDK versions were used in the comparisons:
387
388 - JDK main-line: JDK 25, build 25+37-LTS-3491
389 - Leyden: https://github.com/openjdk/leyden/tree/8df3504f55cabe9ff8a1d239f469b18d00ff802b
390
391 For details information about the hardware and raw numbers, see [bench.20250912.txt](test/hotspot/jtreg/premain/bench_data/bench.20250912.txt)
392
393 ### Helidon Quick Start (SE) Demo (3.52x improvement)
394
395 ```mermaid
396 ---
397 config:
398 xyChart:
399 chartOrientation: horizontal
400 height: 300
401 ---
402 xychart-beta
403 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
404 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
405 bar [1000, 484, 398, 351]
406 ```
407
408 ### JavacBenchApp 50 source files (2.17x improvement)
409
410 ```mermaid
411 ---
412 config:
413 xyChart:
414 chartOrientation: horizontal
415 height: 300
416 ---
417 xychart-beta
418 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
419 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
420 bar [1000, 779, 567, 460]
421 ```
422
423 ### Micronaut First App Demo (2.85x improvement)
424
425 ```mermaid
426 ---
427 config:
428 xyChart:
429 chartOrientation: horizontal
430 height: 300
431 ---
432 xychart-beta
433 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
434 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
435 bar [1000, 475, 366, 321]
436 ```
437
438 ### Quarkus Getting Started Demo (2.73x improvement)
439
440 ```mermaid
441 ---
442 config:
443 xyChart:
444 chartOrientation: horizontal
445 height: 300
446 ---
447 xychart-beta
448 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
449 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
450 bar [1000, 487, 412, 367]
451 ```
452
453 ### Spring-boot Getting Started Demo (3.96x improvement)
454
455 ```mermaid
456 ---
457 config:
458 xyChart:
459 chartOrientation: horizontal
460 height: 300
461 ---
462 xychart-beta
463 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
464 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
465 bar [1000, 496, 334, 253]
466 ```
467
468 ### Spring PetClinic Demo (3.24 improvement)
469
470 ```mermaid
471 ---
472 config:
473 xyChart:
474 chartOrientation: horizontal
475 height: 300
476 ---
477 xychart-beta
478 x-axis "variant" ["mainline default", "mainline custom static cds", "mainline aot cache", "premain aot cache"]
479 y-axis "Elapsed time (normalized, smaller is better)" 0 --> 1000
480 bar [1000, 598, 554, 308]
481 ```
482
483 ## 6. More Documentation
484
485 Please see [test/hotspot/jtreg/premain/](test/hotspot/jtreg/premain) for more information.
|