Open dsouzai opened 4 years ago
-Xmx3200m -Xms3200m -Xmn1200m
Shift0
RUN RESULT: hbIR (max attempted) = 13837, hbIR (settled) = 11548, max-jOPS = 10516, critical-jOPS = 2144
RUN RESULT: hbIR (max attempted) = 12302, hbIR (settled) = 11859, max-jOPS = 10088, critical-jOPS = 2224
RUN RESULT: hbIR (max attempted) = 13837, hbIR (settled) = 11548, max-jOPS = 10793, critical-jOPS = 2135
RUN RESULT: hbIR (max attempted) = 13837, hbIR (settled) = 11548, max-jOPS = 10931, critical-jOPS = 2154
max-jOPS = 10582 critical-jOPS = 2164.25
Shift3
RUN RESULT: hbIR (max attempted) = 12302, hbIR (settled) = 11859, max-jOPS = 10703, critical-jOPS = 2161
RUN RESULT: hbIR (max attempted) = 11859, hbIR (settled) = 11489, max-jOPS = 10673, critical-jOPS = 2080
RUN RESULT: hbIR (max attempted) = 11548, hbIR (settled) = 11430, max-jOPS = 10278, critical-jOPS = 2129
RUN RESULT: hbIR (max attempted) = 11548, hbIR (settled) = 10959, max-jOPS = 10278, critical-jOPS = 2071
max-jOPS = 10483 critical-jOPS = 2110.25
- 1% drop in max-jOPS
- 2.5% drop in critical-jOPS
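The averages and percentage drops above can be rechecked with a short script. This is only a sketch: the run values are copied from the RUN RESULT lines in this comment, and the helper name `percent_drop` is made up for illustration, not part of any benchmark tooling.

```python
from statistics import mean

# max-jOPS and critical-jOPS per run, copied from the RUN RESULT lines above.
shift0 = {"max-jOPS": [10516, 10088, 10793, 10931],
          "critical-jOPS": [2144, 2224, 2135, 2154]}
shift3 = {"max-jOPS": [10703, 10673, 10278, 10278],
          "critical-jOPS": [2161, 2080, 2129, 2071]}


def percent_drop(baseline, candidate):
    """Relative drop of candidate versus baseline, in percent."""
    return (baseline - candidate) / baseline * 100.0


for metric in shift0:
    base = mean(shift0[metric])
    cand = mean(shift3[metric])
    print(f"{metric}: shift0={base:.2f} shift3={cand:.2f} "
          f"drop={percent_drop(base, cand):.2f}%")
```

The drop is measured relative to the shift0 average, so it answers "how much slower is shift3 than shift0".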
-Xmx3200m -Xms3200m -Xmn2600m -Xjit:scratchSpaceLimit=2048000,acceptHugeMethods -Xgcpolicy:gencon -Xcompressedrefs -XXgc:forcedShiftingCompressionAmount=0
SPECjbb2005 bops = 72214
SPECjbb2005 bops = 72204
SPECjbb2005 bops = 72304
SPECjbb2005 bops = 72690
SPECjbb2005 bops = 71765
Average bops = 72235.4
SPECjbb2005 bops = 69579
SPECjbb2005 bops = 69860
SPECjbb2005 bops = 70412
SPECjbb2005 bops = 70350
SPECjbb2005 bops = 69376
Average bops = 69915.4
3.2% throughput drop
FYI @vijaysun-omr @zl-wang @mpirvu
@harryyu1994 could you please make a summary of all the experiments that were tried? It seems that only Power sees a regression of more than 2% from the move to shift3.
It is within expectations. I remember the overhead was about 2.5% when we did the experiments previously.
@vijaysun-omr I have all the results listed here, will be waiting for your final call on this.
@zl-wang I am still worried by the high throughput loss on Power (in excess of 3%). I don't know if you can afford to slow everything down by 3+% inside OpenShift on Power. While I agree we previously had an overhead of approximately 2-3% on all platforms due to the shift, we now find that the overhead on other platforms (X86 has more data shown than Z) is lower. Can you please try the same on your Open Liberty setup?
@vijaysun-omr I will give DT7/OpenLiberty a spin next, as I discussed with @harryyu1994.
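For readers unfamiliar with the shift being discussed, here is a simplified model of 32-bit compressed references. It sketches the general scheme only and is not OpenJ9's implementation. The function names and the sample address are hypothetical.

```python
GIB = 1 << 30


def compress(address: int, shift: int) -> int:
    """Encode a heap address as a 32-bit compressed reference."""
    assert address % (1 << shift) == 0, "address must be aligned to 2**shift"
    ref = address >> shift
    assert ref < (1 << 32), "address out of range for this shift"
    return ref


def decompress(ref: int, shift: int) -> int:
    """Recover the full address; with shift == 0 this is a no-op."""
    return ref << shift


def max_addressable_bytes(shift: int) -> int:
    """Address range reachable by a 32-bit reference at a given shift."""
    return (1 << 32) << shift


for shift in (0, 3):
    print(f"shift{shift}: up to {max_addressable_bytes(shift) // GIB} GiB")

addr = 0x1_2345_6788  # hypothetical 8-byte-aligned address above 4 GiB
assert decompress(compress(addr, 3), 3) == addr
```

In this model shift0 needs no extra arithmetic to decode a reference but can only reach 4 GiB. shift3 reaches 32 GiB at the cost of a shift on every encode and decode, and that extra work is the overhead being measured in this thread.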
shift0 average throughput: 2798/s
shift3 average throughput: 2757/s
The gap is about 1.5%.
However, the variation within a single run can be as large as 3-4%. I haven't investigated why it is not as stable as with my older driver; this one is the July 29 build from the Adopt site, since Harry suggested using a recent build.
Can we try to get a Daytrader7 run done on Z as well, so that we have more than just that one data point?
@zl-wang a fluctuation of 3-4% is high enough that we don't know whether the overhead is in the 3% range on Power in this case as well. Ideally we should try to understand what is different on Power before going ahead, but I am okay with first delivering the change that makes things portable wrt compressed refs, using the general approach taken in this design, and then working out how to improve the situation on Power as a continuing effort after that initial delivery.
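The point about fluctuation can be made concrete by comparing the gap between the averages with the run-to-run spread. The per-run Daytrader numbers are not posted in this thread, so this sketch reuses the SPECjbb2015 max-jOPS runs from the first comment. The range-over-mean metric and the name `spread_percent` are illustrative choices, not an established methodology.

```python
from statistics import mean

# max-jOPS per run from the SPECjbb2015 results earlier in the thread.
shift0 = [10516, 10088, 10793, 10931]
shift3 = [10703, 10673, 10278, 10278]


def spread_percent(runs):
    """Run-to-run range (max - min) as a percentage of the mean."""
    return (max(runs) - min(runs)) / mean(runs) * 100.0


gap = (mean(shift0) - mean(shift3)) / mean(shift0) * 100.0
noise = max(spread_percent(shift0), spread_percent(shift3))

print(f"gap between means: {gap:.2f}%")
print(f"largest run-to-run spread: {noise:.2f}%")
if gap < noise:
    print("gap is inside the run-to-run spread; more runs are needed")
```

When the spread exceeds the gap, as it does for these runs, a handful of runs cannot show whether the regression is real or how large it is.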
There are two approaches that were brought up in the Portable SCC discussion regarding how to deal with the potential for the compressed refs shift changing with the heap size.