Hands-on with the Cheetah Canyon NUC: the “full-blooded” performance of a 40W Core i7-1165G7



In the second half of 2020, Intel released the 11th-generation Core Tiger Lake-UP3 series of processors for thin-and-light laptops. In 2021, Intel also updated its mini PC line with the new Cheetah Canyon NUC (Intel's English codename: Panther Canyon), built on 11th-generation Core Tiger Lake-UP3.

Earlier this month, the PConline evaluation lab published a first look at the Cheetah Canyon NUC (specifically the NUC11PAHi7, equipped with the Core i7-1165G7). That article focused on the user experience, however, and did not test the product's performance in detail.

In the author's view, the Cheetah Canyon NUC is only a mini PC, and its motherboard power delivery and cooling cannot match those of a desktop computer. Even so, its cooling still far exceeds that of the even more compact thin-and-light and business laptops, and stronger cooling lets the hardware deliver more of its performance. In this evaluation, we test the Cheetah Canyon NUC in detail to show the “full-blooded” performance of the Core i7-1165G7.

Test platform and power configuration

The Cheetah Canyon NUC is a barebones system: it ships without memory or storage, which users must buy themselves. To get the most out of the platform, the author fitted two 16GB DDR4-3200 CL22 modules for 32GB in dual-channel mode (unfortunately, the modules used in this test are 2R×16 rather than 2R×8, so they have fewer bank groups), with a 500GB Samsung 980 PRO SSD as the system drive.

In the AIDA64 memory test, the read speed is 47574MB/s, the write speed 45558MB/s, the copy speed 39825MB/s, and the latency 81.6ns. A 32GB dual-channel kit built from 2R×8 modules would be slightly more efficient.
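As a rough sanity check on these figures, the theoretical peak of dual-channel DDR4-3200 can be worked out and compared with the measured read speed. The sketch below assumes 64-bit (8-byte) channels and treats AIDA64's MB/s as 10^6 bytes per second; the function name is illustrative.

```python
# Theoretical peak bandwidth of dual-channel DDR4-3200 versus the measured
# AIDA64 read speed. Assumes 64-bit (8-byte) channels; names are illustrative.

def ddr_peak_mb_s(transfers_mt_s, channels=2, bytes_per_transfer=8):
    """Peak bandwidth in MB/s (1 MB = 10**6 bytes)."""
    return transfers_mt_s * channels * bytes_per_transfer

peak = ddr_peak_mb_s(3200)      # 3200 MT/s x 2 channels x 8 bytes
measured_read = 47574           # MB/s, AIDA64 read result quoted in the text
efficiency = measured_read / peak

print(f"peak = {peak} MB/s, read efficiency = {efficiency:.1%}")
```

By this estimate the measured read speed is already close to the theoretical ceiling, so any gain from 2R×8 modules would indeed be modest.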

Testing the Samsung 980 PRO SSD with CrystalDiskMark shows that the PCIe 4.0 support in Tiger Lake-UP3 is put to good use. Helped by good cooling, the read speed reaches 6346.25MB/s and the write speed 4845.46MB/s, as expected of a new-generation high-end SSD.
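To see why PCIe 4.0 matters here, the measured read speed can be compared with the raw link rates of PCIe 3.0 and 4.0. This is a minimal sketch that assumes an x4 link and 128b/130b encoding and ignores packet-level protocol overhead, so real drives will always land below these ceilings.

```python
# Line rate of a PCIe x4 link versus the measured sequential read speed.
# Assumptions: x4 link, 128b/130b encoding, no packet/protocol overhead.

def pcie_link_mb_s(gt_per_s, lanes, encoding=128 / 130):
    """Link rate in MB/s (1 MB = 10**6 bytes) after encoding overhead."""
    return gt_per_s * 1e9 * encoding * lanes / 8 / 1e6

gen3_x4 = pcie_link_mb_s(8, 4)     # PCIe 3.0: 8 GT/s per lane
gen4_x4 = pcie_link_mb_s(16, 4)    # PCIe 4.0: 16 GT/s per lane
measured_read = 6346.25            # MB/s, CrystalDiskMark result from the text

print(f"PCIe 3.0 x4: {gen3_x4:.0f} MB/s")
print(f"PCIe 4.0 x4: {gen4_x4:.0f} MB/s")
print(f"measured read exceeds the PCIe 3.0 x4 ceiling: {measured_read > gen3_x4}")
```

The measured read speed is well above what a PCIe 3.0 x4 link could carry, which is consistent with the drive running at PCIe 4.0 speeds.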

Hardware enthusiasts know that 11th-generation Core Tiger Lake-UP3 processors, including the Core i7-1165G7, are parts for thin-and-light laptops whose performance is very sensitive to the power limit settings. As mentioned in the previous article, this is exactly where the Cheetah Canyon NUC has an advantage. According to AIDA64 and HWiNFO64, the Cheetah Canyon NUC sets the PL1 (long-duration) power limit for the Core i7-1165G7 as high as 40W and the PL2 (short-duration) limit as high as 67W, far above the settings used in thin-and-light laptops.
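The interaction between the two limits can be illustrated with a simplified model: the package may draw up to PL2 as long as a moving average of its power stays at or below PL1. The sketch below is not Intel's actual firmware algorithm; the exponentially weighted average and the time constant TAU_S are assumptions chosen only for illustration, and thermal limits are ignored.

```python
# Simplified model of PL1/PL2 power limiting under a sustained all-core load.
# The package may draw up to PL2 while an exponentially weighted moving
# average (EWMA) of its power stays at or below PL1.
# TAU_S is a hypothetical time constant, not a value from this review.

PL1_W, PL2_W, TAU_S = 40.0, 67.0, 28.0

def simulate(seconds, dt=1.0):
    """Return the power drawn at each time step, in watts."""
    avg = 0.0                       # EWMA of package power, starting from idle
    alpha = dt / TAU_S
    trace = []
    for _ in range(int(seconds / dt)):
        # Largest draw this step that keeps the updated average <= PL1.
        allowed = (PL1_W - (1 - alpha) * avg) / alpha
        power = min(PL2_W, max(0.0, allowed))
        avg = (1 - alpha) * avg + alpha * power
        trace.append(power)
    return trace

trace = simulate(480)               # 8 minutes of sustained load
burst_steps = sum(1 for p in trace if p == PL2_W)
print(f"first step: {trace[0]:.0f} W, last step: {trace[-1]:.0f} W, "
      f"steps at PL2: {burst_steps}")
```

In this model the processor starts at PL2, spends a short burst there, and then settles at PL1 for the rest of the run, which is why PL1 is the figure that matters for long workloads.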

Of course, PL1 and PL2 are only configured limits. How much power the processor can actually sustain depends directly on the motherboard's power delivery and the cooling. At a room temperature of 26℃, the author ran the AIDA64 FPU-only stress test with HWiNFO64 monitoring in real time. After 8 minutes, the Cheetah Canyon NUC kept the Core i7-1165G7 running stably at 40W or more, with an all-core frequency of about 3.6GHz.

It is worth noting that the Willow Cove core microarchitecture of Tiger Lake-UP3 supports the AVX-512 instruction set, which raises power consumption at a given frequency. If AVX-512 is disabled for the AIDA64 FPU stress test, or if the user's actual workload does not use AVX-512, the Core i7-1165G7 can hold an all-core turbo frequency of about 3.9GHz to 4.1GHz within the 40W that the Cheetah Canyon NUC allows, which is close to the upper limit.

Introduction to the Willow Cove microarchitecture and theoretical performance tests

Before testing, let's review the architectural features of Tiger Lake-UP3, the 11th-generation Core processor. Tiger Lake-UP3 is the direct successor to Ice Lake-U and has been improved across the board: core microarchitecture, integrated graphics size, process technology, memory controller and more.

The core microarchitecture used by Tiger Lake-UP3 is called Willow Cove. Intel says Willow Cove is based on Sunny Cove, the core microarchitecture of Ice Lake, and the two are highly similar in many respects, such as the front end, the out-of-order buffers and the back-end execution units. However, Willow Cove reworks the cache system and greatly increases cache capacity: each core has 1.25MB of L2 cache and 3MB of L3 cache. A larger cache helps the processor work more efficiently in compute-intensive tasks. Willow Cove also supports Control-flow Enforcement Technology (CET), which improves security by defending against control-flow hijacking attacks.

Intel has previously tested Willow Cove in detail, running SPECrate 2017 Integer on the Core i7-1185G7E (4.4GHz, Willow Cove microarchitecture) and the Core i7-8665UE (4.4GHz, Skylake microarchitecture). The results show that Willow Cove's IPC is about 23% higher than Skylake's. Readers familiar with Intel processor microarchitectures will know that Sunny Cove, used in Ice Lake, has an 18% IPC advantage over Skylake. Roughly speaking, the IPC of Tiger Lake (Willow Cove) is therefore about 4% higher than that of Ice Lake (Sunny Cove).
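The 4% figure follows from dividing the two gains rather than subtracting them, since both are measured against the same Skylake baseline. A short worked example using only the percentages quoted above (variable names are illustrative):

```python
# Relative IPC gains quoted in the text, both measured against Skylake.
willow_vs_skylake = 1.23   # Willow Cove: about +23%
sunny_vs_skylake = 1.18    # Sunny Cove: about +18%

# Gains compound multiplicatively, so divide rather than subtract.
willow_vs_sunny = willow_vs_skylake / sunny_vs_skylake

print(f"Willow Cove vs Sunny Cove: +{willow_vs_sunny - 1:.1%}")
```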

From a microarchitecture design perspective, Willow Cove's roughly 4% IPC gain over Sunny Cove is small, even hard to notice. So how does Tiger Lake-UP3 achieve a double-digit performance improvement? The answer is the 10nm SuperFin process, which brings a large number of optimizations and can be regarded as the largest performance enhancement within a single process node.

Thanks to 10nm SuperFin, the core clock of Tiger Lake-UP3 has risen substantially, approaching 5.0GHz, and the power-efficiency curve is also much improved over the original 10nm Ice Lake-U.

Returning to the Cheetah Canyon NUC in this test, its processor is the Core i7-1165G7, with 4 cores and 8 threads, a base frequency of 2.8GHz, a maximum turbo frequency of 4.7GHz, 5MB of L2 cache and 12MB of L3 cache. AIDA64 CPUID identifies the same detailed parameters.

CPU-Z is a commonly used processor test tool, now at version 1.95. Its version 17 benchmark is short and light on load, so the Core i7-1165G7 in the Cheetah Canyon NUC can run at full frequency throughout, scoring 625.5 single-thread and 2840.1 multi-thread. This single-thread result is comparable to a Core i9-10900K running at 5.3GHz with Thermal Velocity Boost (TVB), which shows the efficiency gain of the Willow Cove microarchitecture.

The version 19 AVX2 test has always favored Intel processors, and the Willow Cove based Core i7-1165G7 continues that advantage, scoring 903.1 single-thread and 4202.2 multi-thread at full frequency. However, because AVX-512 runs at half throughput here (the back end can execute only one 512-bit FMA per cycle), the Core i7-1165G7 scores essentially the same in the version 19 AVX-512 test as in the version 19 AVX2 test.
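Why the two tests come out level can be seen from a peak-throughput estimate. The sketch below assumes, as is commonly documented for client Sunny Cove and Willow Cove cores, that each core has two 256-bit FMA units for AVX2 which act as a single 512-bit FMA unit for AVX-512; the function name is illustrative.

```python
# Peak FP32 FLOPs per cycle per core for AVX2 and AVX-512.
# Assumption: two 256-bit FMA units for AVX2, which combine into one
# 512-bit FMA unit for AVX-512 on this class of core.

def peak_flops_per_cycle(vector_bits, fma_units, element_bits=32):
    lanes = vector_bits // element_bits
    return fma_units * lanes * 2    # one FMA = one multiply + one add

avx2 = peak_flops_per_cycle(256, fma_units=2)
avx512 = peak_flops_per_cycle(512, fma_units=1)

print(f"AVX2: {avx2} FLOPs/cycle, AVX-512: {avx512} FLOPs/cycle")
```

Under that assumption the wider vectors bring no extra peak arithmetic throughput per cycle, so near-identical scores are what one would expect.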

Geekbench 5 is a popular cross-platform processor benchmark. The Core i7-1165G7 in the Cheetah Canyon NUC scores 1579 single-thread and 5396 multi-thread. For reference, a Core i9-10900K at 5.3GHz with TVB scores about 1400 in the Geekbench 5 single-thread test, so the IPC gain of the Willow Cove microarchitecture in Geekbench 5 exceeds 25%.
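The IPC claim can be checked by normalizing each single-thread score by clock speed. This rough estimate assumes both processors hold their maximum single-core turbo (4.7GHz and 5.3GHz) for the whole run, and uses the approximate 1400 figure quoted above.

```python
# Geekbench 5 single-thread score per GHz as a rough proxy for IPC.
# Assumption: both chips sustain their maximum single-core turbo clock.

i7_1165g7_per_ghz = 1579 / 4.7     # Willow Cove
i9_10900k_per_ghz = 1400 / 5.3     # Skylake-derived core, score is approximate

ipc_gain = i7_1165g7_per_ghz / i9_10900k_per_ghz - 1

print(f"score per GHz: {i7_1165g7_per_ghz:.0f} vs {i9_10900k_per_ghz:.0f}")
print(f"estimated IPC gain: {ipc_gain:.1%}")
```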

Next is the familiar Cinebench. Built on Cinema 4D, it is widely used and has now been updated to R23 alongside Cinema 4D. Because it is based on a single application scenario, however, Cinebench cannot fully reflect a processor's overall performance.

In the Cheetah Canyon NUC, the Core i7-1165G7 scores 589pts single-thread and 2624pts multi-thread in R20, and 1544pts single-thread and 6799pts multi-thread in R23. The single-thread result is the more impressive: it surpasses the Core i9-10900K desktop processor, which is based on the older Skylake architecture, and reflects the IPC gain of the Willow Cove microarchitecture. In multi-thread, thanks to stronger cooling and a higher power limit, the Cheetah Canyon NUC is still faster than the thin-and-light Core i7-1165G7 laptops on the market.

POV-Ray is free, open-source software that renders 3D images by ray tracing. Tested with version 3.7.1, the Core i7-1165G7 in the Cheetah Canyon NUC scores 633.27pps single-thread and 2724.11pps multi-thread.
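One way to read the single-thread and multi-thread results together is to compute how far each benchmark scales from one thread to all eight. The sketch below uses only the scores reported so far; the dictionary layout is illustrative, and the ratios say nothing about how each benchmark weights its subtests.

```python
# Multi-thread / single-thread score ratio for the results reported so far
# (4 cores, 8 threads). Each entry is (single-thread, multi-thread).

scores = {
    "CPU-Z v17":     (625.5, 2840.1),
    "Geekbench 5":   (1579, 5396),
    "Cinebench R20": (589, 2624),
    "Cinebench R23": (1544, 6799),
    "POV-Ray 3.7.1": (633.27, 2724.11),
}

scaling = {name: multi / single for name, (single, multi) in scores.items()}

for name, ratio in scaling.items():
    print(f"{name:14s} {ratio:.2f}x")
```

The rendering-style tests scale by a little more than 4x on this 4-core part, while Geekbench 5, whose multi-thread score is less purely throughput-bound, scales less.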

Using Blender 2.83 LTS with the BMW27 scene, the Cheetah Canyon NUC finishes rendering in 6 minutes 15 seconds, about one minute faster than the mainstream thin-and-light Core i7-1165G7 laptops tested previously.

V-Ray is a renderer developed by Chaos Group, a specialist renderer developer, and is one of the most popular rendering engines in the industry. Tested with benchmark version 5.0.20, the Cheetah Canyon NUC scores 4901 vsamples, again ahead of a Core i7-1165G7 configured with a 28W PL1.

As mentioned earlier, the Willow Cove microarchitecture of Tiger Lake-UP3 supports the AVX-512 instruction set and its extensions. Some software already makes good use of AVX-512, such as y-cruncher (which supports AVX-512 IFMA).

Specifically, computing pi to 1,000,000,000 digits with y-cruncher on the Cheetah Canyon NUC takes about 60s, which is faster than many gaming laptops with 8-core processors. For a 4-core, 8-thread processor, that is rare, and it shows how efficient the Willow Cove microarchitecture is with the new instruction set. Of course, AVX-heavy computation is strongly affected by cache and memory. The memory used this time is a 32GB dual-channel 2R×16 kit; a 32GB dual-channel 2R×8 kit might perform better.

Performance test of the Iris Xe graphics on the Xe-LP architecture

Besides the Willow Cove core microarchitecture, the other major upgrade in the 11th-generation Core Tiger Lake-UP3 processors is the integrated Iris Xe graphics.

After announcing its return to the discrete graphics market, Intel laid out ambitious plans. The Xe GPU architecture family comprises Xe-HPC for high-performance computing, Xe-HP for data centers, Xe-HPG for gamers and Xe-LP for low-power devices. The Xe integrated graphics in Tiger Lake-UP3 are based on the Xe-LP architecture.

The Iris Xe graphics in the Core i7-1165G7 are the full-specification version, with 96 execution units (EUs) and a maximum frequency of 1.3GHz. Many factors affect integrated graphics performance, though: besides cooling and power, system memory is a top priority. The Tiger Lake-UP3 memory controller supports both LPDDR4x-4266 and DDR4-3200, but the Cheetah Canyon NUC has no on-board LPDDR4x-4266 memory, which is a pity.
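For a sense of scale, the peak FP32 throughput implied by these specifications can be estimated. The sketch below assumes 8 FP32 lanes per Xe-LP EU and counts a fused multiply-add as two operations; neither figure comes from this review, and sustained performance will be lower, especially when memory bandwidth is the bottleneck.

```python
# Estimated peak FP32 throughput of the 96EU Iris Xe graphics.
# Assumptions (not from the review): 8 FP32 lanes per EU, and one fused
# multiply-add per lane per cycle, counted as 2 FLOPs.

EUS = 96
FP32_LANES_PER_EU = 8      # assumed
FLOPS_PER_FMA = 2
MAX_CLOCK_GHZ = 1.3

peak_gflops = EUS * FP32_LANES_PER_EU * FLOPS_PER_FMA * MAX_CLOCK_GHZ

print(f"peak FP32: about {peak_gflops / 1000:.2f} TFLOPS")
```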

As mentioned earlier, the Cheetah Canyon NUC was fitted with two 16GB DDR4-3200 CL22 (2R×16) modules for 32GB in dual-channel mode. In 3DMark, the author ran three tests: Night Raid, Fire Strike and Time Spy. The scores of the 96EU Iris Xe are broadly in line with expectations and roughly equal to those of the MX350.

However, since the memory used this time is not the best DDR4-3200 configuration, the 3DMark results above are for reference only and do not represent the maximum performance of the Iris Xe integrated graphics. The detailed graphics performance of the Cheetah Canyon NUC (including theoretical integrated graphics tests and games) will be tested with 2R×8 memory in the future, alongside a comparison with thin-and-light laptops that use LPDDR4x-4266 memory.

Overall performance and practical application tests

For the overall performance of the whole machine, PCMark 10 was run first. In the standard test, the Cheetah Canyon NUC scores 5088, similar to high-end thin-and-light laptops equipped with 11th-generation Core Tiger Lake-UP3.

In PCMark 10's Applications test (Office and Edge), the Cheetah Canyon NUC scores 11257, breaking the 11000 mark and slightly ahead of high-end thin-and-light laptops equipped with 11th-generation Core Tiger Lake-UP3.

UL, the organization behind PCMark and 3DMark, recently launched a new benchmark suite, Procyon, which includes scripted workloads for Adobe software and provides a photo editing benchmark and a video editing benchmark.

The photo editing benchmark uses Adobe Lightroom to import, process and modify the selected images, and Adobe Photoshop for image editing and layer adjustment. This test used Adobe Lightroom 10.1.1 and Adobe Photoshop 22.2.0. The Cheetah Canyon NUC achieves an overall score of 6029, an image retouching score of 6258 and a batch processing score of 5810.

The video editing test uses Adobe Premiere Pro to export video. Here the 96EU Iris Xe integrated graphics of the Core i7-1165G7 come into play, and the overall score is 2113. The core i00630 is slightly better than the hdu1099.

Summary: what else can we expect from the Cheetah Canyon NUC?

Intel's 11th-generation Core Tiger Lake-UP3 has been on the market for some time. However, because it is a processor for thin-and-light and business laptops, and most OEMs tune the cooling of these two product types conservatively, neither enthusiasts nor the media have been able to see the “full-blooded” performance of Tiger Lake-UP3 in the products available.

The arrival of the Cheetah Canyon NUC changes this, letting ordinary users experience the full performance of Tiger Lake-UP3. The test results show that the Core i7-1165G7 delivers strong single-thread performance in the Cheetah Canyon NUC (single-core turbo frequency is less sensitive to power and does not need 40W). In addition, thanks to the 40W power limit, its actual all-core turbo frequency in the Cheetah Canyon NUC is higher than in thin-and-light laptops, and its multi-thread performance improves accordingly.

Unfortunately, Tiger Lake-UP3 tops out at 4 cores and 8 threads; there are no models with more cores. As a result, outside of AVX-512 and other heavier loads, the all-core turbo frequency is already essentially maxed out at 40W. Manually raising PL1 and PL2 would bring little performance benefit, even if the cooling of the Cheetah Canyon NUC could cope. A 4-core, 8-thread specification may be adequate for now, but for a small desktop with stronger cooling, 6 cores and 12 threads would be closer to ideal.

In any case, the tests on the Cheetah Canyon NUC give a full picture of the “full-blooded” performance of the Core i7-1165G7. With new technologies such as the 10nm SuperFin process, the Willow Cove core microarchitecture and the Xe-LP graphics architecture, Tiger Lake-UP3 delivers a generational performance improvement over both Ice Lake-U (Sunny Cove microarchitecture on the original 10nm process) and Comet Lake-U (Skylake microarchitecture on the 14nm++ process).

Based on these results, the author also looks forward to the Tiger Lake-H45 series, the high-performance mobile version of the 11th-generation Core processors. Aimed at enthusiast-class gaming laptops, Tiger Lake-H45 also uses the Willow Cove core microarchitecture. Its single-core turbo frequency is higher than that of Tiger Lake-UP3 and can reach 5.0GHz. More importantly, Tiger Lake-H45 offers up to 8 cores and 16 threads, which should meet all of the market's expectations. In the future, Tiger Lake-H45 should be able to deliver even stronger performance in high-end gaming laptops and mobile workstations with excellent cooling!

Author: Diamond Bay

 

