graph: backend: dnnl: encode mem address into constant cache key #2312

xiang1guo · 2024-12-24T03:11:27Z

Description

Same compiled partitions may have different constant weights, potential accuracy issue may happens on future op direct optimization integration solutions.

This PR aims to enhance the library constant cache key for better cache or differentiation.

For the future direction of API design, please refer to #2280 for details. We will revisit this RFC once the real user scenario and request pops out.

Performance impact

Pattern level performance impact
- CPU: No impact
- GPU: No impact
Model level performance impact
- IPEX model:
  - RN50 fp32 no impact.
  - Bert large fp32/bf16/int8 no impact.

xiang1guo · 2024-12-25T08:05:56Z

make test
enable benchdnn_nightly
disable benchdnn_all
enable benchdnn_graph

TaoLv · 2024-12-26T09:06:10Z

src/graph/backend/dnnl/kernels/kernel_base.cpp

@@ -43,6 +43,24 @@ bool kernel_base_t::enabled_constant_cache() const {
    return enabled;
 }

+size_t kernel_base_t::encode_constant_cache_key(
+        const std::vector<tensor_t> &inputs, size_t cache_key) const {
+    // Encode the constant memory address into cache key for differentiation


With this, the original constant_key_ is not a cache key anymore. I would suggest to rename these variables as well as the function name of generate_constant_cache_key.

Thanks for the review! Rename to const_md_hash_ and generate_constant_md_hash accordingly, please review again.

src/graph/backend/dnnl/kernels/kernel_base.cpp

src/graph/backend/dnnl/kernels/large_partition.cpp

xiang1guo · 2024-12-31T03:15:20Z

make test
enable benchdnn_nightly
disable benchdnn_all
enable benchdnn_graph

xiang1guo added the component:graph-api Codeowner: @oneapi-src/onednn-graph label Dec 24, 2024

xiang1guo self-assigned this Dec 24, 2024

xiang1guo requested a review from a team as a code owner December 24, 2024 03:11

github-actions bot added the component:tests Codeowner: @oneapi-src/onednn-arch label Dec 24, 2024

xiang1guo force-pushed the xiang/main/fix-constant-cache-key branch from f4499e2 to ac702db Compare December 24, 2024 04:42

gyhintel approved these changes Dec 25, 2024

View reviewed changes

xiang1guo force-pushed the xiang/main/fix-constant-cache-key branch from ac702db to 89d402e Compare December 25, 2024 06:14

ElaineBao approved these changes Dec 25, 2024

View reviewed changes

wzt1997 approved these changes Dec 26, 2024

View reviewed changes

TaoLv reviewed Dec 26, 2024

View reviewed changes

xiang1guo added 2 commits December 31, 2024 03:11

graph: dnnl: encode mem address into cache key

620e9f9

gtest: graph: unit: test differnet constant weight tensor cache

28606ed

xiang1guo force-pushed the xiang/main/fix-constant-cache-key branch from 470f190 to 28606ed Compare December 31, 2024 03:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

graph: backend: dnnl: encode mem address into constant cache key #2312

graph: backend: dnnl: encode mem address into constant cache key #2312

xiang1guo commented Dec 24, 2024 •

edited

Loading

xiang1guo commented Dec 25, 2024

TaoLv Dec 26, 2024

xiang1guo Dec 26, 2024

xiang1guo commented Dec 31, 2024

graph: backend: dnnl: encode mem address into constant cache key #2312

Are you sure you want to change the base?

graph: backend: dnnl: encode mem address into constant cache key #2312

Conversation

xiang1guo commented Dec 24, 2024 • edited Loading

Description

Performance impact

xiang1guo commented Dec 25, 2024

TaoLv Dec 26, 2024

Choose a reason for hiding this comment

xiang1guo Dec 26, 2024

Choose a reason for hiding this comment

xiang1guo commented Dec 31, 2024

xiang1guo commented Dec 24, 2024 •

edited

Loading