This repository has been archived by the owner on Oct 25, 2024. It is now read-only.
BF16 weight prepack needs the cpu support avx512bw, avx512vl and avx512dq - how to set up neural chat to AVX2 #1506
hnguy31-hvevn started this conversation in General
Replies: 1 comment

Hi Team,
I have an issue on a Xeon Gold CPU in an Ubuntu VM. How do I set up NeuralChat to use AVX2? The server fails to start with the following error:

[2024-04-23 00:52:31,123] [ ERROR] - Failed to start server.
[2024-04-23 00:52:31,123] [ ERROR] - BF16 weight prepack needs the cpu support avx512bw, avx512vl and avx512dq, but the desired instruction sets are not available. Please set dtype to torch.float or set weights_prepack to False.
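
The error message itself names two workarounds: keep the model in torch.float, or keep BF16 but pass weights_prepack=False. A minimal sketch of both with the plain intel_extension_for_pytorch (IPEX) API, assuming NeuralChat forwards these settings to ipex.optimize(); the model name is only an example:

```python
# Minimal sketch of the two fallbacks from the error message, using IPEX directly.
# The model name is illustrative; NeuralChat may wire this up differently internally.
import torch
import intel_extension_for_pytorch as ipex
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Intel/neural-chat-7b-v3-1")
model.eval()

# Option 1: stay in FP32, so no BF16 weight prepack is attempted (works on AVX2-only CPUs).
model = ipex.optimize(model, dtype=torch.float32)

# Option 2: keep BF16 but skip the AVX-512-only weight prepack step.
# model = ipex.optimize(model, dtype=torch.bfloat16, weights_prepack=False)
```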

Hi @hnguy31-hvevn, yes, I think IPEX may optimize too aggressively. There is actually a parameter to turn off weights_prepack. I've made a PR here, #1526; could you have a look at whether it fixes your issue?
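
In the meantime, a rough sketch of how that decision could be made at startup, assuming a Linux host where /proc/cpuinfo exposes the flags the error message lists; the actual option name added by the PR is whatever #1526 defines:

```python
# Rough sketch: only enable BF16 weight prepack when the required AVX-512 subsets exist.
# Flag names match /proc/cpuinfo on Linux; the NeuralChat-level option is defined in the PR.
import torch

def cpu_supports_bf16_prepack() -> bool:
    required = {"avx512bw", "avx512vl", "avx512dq"}
    try:
        with open("/proc/cpuinfo") as f:
            flags = set()
            for line in f:
                if line.startswith("flags"):
                    flags.update(line.split(":", 1)[1].split())
                    break
        return required.issubset(flags)
    except OSError:
        return False

use_prepack = cpu_supports_bf16_prepack()
dtype = torch.bfloat16 if use_prepack else torch.float32
# model = ipex.optimize(model, dtype=dtype, weights_prepack=use_prepack)
```

On an AVX2-only machine this resolves to dtype=torch.float32 and weights_prepack=False, which matches the fallback the error message asks for.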