Use sequence length axis in tensor trim #723

as-suvorov · 2024-08-01T10:06:21Z

Use sequence length axis in trimm_tensor

ilya-lavrenov · 2024-08-01T15:14:54Z

samples/cpp/prompt_lookup_decoding_lm/CMakeLists.txt

should we add such models with different sequence length dimension ID to GHA CI validation ?

Would we keep stateful mode after CB merge? If yes, then I think it make sense to add tests.

Yes, CB is already merged and we keep stateful mode.

Tests can be added separately

samples/cpp/prompt_lookup_decoding_lm/CMakeLists.txt

as-suvorov · 2024-08-02T11:54:38Z

samples/cpp/prompt_lookup_decoding_lm/prompt_lookup_decoding_lm.cpp

+    const std::map<std::string, size_t> model_type_to_seq_len_axis{
+        {"chatglm", 0},
+        {"llama", 2},
+    };


@ilya-lavrenov , @Wovchena , This approach is not reliable, chatglm3 and chatglm2 have seq_len_axis=0 but glm-4-9b-chat has seq_len_axis=2. All of them have same model_type='chatglm'.
I'll try to detect seq_len_axis by comparing kv tensor shape dimensions with generated tokens length.

as-suvorov · 2024-08-02T11:55:33Z

Removed from merge queue due to: #723 (comment)

.github/workflows/causal_lm_cpp.yml

as-suvorov added 2 commits August 1, 2024 11:37

Set seq len axis based on model type

841fae0

Rename function

06c7a91

as-suvorov requested review from pavel-esir and Wovchena August 1, 2024 10:06

as-suvorov added bug Something isn't working enhancement New feature or request labels Aug 1, 2024

ilya-lavrenov approved these changes Aug 1, 2024

View reviewed changes

ilya-lavrenov reviewed Aug 1, 2024

View reviewed changes

ilya-lavrenov assigned ilya-lavrenov and Wovchena Aug 1, 2024

ilya-lavrenov added this to the 2024.4 milestone Aug 1, 2024

ilya-lavrenov reviewed Aug 1, 2024

View reviewed changes

samples/cpp/prompt_lookup_decoding_lm/CMakeLists.txt Outdated Show resolved Hide resolved

ilya-lavrenov self-requested a review August 1, 2024 17:21

Add guard and policy

478de30

pavel-esir approved these changes Aug 2, 2024

View reviewed changes

ilya-lavrenov approved these changes Aug 2, 2024

View reviewed changes

ilya-lavrenov added this pull request to the merge queue Aug 2, 2024

as-suvorov commented Aug 2, 2024

View reviewed changes

as-suvorov removed this pull request from the merge queue due to a manual request Aug 2, 2024

as-suvorov marked this pull request as draft August 2, 2024 11:55

as-suvorov added 2 commits August 2, 2024 17:56

Update only trim fn

1b6350d

Fix formatting

a3fdc04

as-suvorov marked this pull request as ready for review August 2, 2024 16:01

ilya-lavrenov approved these changes Aug 2, 2024

View reviewed changes

Fix command

8dfc1ab

as-suvorov changed the title ~~Set sequence length axis based on model type~~ Use sequence length axis in tensor trim Aug 5, 2024

as-suvorov added 2 commits August 5, 2024 09:03

Fix main model

61a3081

Merge branch 'master' into as/set_seq_len_axis_based_on_model_type

511e60b

Wovchena reviewed Aug 5, 2024

View reviewed changes

.github/workflows/causal_lm_cpp.yml Show resolved Hide resolved

Wovchena approved these changes Aug 5, 2024

View reviewed changes

ilya-lavrenov added this pull request to the merge queue Aug 5, 2024

Merged via the queue into openvinotoolkit:master with commit 4e1e755 Aug 6, 2024
36 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use sequence length axis in tensor trim #723

Use sequence length axis in tensor trim #723

as-suvorov commented Aug 1, 2024 •

edited

Loading

ilya-lavrenov Aug 1, 2024 •

edited

Loading

as-suvorov Aug 2, 2024

Wovchena Aug 2, 2024

ilya-lavrenov Aug 2, 2024

as-suvorov Aug 2, 2024

as-suvorov commented Aug 2, 2024

Use sequence length axis in tensor trim #723

Use sequence length axis in tensor trim #723

Conversation

as-suvorov commented Aug 1, 2024 • edited Loading

ilya-lavrenov Aug 1, 2024 • edited Loading

Choose a reason for hiding this comment

as-suvorov Aug 2, 2024

Choose a reason for hiding this comment

Wovchena Aug 2, 2024

Choose a reason for hiding this comment

ilya-lavrenov Aug 2, 2024

Choose a reason for hiding this comment

as-suvorov Aug 2, 2024

Choose a reason for hiding this comment

as-suvorov commented Aug 2, 2024

as-suvorov commented Aug 1, 2024 •

edited

Loading

ilya-lavrenov Aug 1, 2024 •

edited

Loading