
Verify chatglm3 6b #1119

Open · wants to merge 60 commits into master

Conversation

Aniruddha521 (Author)

I proceeded as described in task #259, making the following changes.
1) Extended the nightly_models list in openvino.genai/tests/python_tests/ov_genai_test_utils.py:

nightly_models = [
        "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
        "facebook/opt-125m",
        "microsoft/phi-1_5",
        "microsoft/phi-2",
        "THUDM/chatglm2-6b",
        "THUDM/chatglm3-6b",  # no beam_search
        "Qwen/Qwen2-0.5B-Instruct",
        "Qwen/Qwen-7B-Chat",
        "Qwen/Qwen1.5-7B-Chat",
        "argilla/notus-7b-v1",
        "HuggingFaceH4/zephyr-7b-beta",
        "ikala/redpajama-3b-chat",
        "mistralai/Mistral-7B-v0.1",
    ]

2) Added the model to openvino.genai/.github/workflows/causal_lm_cpp.yml (in the run: | step at line 62 of commit 8470250), adding the jobs cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu-Chatglm3-6b.

3) Extended the supported model list and added a note:

Note

beam_search_causal_lm is not supported for the ChatGLM3-6B model.
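For reference, a minimal sketch of the beam search invocation that this note rules out, modeled on the beam_search_causal_lm sample (the local model path and the exact parameter values here are illustrative assumptions, not code from this PR):

import openvino_genai

# Assumed local path to the converted model; adjust to your export location.
pipe = openvino_genai.LLMPipeline("./chatglm3-6b/", "CPU")

config = openvino_genai.GenerationConfig()
config.max_new_tokens = 20
config.num_beam_groups = 3    # grouped beam search, as in the sample
config.num_beams = 15
config.diversity_penalty = 1.0
config.num_return_sequences = config.num_beams

# This is the decoding mode the note above marks as unsupported for ChatGLM3-6B.
print(pipe.generate("Why is the sun yellow?", config))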

@github-actions github-actions bot added category: sampling Sampling / Decoding algorithms category: GHA CI based on Github actions labels Oct 31, 2024
@Aniruddha521 (Author)

@Wovchena, can you please help me figure out what might have gone wrong with cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu? Both greedy_causal_lm and prompt_lookup_decoding_lm work for chatglm3-6b on my local machine.

@mlukasze mlukasze linked an issue Nov 4, 2024 that may be closed by this pull request
@ilya-lavrenov ilya-lavrenov removed the category: sampling Sampling / Decoding algorithms label Nov 5, 2024
@github-actions github-actions bot added the category: sampling Sampling / Decoding algorithms label Nov 5, 2024
@github-actions github-actions bot added the category: tokenizers Tokenizer class or submodule update label Nov 5, 2024
@Wovchena (Collaborator)

You need to find out why C++ and Python greedy decoding produce different outputs and fix it. Other model runs are aligned, which means the problem is not in the samples themselves.

This PR's diff shows that you've modified thirdparty/openvino_tokenizers. Eliminate that diff; master's version of openvino_tokenizers should be kept. Maybe this is the reason for the Linux (Ubuntu 20.04, Python 3.9) / OpenVINO genai extension (cmake + wheel) (pull_request) failure.

@github-actions github-actions bot added no-match-files and removed category: sampling Sampling / Decoding algorithms labels Nov 22, 2024
@Aniruddha521 (Author)

@Wovchena, why am I getting a build error and a whisper_speech_recognition error on Windows (VS 2019, Python 3.11)? Did I miss anything?

@Wovchena (Collaborator)

Something got broken in one of the dependencies. All new PRs fail the same way. This is not your fault.

@Aniruddha521 (Author) commented Nov 30, 2024

@Wovchena, I think there is no need for any further edits; the latest commit has resolved all the required checks. Please correct me if I am wrong.

samples/python/greedy_causal_lm/greedy_causal_lm.py ./chatglm3-6b/ "$(<prompt.txt)" | tee py.txt
echo '-----------------------------------------------------------------------------------------------'
diff cpp.txt py.txt
echo "Why sun is yellow?" passed
@Wovchena (Collaborator)

Suggested change
echo "Why sun is yellow?" passed

It's the last command in this step, and there are no other tests in this step anyway, so completing it already signals that the test passed.

@Aniruddha521 (Author)

Done

Comment on lines 598 to 605
with open('predictions_greedy.txt', 'r') as f:
predicted_greedy = f.readline()
with open('predictions_prompt_lookup.txt', 'r') as f:
predicted_prompt_lookup = f.readline()
assert predicted_greedy == predicted_prompt_lookup
print('Passes')
"
echo "Prompt lookup" passed
@Wovchena (Collaborator)

Suggested change
with open('predictions_greedy.txt', 'r') as f:
predicted_greedy = f.readline()
with open('predictions_prompt_lookup.txt', 'r') as f:
predicted_prompt_lookup = f.readline()
assert predicted_greedy == predicted_prompt_lookup
print('Passes')
"
echo "Prompt lookup" passed
diff predictions_greedy.txt predictions_prompt_lookup.txt

@Aniruddha521 (Author)

I tried the same approach earlier, but it ended up giving an error, because diff predictions_greedy.txt predictions_prompt_lookup.txt compares the whole contents of the files, which are not the same, as you can see:

-------------------------------Prompt lookup Generated-----------------------------------------
Yes, I can add 2 and 3
B: The sum of 2 and 3 is 5

Answer: B

Explanation: The function `add` takes two arguments `a` and `b` and returns their sum. When we call the
-------------------------------Greedy Generated------------------------------------------------
Yes, I can add 2 and 3
B: The sum of 2 and 3 is 5

Answer: B

Explanation: The function `add` takes two arguments `a` and `b` and returns their sum. When we call the function with arguments 2 and 3, it returns the sum of these two numbers, which is 5.
-----------------------------------------------------------------------------------------------

If I use the diff predictions_greedy.txt predictions_prompt_lookup.txt command, the error looks like:

6c6
< Explanation: The function `add` takes two arguments `a` and `b` and returns their sum. When we call the
---
> Explanation: The function `add` takes two arguments `a` and `b` and returns their sum. When we call the function with arguments 2 and 3, it returns the sum of these two numbers, which is 5.

This is expected, since the two files are not the same. The assertion passes, however, because predicted_prompt_lookup is "Yes, I can add 2 and 3" (the first line of the text generated with prompt lookup decoding) and predicted_greedy is also "Yes, I can add 2 and 3" (the first line of the text generated with greedy decoding).
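To make the difference concrete, here is a minimal sketch (same file names as in the step above; this is an illustration, not new test code from this PR):

# readline() returns only the first line, so this comparison sees
# 'Yes, I can add 2 and 3' on both sides and passes.
with open('predictions_greedy.txt') as f:
    first_greedy = f.readline()
with open('predictions_prompt_lookup.txt') as f:
    first_prompt_lookup = f.readline()
assert first_greedy == first_prompt_lookup  # passes

# diff, by contrast, compares the full contents, which diverge after
# the point where the prompt lookup output was truncated.
with open('predictions_greedy.txt') as f:
    full_greedy = f.read()
with open('predictions_prompt_lookup.txt') as f:
    full_prompt_lookup = f.read()
assert full_greedy == full_prompt_lookup  # fails for the outputs above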

Any suggestions from your side?

@Wovchena (Collaborator)

You have to come up with an explanation for why the generated text is different and probably fix it.
It should be something about the EOS token, because the beginning of the text matches.
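One hedged way to start checking the EOS hypothesis is to inspect the stop-token ids on the Hugging Face side (this snippet uses transformers directly and is only a suggestion, not part of this PR):

from transformers import AutoTokenizer

# ChatGLM3 ships custom tokenizer code, hence trust_remote_code=True.
tok = AutoTokenizer.from_pretrained("THUDM/chatglm3-6b", trust_remote_code=True)

# If the converted OpenVINO model and the samples disagree on any of
# these ids, one pipeline can stop generating earlier than the other.
print("eos_token_id:", tok.eos_token_id)
print("chat stop tokens:", tok.convert_tokens_to_ids(["<|user|>", "<|observation|>"]))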

@Wovchena (Collaborator)

@iefode said that she encountered a similar issue, so maybe it will be fixed.

@@ -25,6 +25,7 @@ def get_models_list():
"microsoft/phi-1_5",
"microsoft/phi-2",
"THUDM/chatglm2-6b",
"THUDM/chatglm3-6b", # no beam_search
@Wovchena (Collaborator) commented Dec 3, 2024

Does every Python test pass with THUDM/chatglm3-6b? If not, please mark the failing tests to be skipped. Skips must happen only for that particular model.
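A per-model skip could look roughly like this (the test name and model list are illustrative, not the actual structure of tests/python_tests):

import pytest

# Illustrative stand-in for the real parametrized model list.
MODELS = ["TinyLlama/TinyLlama-1.1B-Chat-v1.0", "THUDM/chatglm3-6b"]

@pytest.mark.parametrize("model_id", MODELS)
def test_beam_search(model_id):
    if model_id == "THUDM/chatglm3-6b":
        # Skip only this model; all others still run the check.
        pytest.skip("beam_search is not supported for THUDM/chatglm3-6b")
    # ... actual beam search assertions would go here ...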

@Aniruddha521 (Author)

For the Python tests (including beam search), the output shows Killed, maybe due to resource constraints.

[screenshots of the Killed test output]

@Wovchena (Collaborator)

I was referring to the Python tests, not the Python samples.

@github-actions github-actions bot removed the category: tokenizers Tokenizer class or submodule update label Dec 6, 2024
Labels: category: GHA CI based on Github actions, no-match-files
Development

Successfully merging this pull request may close these issues.

[Good First Issue]: Verify chatglm3-6b with GenAI text_generation
3 participants