Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

random text generation from Qwen2-VL-7B-Instruct with TGI3 #2851

Open
2 of 4 tasks
DongyoungKim2 opened this issue Dec 17, 2024 · 0 comments
Open
2 of 4 tasks

random text generation from Qwen2-VL-7B-Instruct with TGI3 #2851

DongyoungKim2 opened this issue Dec 17, 2024 · 0 comments

Comments

@DongyoungKim2
Copy link

System Info

OS version: Ubuntu 22.04
Model: Qwen2-VL-7B-Instruct
GPU: H100 80G
Deployment: k8s

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

I compared output between vLLM and TGI3 and TGI3 produce random text for non-english cases

  1. run Qwen2-VL-7B-Instruct model using tgi:3.0.0image
  2. run the same model using vllm:v0.6.4.post1 image
  3. for TGI3, no cuda graph used (Error for Qwen2-VL-2B-Instruct using v3.0.0 #2823)

and do the inference with openai api and TGI3 produce non-sense random characters.
Task was OCR for given image (written in Korean) and vLLM got correctly but TGI3.

Expected behavior

output from vLLM and TGI3 should be similar

@DongyoungKim2 DongyoungKim2 changed the title Issues using TGI3 with Qwen2-VL-7B-Instruct random text generation from Qwen2-VL-7B-Instruct with TGI3 Dec 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant