Despite being hosted on HF, this model ships no config.json and doesn't appear to support inference with the transformers library (or any other library, it seems), only Mistral's own mistral-inference runtime.
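For reference, a minimal sketch of running it through mistral-inference, following that package's documented API (the model directory, tokenizer filename, and prompt here are illustrative; note the `Transformer` import path has moved between package versions, older releases used `mistral_inference.model`):

```python
# Minimal mistral-inference sketch; model path and prompt are illustrative.
from mistral_inference.transformer import Transformer
from mistral_inference.generate import generate
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest

model_dir = "./codestral-22B"  # assumption: local snapshot of the HF repo

tokenizer = MistralTokenizer.from_file(f"{model_dir}/tokenizer.model.v3")
model = Transformer.from_folder(model_dir)

# Build an instruct-format prompt and tokenize it.
request = ChatCompletionRequest(
    messages=[UserMessage(content="Write a debounce function in JavaScript.")]
)
tokens = tokenizer.encode_chat_completion(request).tokens

# Greedy decode; eos_id comes from the underlying tokenizer.
out_tokens, _ = generate(
    [tokens], model, max_tokens=512, temperature=0.0,
    eos_id=tokenizer.instruct_tokenizer.tokenizer.eos_id,
)
print(tokenizer.instruct_tokenizer.tokenizer.decode(out_tokens[0]))
```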
Completed the initial instruction eval at FP16; this is an excellent model, especially at JavaScript. It used about 45GB of VRAM for inference during my test runs, so it should work on 2x24GB setups.
This model also supports FIM, so I'll keep this issue open for that, as well as for any quants as they pop up. A rough FIM sketch follows.
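Something like the sketch below should drive FIM through mistral-common's tokenizer, which wraps the prefix/suffix in the model's fill-in-the-middle control tokens. Hedged heavily: the `FIMRequest` import path has moved between mistral-common versions, and the model directory and snippet are illustrative.

```python
# FIM (fill-in-the-middle) sketch; the FIMRequest import path may differ
# across mistral-common versions.
from mistral_inference.transformer import Transformer
from mistral_inference.generate import generate
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.tokens.instruct.request import FIMRequest

model_dir = "./codestral-22B"  # assumption: local snapshot of the HF repo
tokenizer = MistralTokenizer.from_file(f"{model_dir}/tokenizer.model.v3")
model = Transformer.from_folder(model_dir)

prefix = "def fibonacci(n):\n"
suffix = "\n    return result"

# encode_fim wraps prefix/suffix in the model's FIM control tokens.
tokens = tokenizer.encode_fim(FIMRequest(prompt=prefix, suffix=suffix)).tokens

out_tokens, _ = generate(
    [tokens], model, max_tokens=256, temperature=0.0,
    eos_id=tokenizer.instruct_tokenizer.tokenizer.eos_id,
)
print(tokenizer.instruct_tokenizer.tokenizer.decode(out_tokens[0]))
```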
The latest interview_cuda supports torchrun and the mistral-inference runtime in an MVP capacity.
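For anyone curious how the torchrun path fits together, here is a rough sketch, not interview_cuda's actual code: the script name and model directory are hypothetical, and `num_pipeline_ranks` is mistral-inference's knob for splitting the model's layers across GPUs under a torchrun launch.

```python
# run_codestral.py (hypothetical name) -- launch with:
#   torchrun --nproc-per-node 2 run_codestral.py
import os
import torch
import torch.distributed as dist
from mistral_inference.transformer import Transformer
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer

# torchrun provides RANK/WORLD_SIZE/LOCAL_RANK in the environment.
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))
if not dist.is_initialized():
    dist.init_process_group(backend="nccl")

model_dir = "./codestral-22B"  # assumption: local snapshot of the HF repo
tokenizer = MistralTokenizer.from_file(f"{model_dir}/tokenizer.model.v3")
# num_pipeline_ranks=2 pipelines the model across the two GPU ranks,
# which is what makes the 2x24GB split workable at FP16.
model = Transformer.from_folder(model_dir, num_pipeline_ranks=2)
```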