This repository contains our submission for the MediQA 2024 challenge, Task 2: Multilingual & Multimodal Medical Answer Generation. The pipeline converts the training data to the format used by the LLaVA-Med model, runs training and inference, and then converts the predictions back to the format required by the challenge.
Note: Generation is not fully deterministic, so results might differ slightly from `final_prediction.json`, which represents our submission.
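For orientation, the conversion step targets LLaVA-style conversation records, since LLaVA-Med trains on the same data format as LLaVA. The snippet below is only an illustrative sketch: the id, image name, and question/answer text are invented placeholders, and the exact fields emitted should be checked against the conversion code in this repository.

```bash
# Illustrative sketch only: prints one hypothetical record in the LLaVA-style
# conversation format that the data conversion step targets.
cat <<'EOF'
[
  {
    "id": "encounter_0001",
    "image": "encounter_0001.jpg",
    "conversations": [
      {"from": "human", "value": "<image>\nDescribe the visible skin condition and suggest possible treatments."},
      {"from": "gpt", "value": "The affected area shows ... (answer text taken from the challenge training data)"}
    ]
  }
]
EOF
```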
- Clone this repository recursively to include LLaVA-Med:
  ```bash
  git clone --recursive [email protected]:Shiniri/MediQA.git
  ```
- Include a valid Llama-7b checkpoint in the repository, as well as the images for the training data in `./data/images_train` and the images for the test data in `./data/images_test`. Also include the training and test json files from the challenge as `./data/train.json` and `./data/test.json` (see the layout sketch at the end of this list).
- Follow the setup instructions of the original LLaVA-Med repository.
- Set the Llama path variable in the `./run_experiment` script to point to your Llama checkpoint and execute it (a minimal sketch follows this list). Note: you can leave out parts of the script depending on which steps (data conversion, training, etc.) you need to re-run.
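For the data layout referenced above, the commands below sketch the expected structure; the `cp` source paths are placeholders for wherever you stored the downloaded challenge files:

```bash
# Create the expected data directories and place the challenge json files.
mkdir -p data/images_train data/images_test
cp /path/to/downloaded/train.json data/train.json   # placeholder source path
cp /path/to/downloaded/test.json  data/test.json    # placeholder source path
# Copy the challenge images into data/images_train and data/images_test as well.
ls data    # should list: images_test  images_train  test.json  train.json
```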
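Finally, a minimal sketch of the last step. The variable name `LLAMA_PATH` and the checkpoint location are assumptions; use whatever name the `./run_experiment` script actually defines:

```bash
# Step 1 (edit ./run_experiment): point the Llama path variable at your
# checkpoint. The variable name here is hypothetical; match the one defined
# in the script.
LLAMA_PATH="/path/to/llama-7b"

# Step 2 (from the repository root): execute the script to run data
# conversion, training, inference and the final format conversion.
bash ./run_experiment
```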