Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding Testing on new data #4

Open
SmritiChawla opened this issue Mar 10, 2022 · 2 comments
Open

Regarding Testing on new data #4

SmritiChawla opened this issue Mar 10, 2022 · 2 comments

Comments

@SmritiChawla
Copy link

SmritiChawla commented Mar 10, 2022

We are trying to predict drug response on a new dataset i.e. cell lines present in the new dataset is not present in the training dataset. However, I am getting the following error as shown in the screenshot below;
image
How can we predict on a new set of cell lines?

@rafaelperes
Copy link
Member

Hi @SmritiChawla! Could you give us more details? Are you using the notebook?

Best,
Rafael.

@sLee0709
Copy link

sLee0709 commented Nov 7, 2024

Hi @SmritiChawla! Could you give us more details? Are you using the notebook?

Best, Rafael.

Hi Rafael, I'm facing the same error. I'm firstly using model_training.py to train models based on GDSC data, and then use the trained models to make predictions towards our own single cell expression matrix. In the line 60 of model.py, np.matrix(test_kernel_df[kernel_sample_list]) failed, as it seems you used the cell line IDs of GDSC data as indexes for test_kernel_df. But the column names of test_kernel_of are cell label ids generated from scRNA sequencing. It is impossible to do that unless you used the same data. I'm not sure how did you do when you used your method towards new scRNA data?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants