Add support for infinite output model fallback #2631
When a response exceeds its length limit and the model doesn't support assistant prefill, we currently throw an error. This PR adds support for falling back to a dedicated "infinite output" model in such cases.
Changes
- Add `--infinite-output-model` CLI argument
- Add `infinite_output_model` support to the `Model` class

Impact
This is particularly valuable for users of models with lower output-token limits that don't support prefill.
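As a hypothetical usage sketch (assuming aider's CLI; the model names here are illustrative, not defaults from this PR), the new flag could pair a main model that lacks prefill with a fallback that handles continuation:

```shell
# Illustrative only: the main model doesn't support assistant prefill, so
# length-limited responses fall back to the designated infinite-output model.
aider --model gemini/gemini-1.5-pro \
      --infinite-output-model anthropic/claude-3-5-sonnet-20241022
```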
Implementation Notes
The flow is now: when a response hits its output length limit, continue via assistant prefill if the model supports it; otherwise, if an infinite output model is configured, switch to it to continue the response; only if neither is available do we raise the original error.
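The flow above can be sketched roughly as follows (a minimal illustration with assumed names, not aider's actual internals):

```python
# Hypothetical sketch of the fallback flow described in this PR.
# `Model`, its fields, and `resolve_continuation_model` are illustrative.
from dataclasses import dataclass
from typing import Optional


@dataclass
class Model:
    name: str
    supports_prefill: bool = False
    infinite_output_model: Optional["Model"] = None


def resolve_continuation_model(model: Model) -> Model:
    """Pick the model used to continue a length-limited response."""
    if model.supports_prefill:
        # Continue in place via assistant prefill, as before.
        return model
    if model.infinite_output_model is not None:
        # New behavior: fall back to the dedicated infinite-output model.
        return model.infinite_output_model
    # Neither prefill nor a fallback is available: surface the error.
    raise RuntimeError(
        f"{model.name} hit its output limit and supports no continuation"
    )
```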
I haven't added any default infinite output model configurations. The current convention is that default models (main/weak/editor) come from the same provider. Since the whole point of infinite output models is to fall back to a different provider when the main one doesn't support prefill, adding defaults would break that convention.
We could add defaults (e.g. falling back to Claude for Gemini users), but I kept this PR focused on just the core mechanism.