How does the "Target length (tokens)" work? #769
Hi everybody, I'm trying to understand how it works between endpoints etc., but I can't find anything about it in the wiki. Can someone help me?
Answered by
LostRuins
Apr 8, 2024
Replies: 1 comment 4 replies
Max Length is the same setting as Amount to Generate: the maximum number of tokens that are allowed to be generated in one request. I assume that's what you're looking for. Max context length is the maximum total length of the story that the AI is allowed to process.
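To make the difference between the two settings concrete, here is a minimal sketch of a request body. It assumes the KoboldAI-style generate endpoint takes `prompt`, `max_length` and `max_context_length` fields; the helper names are hypothetical, and the idea that the prompt gets whatever room is left after reserving `max_length` is an assumption about how a frontend might budget the context, not something confirmed in this thread.

```python
def prompt_budget(max_length: int, max_context_length: int) -> int:
    """Tokens left for the prompt once room for the reply is reserved.

    Assumption: prompt and reply must fit together inside the context.
    """
    if max_length >= max_context_length:
        raise ValueError("max_length must be smaller than max_context_length")
    return max_context_length - max_length


def build_generate_payload(prompt: str, max_length: int = 200,
                           max_context_length: int = 4096) -> dict:
    """Build a request body for a KoboldAI-style generate call (hypothetical helper)."""
    # Validate the two limits against each other before building the body.
    prompt_budget(max_length, max_context_length)
    return {
        "prompt": prompt,
        "max_length": max_length,                   # cap on newly generated tokens
        "max_context_length": max_context_length,   # cap on total tokens processed
    }


payload = build_generate_payload("Once upon a time", max_length=200)
print(payload["max_length"], prompt_budget(200, 4096))
```

Note that `max_length` is only an upper bound here: nothing in the payload asks for a reply of a specific target size.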
4 replies
Then I think such a feature does not exist in stock KoboldCpp. I am not sure how the SillyTavern implementation works, but in Kobold a generation will continue until
Normally EOS will be hit long before the max length is hit. Combined with "Trim Sentences" and a decently long max length, it should get you close to what you'd expect.
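As a rough illustration of the behaviour described in these replies, the toy loop below stops on whichever comes first, an EOS token or the max length, and then trims a trailing incomplete sentence. It is not KoboldCpp's actual code: the `EOS` marker, the function names and the punctuation-based trimming rule are all assumptions made for this sketch.

```python
EOS = "<eos>"  # hypothetical end-of-sequence marker for this sketch


def generate(token_stream, max_length):
    """Collect tokens until EOS appears or max_length tokens are produced."""
    out = []
    for tok in token_stream:
        if tok == EOS:
            break  # the model decided it was finished
        out.append(tok)
        if len(out) >= max_length:
            break  # hard cap reached, possibly mid-sentence
    return out


def trim_incomplete_sentence(text):
    """Cut the text back to the last sentence-ending punctuation mark, if any."""
    last = max(text.rfind(ch) for ch in ".!?")
    return text if last == -1 else text[: last + 1]


tokens = ["The", " cat", " sat", ".", " Then", " it", EOS, " never"]
capped = "".join(generate(tokens, max_length=5))
print(trim_incomplete_sentence(capped))  # output cut at the cap, then trimmed
```

With a generous max length the EOS branch ends the reply naturally; with a tight one the cap triggers first and the trimming step hides the cut-off.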