Streaming output and structured output cannot be supported at the same time #1569
@balala8 can you share your code showing what exactly you are trying to do?
Hello @balala8! Can you please also share your use case for streaming structured output? Disabling streaming for structured output was a conscious decision on our part, so I'm curious to know your intended use case.
Of course. I'm following OpenAI's example of supporting both structured output and streaming simultaneously. Here's my code, which is almost identical to OpenAI's implementation:

```python
from typing import List

from pydantic import BaseModel
from openai import OpenAI


class EntitiesModel(BaseModel):
    attributes: List[str]
    colors: List[str]
    animals: List[str]


client = OpenAI()

with client.beta.chat.completions.stream(
    model="gpt-4",
    messages=[
        {"role": "system", "content": "Extract entities from the input text"},
        {
            "role": "user",
            "content": "The quick brown fox jumps over the lazy dog with piercing blue eyes",
        },
    ],
    response_format=EntitiesModel,
) as stream:
    for event in stream:
        if event.type == "content.delta":
            if event.parsed is not None:
                # Print the parsed data as JSON
                print("content.delta parsed:", event.parsed)
        elif event.type == "content.done":
            print("content.done")
        elif event.type == "error":
            print("Error in stream:", event.error)

    final_completion = stream.get_final_completion()
    print("Final completion:", final_completion)
```

In my use case, I need the model to output a complex table. Since generating the output takes a relatively long time, I want users to see the first row quickly rather than watching a loading progress bar until all tokens have been generated. After reviewing phidata's code, I noticed that enabling both streaming and structured output at the same time was intentionally disabled, which prompted me to raise this issue. Additionally, I noticed that phidata's implementation of structured output doesn't use the OpenAI SDK's response_format parameter. I'm curious about two things:
@balala8 can you share the agent config you built using Phidata? That would be more helpful for us.
Here's my code using phidata. I believe the main advantage of streaming output is that when the output is very long, users can see the first tokens quickly. Meanwhile, I also need structured output, so streaming and structured output should be supported simultaneously.

```python
from typing import List

from pydantic import BaseModel
from phi.model.openai import OpenAIChat
from phi.agent import Agent, RunResponse


class EntitiesModel(BaseModel):
    attributes: List[str]
    colors: List[str]
    animals: List[str]


if __name__ == "__main__":
    agent = Agent(
        model=OpenAIChat(id="gpt-4o-mini"),
        response_model=EntitiesModel,
        structured_outputs=True,
        debug_mode=True,
    )
    response = agent.run("What are the colors of the rainbow?", stream=True)
    print(response)
```

Additionally, I noticed that phidata's implementation of structured output doesn't directly use OpenAI's response_format. What are the differences between these two approaches? Would they affect performance? I would greatly appreciate your insights on this.
If structured output is required, would it be possible to support streaming by returning the raw str directly and deferring parsing until the stream is complete?
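A minimal sketch of that idea, independent of any particular framework: stream the unparsed text to the user as it arrives, then validate the accumulated string against the response model once the stream finishes. The `fake_stream` below is a stand-in for an LLM token stream (an assumption for illustration), and pydantic v2 (`model_validate_json`) is assumed.

```python
from typing import Iterator, List

from pydantic import BaseModel


class EntitiesModel(BaseModel):
    attributes: List[str]
    colors: List[str]
    animals: List[str]


def stream_then_parse(chunks: Iterator[str]) -> EntitiesModel:
    """Emit raw text chunks as they arrive, then validate the
    accumulated string once the stream is done."""
    buffer = []
    for chunk in chunks:
        print(chunk, end="")  # user sees partial output immediately
        buffer.append(chunk)
    print()
    return EntitiesModel.model_validate_json("".join(buffer))


# Simulated model output split into chunks (stands in for an LLM stream)
fake_stream = iter([
    '{"attributes": ["quick", "lazy"], ',
    '"colors": ["brown"], ',
    '"animals": ["fox", "dog"]}',
])
result = stream_then_parse(fake_stream)
```

The trade-off is that the streamed text is only *eventually* structured: consumers get raw JSON fragments during streaming and a validated object only at the end, which matches the "return str without parsing" suggestion above.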