Remove `while True` in AgentController #5868

rbren · 2024-12-27T16:54:46Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
no changelog

Give a summary of what the PR does, explaining any non-trivial design decisions

The `while True` loop here is causing a lot of problems. It's easy for it to get stuck permanently.

Removing that loop is a surprisingly simple change: we just `_step` the agent whenever a new event comes in that it might react to. I haven't fully tested this yet, but at first blush it's working well.
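For readers skimming the thread, the event-driven idea can be sketched roughly as below. This is a toy illustration under stated assumptions, not the code in this PR: `ToyController`, the `kind` field, and the rule for which events are relevant are all hypothetical.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class ToyController:
    """Hypothetical stand-in for AgentController; not the real class."""

    steps_taken: list = field(default_factory=list)

    def should_step(self, event: dict) -> bool:
        # Assumption: only messages and observations warrant a step.
        return event.get("kind") in {"message", "observation"}

    async def on_event(self, event: dict) -> None:
        # Called once per incoming event. There is no polling loop,
        # so nothing runs (or gets stuck) between events.
        if self.should_step(event):
            await self._step(event)

    async def _step(self, event: dict) -> None:
        # A real step would ask the agent for its next action.
        self.steps_taken.append(event["kind"])


async def demo() -> list:
    controller = ToyController()
    for kind in ["message", "state_change", "observation"]:
        await controller.on_event({"kind": kind})
    return controller.steps_taken


result = asyncio.run(demo())
```

In this sketch the controller does work only in response to an event, so there is no loop that can spin or sleep forever.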

However, doing that surfaced a bunch of subtle issues with EventStream callbacks blocking each other, as well as main parts of the app. So now:

- All events are added to a queue, which is processed in order
- The queue is processed on its own thread
- Every subscriber gets its own thread, with an asyncio loop set up for it
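The three points above could be sketched like this. It is a minimal illustration, not the actual `EventStream` implementation: `ToyEventStream`, its method names, and the `None` shutdown sentinel are assumptions made for the example.

```python
import asyncio
import queue
import threading
from typing import Awaitable, Callable

Callback = Callable[[object], Awaitable[None]]


class ToyEventStream:
    """Hypothetical sketch; not the real EventStream API."""

    def __init__(self) -> None:
        self._queue: queue.Queue = queue.Queue()
        self._subscribers: dict = {}
        self._pending: list = []
        # The queue is drained on its own thread, in FIFO order.
        self._worker = threading.Thread(
            target=self._drain, name="queue-worker", daemon=True
        )
        self._worker.start()

    def subscribe(self, name: str, callback: Callback) -> None:
        # Each subscriber gets a dedicated thread running its own loop.
        loop = asyncio.new_event_loop()
        thread = threading.Thread(
            target=loop.run_forever, name=f"sub-{name}", daemon=True
        )
        thread.start()
        self._subscribers[name] = (loop, thread, callback)

    def add_event(self, event: object) -> None:
        # Synchronous and non-blocking for the caller.
        self._queue.put(event)

    def _drain(self) -> None:
        # iter(get, None) stops when the None sentinel is dequeued.
        for event in iter(self._queue.get, None):
            for loop, _thread, callback in list(self._subscribers.values()):
                future = asyncio.run_coroutine_threadsafe(callback(event), loop)
                self._pending.append(future)

    def close(self) -> None:
        self._queue.put(None)
        self._worker.join()
        for future in self._pending:
            future.result(timeout=5)  # wait for callbacks to finish
        for loop, thread, _callback in self._subscribers.values():
            loop.call_soon_threadsafe(loop.stop)
            thread.join()
            loop.close()


seen = {"fast": [], "slow": []}
threads = {}


async def fast(event: object) -> None:
    threads["fast"] = threading.current_thread().name
    seen["fast"].append(event)


async def slow(event: object) -> None:
    threads["slow"] = threading.current_thread().name
    seen["slow"].append(event)
    await asyncio.sleep(0.05)  # does not hold up the "fast" subscriber


stream = ToyEventStream()
stream.subscribe("fast", fast)
stream.subscribe("slow", slow)
for n in (1, 2, 3):
    stream.add_event(n)
stream.close()
```

Because each callback is scheduled onto its subscriber's own loop, a slow subscriber delays neither the queue worker nor the other subscribers, which is the blocking problem the description mentions.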

Link of any specific issues this addresses

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:86e5632-nikolaik   --name openhands-app-86e5632   docker.all-hands.dev/all-hands-ai/openhands:86e5632

enyst · 2024-12-27T19:52:16Z

openhands/controller/agent_controller.py

-                # no need to check too often
-                await asyncio.sleep(1)
-            else:
+            if self.delegate.get_agent_state() != AgentState.PAUSED:
                await self._delegate_step()


Please feel free to ignore this, since it's a draft; just an FYI for when you get to test delegates: this whole bit with delegates has become pointless, because currently the parent is unsubscribed when the delegate starts and the delegate is subscribed, and when the delegate finishes, the parent subscribes again. 🤔

ugh this needs some love 😄

Are we still using delegation for the browser?

No, we have some code, but it's not in use. That tool is not defined. The two actions that CodeAct has, for browsing (interactive and text-only), are good. The second is crazy good. 😅

FWIW, using CodeAct as a delegate (of another "planner" CodeAct) is a thing that has been tried, more than once, and in that example it has worked decently. IMO that PR was practically mergeable at some point. If it is tried again, I think it would be nice to be able to support it. Hey, I'm just adding a data point, in case you contemplate killing it! 😇

The old agents still work, including delegator. (Pretty amazing, considering they, erm, haven't really been tested for a while afaik. 🙊)

Yeah I had high hopes for delegation when we put it here! I still think it could be great, but I don't love the handshake we currently have

enyst · 2024-12-27T23:48:54Z

openhands/controller/agent_controller.py

        elif isinstance(event, Observation):
            await self._handle_observation(event)
+            self.step()


Does this include observations like AgentChangeState or something else that isn't supposed to interest the agent? It seems like it does.
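One way to address this concern is to filter by observation type before stepping. The sketch below is only an illustration: the class names (`StateChangedObservation` in particular) and the `should_step` helper are hypothetical and may not match the real event classes.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """Hypothetical base class for this sketch."""

    content: str = ""


class CmdOutputObservation(Observation):
    """Something the agent should react to."""


class StateChangedObservation(Observation):
    """Bookkeeping only; assumed to be of no interest to the agent."""


# Observation subclasses that should never trigger a step (assumption).
IGNORED_OBSERVATIONS = (StateChangedObservation,)


def should_step(event: object) -> bool:
    # A bare isinstance(event, Observation) check matches every subclass,
    # so bookkeeping observations have to be excluded explicitly first.
    if isinstance(event, IGNORED_OBSERVATIONS):
        return False
    return isinstance(event, Observation)


decisions = [
    should_step(CmdOutputObservation("ls output")),
    should_step(StateChangedObservation("paused")),
    should_step("not an event"),
]
```

With a check like this, the unconditional `self.step()` after `_handle_observation` would become conditional on `should_step(event)`.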

enyst · 2024-12-28T00:06:24Z

openhands/events/stream.py

-            # No event loop running...
-            asyncio.run(self._async_add_event(event, source))
-
-    async def _async_add_event(self, event: Event, source: EventSource):


If I understand correctly, we're back to a sync add_event? I do like it! I'm not sure why it had switched again 😅

enyst · 2024-12-28T00:31:01Z

openhands/controller/agent_controller.py

-            except Exception as e:
-                traceback.print_exc()
-                self.log('error', f'Error while running the agent: {e}')
-                await self._react_to_exception(e)


Just a thought about something I don't see at a cursory look (on the GitHub interface):

Do we still send a status message to the UI on exception?

rbren added 12 commits December 27, 2024 09:29

first pass

17e9d1d

add logs

91750d5

Merge branch 'main' into rb/controller-loop

4f601b1

add logs

a563b04

remove debug

9200520

sleep to give control

e4ae68a

remove debug

017b206

logspam

0e6d123

remove spam

7b15214

logspam

edda57e

Merge branch 'main' into rb/controller-loop

48d9f9c

remove threading

b6b1977

enyst reviewed Dec 27, 2024


rbren added 10 commits December 27, 2024 16:35

sync event callbacks

8dbd05d

remove print

cd4d588

remvoe sleep

afe3a9a

fix on_events

36b8b52

es queue

47a2526

debug info

99b7fef

give each subscriber its own thread

4e3d88f

everything working

9cfe028

logspam

4086da6

should_continue

df8f685

rbren marked this pull request as ready for review December 27, 2024 23:37

enyst reviewed Dec 27, 2024


enyst reviewed Dec 28, 2024


fix: call on_event directly in security tests

86e5632

enyst reviewed Dec 28, 2024
