-
-
Notifications
You must be signed in to change notification settings - Fork 382
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Agent stops taking jobs after server throws 5XX errors #4446
Comments
Does it work if you deploy an agent in Kubernetes (direct Agent-Server connection, not via Traefik)? JFYI, that is my apiVersion: traefik.io/v1alpha1
kind: IngressRoute
metadata:
name: woodpecker-server
spec:
entryPoints:
- websecure
routes:
- kind: Rule
match: Host(`wp.domain.tld`)
services:
- name: woodpecker-server
port: http
- kind: Rule
match: Host(`wp.domain.tld`) && Headers(`Content-Type`, `application/grpc`)
services:
- name: woodpecker-server
port: grpc
scheme: h2c However, I didn't restarted the server, if I remember correctly. |
The kubernetes-agents work fine and are not affected by the problem. It is very likely that the 5XX errors come from Traefik mainly. However I would also expect the agent to not poop itself when there are errors for a few seconds. Matching the application type is a good hint, I might implement this. I currently don't use IngressRoute objects and instead configure normal Ingresses with annotations. |
I think so and I had this.
Seems, it is trying. Do you have 2 ingresses: one for HTTP, another for gRPC? Could you show HTTP one? |
Accidentally added the label. Can't remove it anymore :/ |
Component
agent
Describe the bug
When the server (running in kubernetes) restarts my docker agent refuses to take new jobs until restarted. In the agent logs I can see several 5XX Errors while the server reboots. After that the agent shows as online in the UI but does not take jobs.
Agent logs: See below
Steps to reproduce
Expected behavior
The agent should properly reconnect to the Server via gRPC after the server restarts.
System Info
Server:
{"source":"https://github.com/woodpecker-ci/woodpecker","version":"2.7.3"}
Helm values:
gRPC Ingress:
docker-compose config for agent:
Additional context
Agent logs:
Validations
next
version already [https://woodpecker-ci.org/faq#which-version-of-woodpecker-should-i-use]The text was updated successfully, but these errors were encountered: