Skip to content

Dagster+ Agent request timeouts #21597

May 2, 2024 · 3 comments · 4 replies
Discussion options

You must be logged in to vote

Solutions

TCP KEEPALIVE

When using the dagster-cloud-agent helm charts with a version ≥1.7.3, you can use the following options to enable probing of TCP connections to preemptively detect broken connections and reduce or eliminate the problem.

Note that setting these options also requires the dagster version for all of your code locations to also be 1.7.3 or later.

dagsterCloud:
  socketOptions:
    - ["SOL_SOCKET", "SO_KEEPALIVE", 1]
    - ["IPPROTO_TCP", "TCP_KEEPIDLE", 11]
    - ["IPPROTO_TCP", "TCP_KEEPINTVL", 7]
    - ["IPPROTO_TCP", "TCP_KEEPCNT", 5]

These values are relatively aggressive and will cause additional network packets to be set to probe and maintain the connection. You m…

Replies: 3 comments 4 replies

Comment options

mlarose
May 2, 2024
Collaborator Author

You must be logged in to vote
0 replies
Answer selected by mlarose
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
4 replies
@mlarose
Comment options

mlarose May 29, 2024
Collaborator Author

@cbini
Comment options

@cbini
Comment options

@mlarose
Comment options

mlarose Aug 2, 2024
Collaborator Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: troubleshooting Related to debugging and error messages deployment: k8s Related to deploying Dagster to Kubernetes Dagster+ Relates to Dagster paid plans
3 participants