Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compute helm chart. Which services must be run? #94

Closed
roman-shakudo opened this issue Aug 7, 2024 · 4 comments
Closed

Compute helm chart. Which services must be run? #94

roman-shakudo opened this issue Aug 7, 2024 · 4 comments

Comments

@roman-shakudo
Copy link

roman-shakudo commented Aug 7, 2024

Hi

I managed to run neon on my k8s cluster based on #35
Image

I see pageserver and three safekeepers running
I had to force using k8s service dns as BROKER_ENDPOINT and S3_ENDPOINT because default values are not correct

Since I was changing charts manually pageserver and safekeeper added dependecies which were run as dependant charts

Storage controller is falling with Error: loading public key. I suppose JWT issue
Storage Scrubber's image is not available

Questions:

  1. Am I understand right that we have to wait for Compute chart? AFAIK I can't use any posgtres
  2. Which services are essential to run neon? I see a lot of various services and I do not know which are core. It would great to add some doc about it
  3. How can deployment be tested?
  4. Can I pack Compute from docker-compose.yaml in neon as a helm cart and use it or this image is just for "testing"?
  5. Am I understand right that there is no UI in the helm charts?
@roman-shakudo
Copy link
Author

roman-shakudo commented Aug 8, 2024

From docs I could figure out that the core services are

  • pageserver
  • safekeeper
  • compute
  • storage broker
  • storage controller
  • minio (for local or k8s)

Have no idea what others are

  • pn-sni-router
  • proxy
  • storage-scrubber

And the whole architecture is kinda like that

Image

@GangWangAI
Copy link

GangWangAI commented Nov 6, 2024

I wrote a compute chart that runs normally for about 6 minutes before crashing and automatically restarting. The error log shows:

PG:2024-11-05 09:48:58.052 GMT [69] PANIC:  [WP] collected propEpochStartLsn 0/1537150, but basebackup LSN 0/14EE288
PG:2024-11-05 09:48:58.053 GMT [43] LOG:  background worker "WAL proposer" (PID 69) was terminated by signal 6: Aborted
PG:2024-11-05 09:48:58.053 GMT [43] LOG:  terminating any other active server processes

Did you run it successfully afterwards? @roman-shakudo

@roman-shakudo
Copy link
Author

roman-shakudo commented Nov 22, 2024

GangWangAI
I did not run it
Somewhere I found information that k8s chart is not priority right now and was told to use cloud one =)
So I skipped it till it be done

Related Compute node
Did you apply patches neondb team created for a potstgres mentioned here?

Did you see that Safekeepr node receives and process WAL from Compute service?

@jcsp
Copy link
Contributor

jcsp commented Dec 16, 2024

We'e clarified the README to explain that the helm charts are currently not meant to bring up a full working system: they just represents the components where we at Neon use helm at present: #119

I think this issue refers to safekeeper/pageserver patches from #35, so I'm going to close this one in favor of that one -- let's use that one as a place for folks to collaborate when they would like to try bringing up fully working systems via helm

@jcsp jcsp closed this as completed Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants