
Support private processing by keeping data encrypted for rented servers #707

daniel31x13 opened this issue Feb 17, 2025 · 4 comments

@daniel31x13

Hey, not sure if this is already a thing, but I was wondering whether some form of "homomorphic encryption" could be added, so that whatever is sent to the cluster (and processed there) stays completely encrypted, with the client decrypting the results.

The idea is that your prompt is sent to a rented remote cluster (a bunch of rented servers) in encrypted form; the servers process the encrypted prompt and produce encrypted results, which the client then decrypts.

You might consider implementing it similarly to Tor's onion routing, where the data can be recovered only by peeling every layer of encryption. That way, unless someone controls every machine, the data remains secure.
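A toy sketch of that layered idea (illustration only: XOR with a hash-derived keystream stands in for real ciphers, and key distribution between client and nodes is hand-waved):

```python
# Toy "onion" layering sketch (NOT real cryptography): the client wraps the
# payload in one layer per node; each node can peel exactly its own layer,
# so the plaintext appears only after ALL layers are removed.
import hashlib

def keystream(key: bytes, length: int) -> bytes:
    # Expand a key into a pseudorandom byte stream via SHA-256 in counter mode.
    out = b""
    counter = 0
    while len(out) < length:
        out += hashlib.sha256(key + counter.to_bytes(8, "big")).digest()
        counter += 1
    return out[:length]

def xor_layer(data: bytes, key: bytes) -> bytes:
    # XOR is its own inverse, so the same call wraps and peels a layer.
    return bytes(a ^ b for a, b in zip(data, keystream(key, len(data))))

node_keys = [b"node-A", b"node-B", b"node-C"]  # one shared key per hop (hypothetical)

prompt = b"what is the capital of France?"
onion = prompt
for key in reversed(node_keys):                # client wraps: innermost layer last
    onion = xor_layer(onion, key)

peeled = onion
for key in node_keys:                          # each node peels one layer
    peeled = xor_layer(peeled, key)

assert peeled == prompt                        # plaintext only after every layer
```

Note this only hides the data in transit; it does not let a node *compute* on the encrypted prompt, which is what homomorphic encryption is for.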

Btw as an open-source project maintainer, I wanted to thank you and all the devs supporting this project!

@AlexCheema (Contributor) commented Feb 18, 2025

Hey - we did some work on homomorphic encryption for private search here: https://blog.exolabs.net/day-8
The main problem with doing it for model inference, as you describe, is the massive overhead.
It works for private search because we use a narrow, linearly homomorphic encryption scheme that is fast and parallelizable.
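For context, "linearly (additively) homomorphic" means anyone can add encrypted values, and scale them by plaintext constants, without ever decrypting. A toy Paillier-style sketch of that property (a standard additively homomorphic scheme, shown purely as an illustration; it is not necessarily the scheme exo uses, and real keys would use ~1024-bit primes):

```python
# Toy Paillier cryptosystem demonstrating linear homomorphism:
# Enc(m1) * Enc(m2) decrypts to m1 + m2, and Enc(m)^k decrypts to k*m.
import math, random

p, q = 1009, 1013             # tiny demo primes (insecure; for illustration only)
n = p * q
n2 = n * n
lam = math.lcm(p - 1, q - 1)  # Carmichael function of n
g = n + 1                     # standard choice of generator

def L(u):                     # L(u) = (u - 1) / n, defined when u = 1 mod n
    return (u - 1) // n

mu = pow(L(pow(g, lam, n2)), -1, n)   # decryption constant

def encrypt(m):
    r = random.randrange(2, n)
    while math.gcd(r, n) != 1:        # randomness must be invertible mod n
        r = random.randrange(2, n)
    return (pow(g, m, n2) * pow(r, n, n2)) % n2

def decrypt(c):
    return (L(pow(c, lam, n2)) * mu) % n

def add(c1, c2):              # homomorphic addition of two ciphertexts
    return (c1 * c2) % n2

def scale(c, k):              # homomorphic multiplication by a plaintext constant
    return pow(c, k, n2)

c = add(scale(encrypt(3), 5), encrypt(4))   # compute 5*3 + 4 under encryption
print(decrypt(c))                           # 19
```

The overhead point follows from this: each encrypted operation costs big-integer modular exponentiations, which is tolerable for narrow linear workloads like private search but prohibitive for the nonlinear, billions-of-operations workload of LLM inference.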

@daniel31x13 (Author)

Got it. Thanks for the reply!

Just to confirm, does each machine in the cluster know what prompt it’s processing and what the final output will be?

I’m wondering about the possibility of hosting our machines in different data centers (for example, one on AWS, one on DigitalOcean, and one on Linode) while ensuring that the data remains private. The idea is that privacy would be maintained unless someone gains access to all the servers simultaneously, rather than just one.

Would the overhead issue still persist in this scenario as well?

@AlexCheema (Contributor)

> Got it. Thanks for the reply!
>
> Just to confirm, does each machine in the cluster know what prompt it’s processing and what the final output will be?
>
> I’m wondering about the possibility of hosting our machines in different data centers (for example, one on AWS, one on DigitalOcean, and one on Linode) while ensuring that the data remains private. The idea is that privacy would be maintained unless someone gains access to all the servers simultaneously, rather than just one.
>
> Would the overhead issue still persist in this scenario as well?

Yeah, currently each node knows the prompt. We propagate the prompt to each node for debugging / to show the user the prompt on each machine. We could of course remove that.

At the very least each node needs the intermediary embeddings, which contain information about the prompt. Each node knowing just this would be a weak form of privacy. I don't know of any low-overhead approach for collaborative inference with privacy between nodes. This would be an interesting research direction to explore.
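To make the "intermediary embeddings" point concrete, here is a hypothetical two-node pipeline sketch (the model, weights, and split point are invented for illustration and are not exo's actual architecture): the second node never receives token IDs, only the first node's hidden activations, yet those activations still encode information about the prompt.

```python
# Toy pipeline-parallel inference: node A embeds the tokens and runs the
# first layer; node B runs the second layer. Node B sees only intermediate
# activations, never the prompt tokens themselves -- a weak form of privacy.
import numpy as np

rng = np.random.default_rng(0)
vocab, d = 100, 8
embed = rng.normal(size=(vocab, d))   # embedding table, held by node A
w_a = rng.normal(size=(d, d))         # node A's layer weights
w_b = rng.normal(size=(d, d))         # node B's layer weights

def node_a(token_ids):
    # Node A sees the raw prompt tokens.
    return np.tanh(embed[token_ids] @ w_a)

def node_b(hidden):
    # Node B sees only the intermediary embeddings sent over the wire.
    return np.tanh(hidden @ w_b)

prompt_tokens = [12, 7, 42]
out = node_b(node_a(prompt_tokens))
print(out.shape)                      # (3, 8): one hidden state per token
```

The catch the comment describes: because `node_a` is a known, deterministic function, an attacker on node B could try candidate prompts against it and match activations, so hiding the token IDs alone is not strong privacy.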

@daniel31x13 (Author)

Just messaged you on Discord.
