-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow warmers to trigger hooks #322
Conversation
Hey @camilleryr! I've only glanced over this so far, I'll take a deeper look in future. Thank you for contributing! No worries about no corresponding issue, I don't really care; I just prefer that to save people doing unnecessary work. My initial thoughts (again, with only a quick glance over):
Yep makes sense. I'm not entirely a fan of calling via the name, I'm wondering if we can do something like
I'm not sure I'm a fan of this, it feels a bit awkward and unnatural to me. I understand there are cases where you might want this, but you could also manually implement this waiting without having to change it for everyone (and thus add the maintenance burden). I guess I'm curious what your hook is doing; it feels like you implicitly know a warmer is running on startup so I'm curious why you'd have to be notified about it?
Although I understand why you did it, I'm not a huge fan of |
I agree that this is a suboptimal solution, but I also though about the performance impact and had the same though about how frequently these are used! Im open to other approaches, but this seemed like the path of least resistance to ensuring that we would never call
I am generally sensitive to the potential for race conditions - so when approaching how to allow warmers to trigger hooks I wanted to guarantee that the fix would work as expected every time. Without some coordination between the overseer and incubator there would be a potential for the warmer to run and have the As far as what we are trying to achieve - We have a work load that would rely heavily on this cached data, and ideally we would not start the workload until the warmer has ran. We could utilize the
Lol - I thought the same thing, but went with the helper to follow what I thought of as a convention here - I can definitely pull that out and replace it with a explicitly defined module I understand if you dont think this warrants inclusion in cachex proper, but thought I would propose some changes since these pieces did not compose in the way I anticipated! Thanks for the review and I am looking forward to more of your thoughts. |
Most of this makes sense to me, I'll give it another look over and see if anything else comes up. It seems like the required changes basically boil down to this (correct me if I'm wrong, of course):
Then there's the third change regarding startup, but I'm not exactly sure what I think about this. Thinking out loud on this a bit more (disclaimer: kinda sick, and it's 1am, so feel free to tell me if I'm being stupid): I see the dilemma with the current call to Technically I haven't really looked at how this would appear in the codebase, but rather than I've marked this for v3.7.0 which should be in the next couple of weeks (waiting for some follow up on a couple of other threads just to tie those up). Hopefully this timeline works for you? We can definitely improve a lot of this even if we don't fully merge everything as-is right now. |
e4e778a
to
8dfa226
Compare
I think you are correct that its better to address the cause of the issue rather than to code around it! I have reimplemented this PR to try and have the This would not have been my preferred approach, as I think I would have put this in Additionally - I pulled out that Hope you feel better soon - and as always, let me know your thoughts on this and thank you for your time. |
Wanted to check in and see if you had any additional thoughts on this work |
Hi @camilleryr! I was actually planning on looking at this today, what strange timing. I can run through this now with my thoughts, then you can adapt as you see fit - then I'll probably get this merged in this weekend (maybe with some further changes from my side if necessary). I'll leave a quick review, and if you could resolve the conflicts (which don't seem to make sense to me...) it'd be great! I should note I'm also happy to pick this up if your time is limited, just let me know! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left a couple of comments on the initial changes.
That being said - in general I don't really like the new flow of hooks starting first before the main cache table.
I think I would prefer doing something where the warmers aren't automatically fired on startup, and are instead fired after the start_link
call on the line below here.
I think this makes this a lot easier, not much has to change beyond what we have - it's just some changes to warmers and then a tweak here to add an initial warming (which can be done via Cachex.warm/2
that I added recently).
Do you have any thoughts on this approach instead?
lib/cachex/warmer.ex
Outdated
@@ -99,11 +102,11 @@ defmodule Cachex.Warmer do | |||
|
|||
# set pairs without options | |||
{:ok, pairs} -> | |||
Cachex.put_many(cache, pairs) | |||
Cachex.put_many(name, pairs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should stay as state
when possible; using the name has overhead that the struct does not.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This change was included because there was a race condition caused by the use of the cache
here - we had perviously discussed that the performance impacts of this change were mostly likely irrelevant due the frequency of warmers - happy to try and address this in some other way if you have thoughs
Warmers should use the latest version of cache state.
- I definitely consider this one a bug!
- There's no good reason to not use the name
- The overhead is so small, it will only ever cause negative side effects (like this PR) by using the state
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, totally spaced on this one - the bug also sort of exists on L117, to a lesser extent. I think the best approach here is to wrap everything in the case
body in a Cachex.execute(name)
block, so that we have a copy of the latest state regardless.
That also means that we can purely store the warmer's state
and the timer ref as the server state, with just the cache name:
{ name, state, timer }
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note to self that this kinda sucks because provisions
don't work with warmers, only hooks. I don't think it should be changed in this PR, but I have filed #332 to follow up on to clean this type of thing up a bit more.
2291ea2
to
8af1324
Compare
lib/cachex/warmer.ex
Outdated
def warm(pid, opts) do | ||
if opts[:respect_async_warmers] do | ||
GenServer.call(pid, :cachex_warmer, :infinity) | ||
else | ||
send(pid, :cachex_warmer) | ||
end | ||
end |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need to be able to block for async: false
workers, so I added this api - if you would prefer we could move this if/else block to the Actions.Warm
module to avoid the additional function here.
lib/cachex/warmer.ex
Outdated
_from, | ||
{cache, warmer(async: async) = warmer, timer_ref} | ||
) do | ||
if timer_ref, do: Process.cancel_timer(timer_ref) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is an unrelated fix - but currently every time that Cachex.warm
is called we would stack timers causing additional runs of the warmer, by tracking and potentially canceling a timer, we should just reset the interval
whenever Cachex.warm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good catch on this one, I'm going to revisit this after merge - I can clean up the Cachex.warm/2
implementation a bit more.
I refactored this, hopefully its closer to what you were describing. With the exception of the updates to the |
lib/cachex/warmer.ex
Outdated
_from, | ||
{cache, warmer(async: async) = warmer, timer_ref} | ||
) do | ||
if timer_ref, do: Process.cancel_timer(timer_ref) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good catch on this one, I'm going to revisit this after merge - I can clean up the Cachex.warm/2
implementation a bit more.
lib/cachex/warmer.ex
Outdated
@@ -99,11 +102,11 @@ defmodule Cachex.Warmer do | |||
|
|||
# set pairs without options | |||
{:ok, pairs} -> | |||
Cachex.put_many(cache, pairs) | |||
Cachex.put_many(name, pairs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, totally spaced on this one - the bug also sort of exists on L117, to a lesser extent. I think the best approach here is to wrap everything in the case
body in a Cachex.execute(name)
block, so that we have a copy of the latest state regardless.
That also means that we can purely store the warmer's state
and the timer ref as the server state, with just the cache name:
{ name, state, timer }
lib/cachex.ex
Outdated
@@ -302,6 +302,7 @@ defmodule Cachex do | |||
{:ok, pid} = Supervisor.start_link(__MODULE__, cache, name: name), | |||
{:ok, link} = Informant.link(cache), | |||
^link <- Overseer.update(name, link), | |||
_ = warm(cache, respect_async_warmers: true), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
First instinct is already that this looks a lot better. Minor nitpick but would prefer Cachex.warm
rather than warm
to make it more obvious we're re-using the public API.
lib/cachex/warmer.ex
Outdated
@@ -99,11 +102,11 @@ defmodule Cachex.Warmer do | |||
|
|||
# set pairs without options | |||
{:ok, pairs} -> | |||
Cachex.put_many(cache, pairs) | |||
Cachex.put_many(name, pairs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note to self that this kinda sucks because provisions
don't work with warmers, only hooks. I don't think it should be changed in this PR, but I have filed #332 to follow up on to clean this type of thing up a bit more.
e41692e
to
a0d3717
Compare
Looks good, thanks @camilleryr! |
This will be shipped in the next release, which is looking like it will be v4.0.0. Your work here has raised a lot of ugliness with the existing warmer implementation, so I'm going to take this opportunity to refactor all of it before release - so thank you once again for your time and patience with all of this! |
A) Sorry that this didnt start with an issue - feel free to close this and we can move any discussion over there, but I put together this PR as a POC to ensure that this solved our issue on our application side
B) The issue we were running into is that we wanted to use a Hook to react to when a Warmer ran and found that Warmers did not trigger hooks - this issue seems to be caused by the fact that warmers are initialized with a cache spec before the hooks are associated to their pids, so the notification filters these out thinking that they are not running.
This PR addresses that particular issue by having the warmer call
put_many
with just the cache name instead of the cache spec so that the current state will always be used to avoid any possibility of a race condition between the warmer running and the hooks starting and being added to the Overseer's cache stateAdditionally, this PR rearranges the lifecycle of the warmer a bit to ensure that the first run will happen after any hooks have been started / attached so that notifications will not be missed
Please let me know if there are any questions / comments / changes you would like to see - and thank you for all of the work you do by maintaining this project