-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multiple issues with Service Fabric Runtime #10920
Comments
Hi @scale-tone , we're looking into this issue , we will update on it ASAP. thank you ! |
Due to deployment issues in some of our VMs, windows 2022 image versions are inconsistent. |
Hello @scale-tone , Could you please check if you are still facing version mismatch. |
Oh, yes. The agent's version is now Yet Service Fabric Runtime's version is still the same faulty one: And predictably it fails in the same way. @subir0071 , what's the ETA for fixing this issue? |
This image is from the runner windows-2022 runner image VM control panel - This screenshot is from windows-2022 runner image from the fabrichost.exe properties - Please execute the below workflow to display the Service Fabric version via GHA -
I execute the same and got this same output. |
@subir0071 , we are not using GitHub Actions. As I said, we're using a dedicated internal agent pool (maintained by our sister team) in Azure DevOps. |
Thanks for your quick response. My apologies for the confusion - However, let me check some more aspects around it. |
@subir0071 , look, in your test you're getting image version 20241113.3.0. |
It seems the agent pool for Azure DevOps that you are using is not being maintained by Azure. Kindly check with the agent pool administrator (the sister team). |
A very unique kind of issue... Will you be able to check if there is any other tools/packages are getting updated? |
Thanks @mmrazik for the analysis on this issue. |
@subir0071 feel free to close this issue. I clarified this with @scale-tone 1:1. |
Description
We're running integration tests on a Service Fabric dev cluster provisioned on an Azure DevOps build pipeline.
We're using internal Windows Server 2022-based agent pool.
Everything worked until this Saturday 02.11.2024.
Before that we were getting this image: 20240922
Since Saturday we started getting this image: 20241021
Starting from Saturday the dev cluster fails to reach healthy state, due to the failing FaultAnalysisService (which is a non-configurable part of Service Fabric runtime). We don't have any visibility into why exactly it is failing.
This repo says we should be having this (two years old) version of Service Fabric runtime: 9.1.1436.9590.
That is not the case: the actual Service Fabric runtime, that now appears on our agents is this (two months old) one: 9.1.2718.9590. We established that by dumping FabricHost.exe from an agent.
We're not able to prove or disprove that SF runtime version is the actual culprit (because we cannot travel back in time to try the previous one - we're always getting the latest agent image, and cannot control its version), but it looks highly likely.
Question1: can there be any workaround for our failing SF cluster? E.g. maybe there's a way to override SF runtime version to be used? (Just remember that SF runtime installer requires root privileges, therefore just running it as part of the pipeline does not work).
Question2: why this repo's change history does not reflect the actual picture, and can this be fixed?
Question3: is there a chance to have SF runtime updated on the agent image? I cannot say which exact version it needs to be updated to (since we have no way to try them out), but maybe just to revert it to the previous, stable one?
Platforms affected
Runner images affected
Image version and build link
20241021
Is it regression?
yes
Expected behavior
SF dev cluster starts and successfully goes into healthy state on a build agent
Actual behavior
SF never reaches healthy state (waited for up to 1 hour)
Repro steps
The text was updated successfully, but these errors were encountered: