Open emmet-m opened 3 years ago
One more thing - I've had service fabric running successfully before until today. I haven't run our project for a while (maybe a few weeks, a month) and all of a sudden it didn't work.
Also possible related: #1227
I have the exact same issue, both Windows 10 Pro and Windows 11 The closest I've got is in Windows events
Sorted! I was running a 5 node. I ran clean cluster then configured a 1 node which came up without problem. I then switched to a 5 node and all good. I'm thinking there was some remnants of the old version cluster definition.
Unfortunately this didn't fix it for me. I've switched a few times between 5 and 1 node configs after cleaning up the old cluster and it didn't fix anything.
I'm having this same issue. It had been working recently - possibly 4 or 5 days ago. A few notes...
I had a similar issue a few months ago and it pointed to my VM having control issues with host CPU resources. I had to ensure the host was setup with EUFI and that AMD SVM was enabled in the BIOS. I had to install Win11 in order to change the resource control setting from root to core in Hyper-V. I believe it was only available in VM version 9+. Some of those details are fuzzy right now.
It was all working great until I tried earlier today. I retraced my steps from above and it's still not connecting. I'm hoping I just missed something.
ugh, I am having exactly same issue now. And it is just suddenly showing up. I was able to run everything about 3 weeks ago. But today, Service Fabric just refused to run. Done everything: reboot, reinstall, cleanup, etc. Nothing helps.
This is frustrating.
I had what sounds like a very similar issue and tried all sorts of debugging attempts. Eventually I found this comment in another local cluster-related thread: https://github.com/microsoft/service-fabric/issues/382#issuecomment-542356378
There it recommended trying to run the fabric host process directly using 'FabricHost.exe -c', which runs it in console mode, and for me it popped up a couple windows and showed my problem:
System Error : The code execution cannot proceed because MSVCR110.dll was not found. Reinstalling the program may fix this problem.
This file is from the Microsoft Visual C++ Redist 12: https://www.microsoft.com/en-us/download/details.aspx?id=30679
Once I executed that installer and selected "repair", I was able to create my local cluster again.
Hope this helps.
Robert
@rwardms While that doesn't seem to have fixed my problem, that did help me make a lot of progress, thank you!
I went to C:\Program Files\Microsoft Service Fabric\bin
and ran .\FabricHost.exe -c
, which opened up a console that filled with logs/error message before closing a second later. This console kept reappearing and disappearing every second or so with the same logs. Eventually I managed to click the window fast enough, which paused the console, and mashing CTRL+A allowed one log to be printed out line by line. The most amount of logs I could get was this:
FabricSetup.exe invoked with arguments (C:\Program Files\Microsoft Service Fabric\bin\Fabric\Fabric.Code\FabricSetup.exe /operation:addnodestate). Current Exe version 8.1.321.9590
Environment information Data Root C:\SfDevCluster\Data, Log Root C:\SfDevCluster\Log
Starting service eventlog
Starting service pla
Starting FolderACLManager::Install
Obtained exclusive file C:\SfDevCluster\Data\daclupdate.lock
Released exclusive file C:\SfDevCluster\Data\daclupdate.lock
Directory:C:\SfDevCluster\Data has been updated with ACL (Account|Sid) ServiceFabricAdministrators|S-1-5-21-2662639430-4145260883-3466995100-1016 ServiceFabricAllowedUsers|S-1-5-21-2662639430-4145260883-3466995100-1017
Obtained exclusive file C:\SfDevCluster\Log\daclupdate.lock
Released exclusive file C:\SfDevCluster\Log\daclupdate.lock
Directory:C:\SfDevCluster\Log has been updated with ACL (Account|Sid) ServiceFabricAdministrators|S-1-5-21-2662639430-4145260883-3466995100-1016
FolderACLManager::Install successful
Starting EventTraceInstaller::Install
EventTraceInstaller::Install successful
Starting CrashDumps::Install
CrashDumps::Install successful
Starting DriverInstallManager::Install
SFVolumeDiskService is not enabled (OnInstall).
Stopping Driver: LeasLayr.
Stopping Driver: KtlLogger.
DriverInstallManager::Install successful
Starting FabricDeployer::Install
CreateProcess Successful for CommandLine:FabricDeployer.exe. ProcessId:19316 MainThreadId:22768 ProcessHandle:24c
Configuration Deployment failed with error 0xffffffff
FabricDeployer::Install failed with error 0xffffffff
FabricDeployer::Install failed with error 0xffffffff, Rolling back
Starting FabricDeployer::Uninstall
CreateProcess Successful for CommandLine:FabricDeployer.exe /operation:Rollback. ProcessId:19484 MainThreadId:4948 ProcessHandle:248
FabricDeployer::Uninstall successful
Starting DriverInstallManager::Uninstall
SFVolumeDiskService is not enabled (OnUninstall).
Stopping Driver: LeasLayr.
Stopping Driver: KtlLogger.
DriverInstallManager::Uninstall successful
Starting CrashDumps::Uninstall
Reset crash dump location to default
CrashDumps::Uninstall successful
Starting EventTraceInstaller::Uninstall
EventTraceInstaller::Uninstall successful
S
As you can see, there's 3 lines in the middle that indicate some kind of failure:
Configuration Deployment failed with error 0xffffffff
FabricDeployer::Install failed with error 0xffffffff
FabricDeployer::Install failed with error 0xffffffff, Rolling back
When I open up Event Viewer, look under the "Error" tab I see 3 Service Fabric sections:
The first section just contains the two messages "Install failed with error 0xffffffff" and "Install failed with error 0xffffffff, Rolling back" 30 times each, the second one has the message "Kernel crash upload is configured but failed to get kernel crash dump folder." 12 times, and the last one has this message:
System.IO.FileNotFoundException: Could not find file 'C:\Users\emmurra\AppData\Local\Temp\EM-DESKTOP-Server-ScaleMin.xml'.
File name: 'C:\Users\emmurra\AppData\Local\Temp\EM-DESKTOP-Server-ScaleMin.xml'
at System.IO.__Error.WinIOError(Int32 errorCode, String maybeFullPath)
at System.IO.FileStream.Init(String path, FileMode mode, FileAccess access, Int32 rights, Boolean useRights, FileShare share, Int32 bufferSize, FileOptions options, SECURITY_ATTRIBUTES secAttrs, String msgPath, Boolean bFromProxy, Boolean useLongPath, Boolean checkHost)
at System.IO.FileStream..ctor(String path, FileMode mode, FileAccess access, FileShare share)
at System.Fabric.FabricDeployer.XmlHelper.ReadXml[T](String fileName, String schemaFile)
at System.Fabric.FabricDeployer.DeploymentParameters.CreateFromFile()
at System.Fabric.FabricDeployer.CommandLineInfo.Parse(String[] args)
at System.Fabric.FabricDeployer.Program.Main(String[] args)
... Bingo? (EM-DESKTOP
is my PC name). Navigating to C:\Users\emmurra\AppData\Local\Temp\
shows me that the file above is indeed not there, and there is no *-Server-ScaleMin.xml
of any kind either (one of the above fixes involved changing my computer name, which I thought might be a problem).
I can't think of anything else to debug this anymore, but I think the Service Fabric team has enough to at least investigate this bug... Please feel free to email/ping me (emmurra) if you need more logs/info/want to run a share screen session.
ah, I actually forgot I left a comment here. My issue was resolved after finding this post: https://stackoverflow.com/a/38073418/598562
It worked twice on my machine. But I don't understand why we have to do that. lol
@sowenzhang this worked for me as well.
Describe the bug I cannot successfully setup any cluster from my machine, either from the taskbar menu or from powershell. Whenever I run the script, for either 1 node or 5 nodes, I get the following output:
Note that this takes a LONG time, setting up a one node cluster from the task bar menu on my old laptop takes much less time (and succeeds) compared to my dev box which is much more powerful.
See copy of log file pasted below.
To fix this, I've tried the following solutions, all of which have failed:
Area/Component: SDK
To Reproduce I was unable to reproduce this on another machine... I have no idea what's causing it
Expected behavior The cluster to be successfully set up
Service Fabric Runtime Version:
Environment:
Verbatim log file (C:\SfDevCluster\Log\DevClusterSetup.log)
Assignees: /cc @microsoft/service-fabric-triage