Unhelpful error message if DCP dependency check fails

dotnet / aspire

An opinionated, cloud ready stack for building observable, production ready, distributed applications in .NET

https://learn.microsoft.com/dotnet/aspire

MIT License

3.41k stars 357 forks source link

Unhelpful error message if DCP dependency check fails #4691

Closed karolz-ms closed 1 week ago

karolz-ms commented 2 weeks ago

If the dependency check that uses dcp info command fails, with dcp returning non-zero code, an exception is thrown. The way we process this exception here https://github.com/dotnet/aspire/blob/main/src/Aspire.Hosting/Dcp/DcpDependencyCheck.cs#L105 cause all output of the dcp command to be discarded.

The result is the user only sees a message that says "dcp info returned non-zero exit code", which is not helpful:

We should instead include all the output from dcp in the log, so that the user is informed about why the dependency check failed.

JamesNK commented 2 weeks ago

Have you considered having dcp return specific error codes for certain situations? For example, return exit code X if Docker isn't healthy and then the host can check for known error codes and print a friendly error message to the host console. It would look better than an error type + message + stack trace.

The fallback for unknown exit codes would be what you suggested and include dcp output in the host console.

karolz-ms commented 2 weeks ago

Agreed, I do not think we want the stack trace here.

We could consider having specific error codes for common issues, but ultimately DCP should be able to tell the user what part of dependency check failed, and what the user might want to try to get themselves unblocked.

radical commented 2 weeks ago

Some hosting tests fail randomly on CI with Application orchestrator dependency check had an unexpected error System.TimeoutException: The operation has timed out. but no other information in the log. This issue - https://github.com/dotnet/aspire/issues/4640 - would also benefit from getting the details in the log.

davidfowl commented 2 weeks ago

Wait is this a dupe of https://github.com/dotnet/aspire/issues/4089?

cc @danegsta

JamesNK commented 2 weeks ago

It was thought to be fixed. But I got an unhelpful error message when testing locally with latest main source code.

JamesNK commented 2 weeks ago

FYI, this is the result from dcp info:

{"version":"0.5.4","commitHash":"a2453505df8d60a288a1fdb30004e11ca8854b22","buildTimestamp":"2024-06-20T22:49:08Z","containers":{"runtime":"docker","hostName":"host.docker.internal","installed":true,"running":false,"error":"Docker CLI timed out while checking status. Ensure Docker CLI is functioning correctly and try again."}}

Could it have been fixed in dcp but dcp wasn't yet updated on dotnet/aspire main?