Closed jaredpar closed 4 years ago
This is now regularly blocking CI @karelz
Build | Pull Request | Test Failure Count | Date |
---|---|---|---|
#584646 | Rolling | 1 | 2020/4/1 |
#584991 | Rolling | 1 | 2020/4/1 |
#585184 | Rolling | 2 | 2020/4/2 |
Build | Pull Request | Console | Core | Test Results | Run Client |
---|---|---|---|---|---|
#584646 | Rolling | console.log | | testResults.xml | run_client.py |
#584991 | Rolling | console.log | | testResults.xml | run_client.py |
#585184 | Rolling | console.log | | testResults.xml | run_client.py |
#585184 | Rolling | console.log | | testResults.xml | run_client.py |
runfo tests -d runtime -c 100 -pr -n System.Net.NameResolution.Functional.Tests -m
@MihaZupan can you please take a look at this new failure? Kusto database mining shows the first and only failure 2 days ago (3/31). Disabling the test to unblock CI may be the best first response. cc @alnikola
Build | Pull Request | Test Failure Count |
---|---|---|
#585813 | Rolling | 1 |
#586664 | Rolling | 1 |
#587083 | Rolling | 1 |
Build | Pull Request | Console | Core | Test Results | Run Client |
---|---|---|---|---|---|
#585813 | Rolling | console.log | | testResults.xml | run_client.py |
#586664 | Rolling | console.log | | testResults.xml | run_client.py |
#587083 | Rolling | console.log | | testResults.xml | run_client.py |
This is causing roughly a 3% failure rate at this point. Do we have an ETA for when this will be disabled?
Sorry for that, I thought @MihaZupan had a chance to do it overnight. PR is up - see #34527
From the console logs here I am seeing the following tests failing:
DnsGetHostEntry_LocalHost_ReturnsFqdnAndLoopbackIPs
DnsObsoleteGetHostByName_EmptyString_ReturnsHostName
DnsObsoleteBeginEndGetHostByName_EmptyString_ReturnsHostName
Dns_GetHostEntry_HostString_Ok
Dns_GetHostEntryAsync_HostString_Ok
Which looks like the 4 mentioned in https://github.com/dotnet/runtime/issues/1488 + now DnsGetHostEntry_LocalHost_ReturnsFqdnAndLoopbackIPs
This may be a misconfigured machine. Also, SLES12 uses systemd, and that resolver can synthesize a response. I think we should collect more system info on failure. If we have helper diag functions, that may be helpful for other DNS test failures.
@wfurt I think you are right because the test always fails when it's run on an sles.12.amd64.open agent with machine name 'localhost' which looks weird.
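A minimal sketch of the kind of diagnostic helper suggested above, written in Python for brevity and not taken from the test suite. The names `is_suspicious_hostname` and `collect_dns_diagnostics` are hypothetical, and the set of "suspicious" names is an assumption based only on the 'localhost' machine name reported here.

```python
import socket


def is_suspicious_hostname(name: str) -> bool:
    """Return True if the machine name looks like a loopback placeholder.

    The set of names below is an assumption for illustration only.
    """
    normalized = name.strip().lower().rstrip(".")
    return normalized in {"", "localhost", "localhost.localdomain"}


def collect_dns_diagnostics(hostname=None, resolver=socket.gethostbyname_ex):
    """Gather host-name details worth logging when a DNS test fails.

    `resolver` is injectable so the helper can be exercised without a network.
    """
    name = socket.gethostname() if hostname is None else hostname
    info = {"hostname": name, "suspicious": is_suspicious_hostname(name)}
    try:
        # gethostbyname_ex returns (canonical name, alias list, address list).
        canonical, aliases, addresses = resolver(name)
        info.update(canonical=canonical, aliases=aliases, addresses=addresses)
    except OSError as exc:  # socket.gaierror and socket.herror derive from OSError
        info["error"] = str(exc)
    return info
```

Calling `collect_dns_diagnostics()` from a failure handler and logging the returned dictionary would record whether the agent's machine name was a loopback placeholder at the time of the failure.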
The failures were caused by a Helix infra issue which was resolved yesterday.
@alnikola did we re-enable the tests?
Was it the misconfiguration with localhost?
@alnikola
> The failures were caused by a Helix infra issue which was resolved yesterday.
What Helix issue was this? I'm looking for an issue in core-eng or arcade to link to. If they didn't create one for this problem we should push them to do so.
Issues are the primary way we track reliability between our services. If there is a bug in Helix, Azure, etc. that impacted our reliability, we should push to make sure that there is an issue tracking that. Always feel free to include me in the convo to help with this if needed.
The issue was closed by mistake without re-enabling the affected test. Will do it shortly.
@alnikola
Where is the Helix issue that describes the bug that they fixed? That is what I'm interested in. If they're not filing bugs then it's issues we're not tracking. That means we can't track improvements.
@jaredpar I reported the issue with a strange agent name ('localhost') to the engineering services team and they said it's a known issue which has already been fixed. So, I don't have a link. Will ping you offline for the details.
Only seen one failure so far but also suspicious that this showed up in CI