Open Kyle-Verhoog opened 12 hours ago
CODEOWNERS
have been resolved as:
tests/contrib/langchain/test_langchain_community.py @DataDog/ml-observability
tests/contrib/langchain/test_langchain_llmobs.py @DataDog/ml-observability
Benchmark execution time: 2024-11-22 19:41:32
Comparing candidate commit 971dad7b34d58e3d2d55ee3fb17e5891693a4630 in PR branch kylev/flaky-tests-should-be-shot-into-the-sun
with baseline commit d792c3dc3c7452ed64524ea38d0b9c9116330a73 in branch main
.
Found 0 performance improvements and 0 performance regressions! Performance is the same for 388 metrics, 2 unstable metrics.
There appear to be stability issues with using snapshots and/or LangChain in general.
There are failures in the mocked tests that look like:
as well as failures with snapshot based tests:
While we investigate a more stable method of testing it makes sense to disable the tests to avoid noise to our neighbours in the library :).
DOWN WITH FLAKY TESTS
Checklist
Reviewer Checklist