confident-ai / deepeval

The LLM Evaluation Framework
https://docs.confident-ai.com/

Asynchronous test runs are sometimes not completed correctly #1147

Open jmaczan opened 5 days ago

jmaczan commented 5 days ago

**Describe the bug**
When running evaluate() with run_async=True, tests sometimes never complete, so any job/task/pipeline that relies on the exit code will fail. Results are neither printed nor emitted; the evaluation is essentially stuck after running the last test case. It does not happen every time and shows no regular pattern, so it looks like a race condition. The issue is likely in the async logic in evaluate.py: a_execute_test_cases(), get_or_create_event_loop(), or loop.run_until_complete().

It might be that await asyncio.sleep(throttle_value) leaves a semaphore stuck, or something along those lines. I haven't debugged it beyond a brief static code analysis, though; see the sketch below for the kind of deadlock I suspect.
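To make that suspicion concrete, here is a generic asyncio sketch of the failure mode. It is not deepeval's actual code: the semaphore, worker, and throttle_value below are stand-ins for whatever evaluate.py does internally. If a coroutine raises after acquiring a semaphore but before releasing it, every later acquire() waits forever and the surrounding run_until_complete()/await never returns.

```python
import asyncio

async def worker(semaphore: asyncio.Semaphore, i: int, throttle_value: float = 0.05):
    await semaphore.acquire()
    await asyncio.sleep(throttle_value)  # stands in for a throttled metric call
    if i == 0:
        raise RuntimeError("simulated failure")  # release() below never runs
    semaphore.release()

async def main():
    semaphore = asyncio.Semaphore(1)
    tasks = asyncio.gather(
        *(worker(semaphore, i) for i in range(2)),
        return_exceptions=True,
    )
    try:
        # Without this timeout the await would block forever, which is
        # the same symptom as the stuck evaluation run.
        await asyncio.wait_for(tasks, timeout=2)
        print("all workers finished")
    except asyncio.TimeoutError:
        print("hung: the semaphore was never released")

asyncio.run(main())
```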

**To Reproduce**
Steps to reproduce the behavior:

  1. Create a test cases list
  2. Run them using evaluate() with run_async=True
  3. All tests are executed asynchronously and that's fine
  4. Results are not printed, saved, etc. The run is essentially stuck after the last test case (a minimal repro sketch follows below)
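For reference, a minimal script that exercises this path. It is a sketch rather than an exact copy of my pipeline: the metric (AnswerRelevancyMetric, which needs an LLM provider key configured) and the test-case count are arbitrary stand-ins, since the hang shows no regular pattern.

```python
from deepeval import evaluate
from deepeval.metrics import AnswerRelevancyMetric
from deepeval.test_case import LLMTestCase

# Arbitrary test cases; the count is a stand-in, not a known trigger.
test_cases = [
    LLMTestCase(
        input=f"Question {i}",
        actual_output=f"Answer {i}",
    )
    for i in range(20)
]

# Intermittently hangs after the last test case finishes: results are
# never printed and the process never exits.
evaluate(
    test_cases=test_cases,
    metrics=[AnswerRelevancyMetric()],
    run_async=True,
)
```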

**Expected behavior**
Tests should always end by printing either results or errors; they should never hang indefinitely.
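Until the root cause is found, one workaround for exit-code-dependent pipelines is to run the evaluation in a subprocess with a hard timeout, so a hung run fails the job instead of blocking it forever. The run_eval.py script name and the 600-second budget below are hypothetical:

```python
import subprocess
import sys

# Guard, not a fix: kill the evaluation if it exceeds the time budget
# so CI reports a failure rather than hanging indefinitely.
try:
    result = subprocess.run(
        [sys.executable, "run_eval.py"],  # hypothetical script calling evaluate()
        timeout=600,  # assumed budget; tune to your suite
    )
    sys.exit(result.returncode)
except subprocess.TimeoutExpired:
    print("Evaluation hung; failing the pipeline.", file=sys.stderr)
    sys.exit(1)
```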



penguine-ip commented 2 days ago

@jmaczan I've never encountered this issue. Can you give us something to reproduce it? For example, the number of test cases, which metric you're using, etc.