Open akv-mshin opened 1 year ago
Also, just wanted to mention that the same code succeeds in a different environment (which has a different number of items in the Mongo collection).
Also, the same code, the same pipeline, and the same environment worked just a few days ago.
So the issue really does seem to be related to the number of items in the Mongo collection (the place where the error is raised suggests that too).
Just verified that I see the same issue on 2.46.
What happened?
Our Dataflow job started to fail a few days ago (Apache Beam 2.44) without any changes on our side. After investigating the logs and execution details, I came to the conclusion that the issue is in the part of the code that uses `ReadFromMongoDB`:
pipeline | ReadFromMongoDB(uri=..., db=..., coll=..., bucket_auto=True)
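A quick sanity check on the numbers involved, assuming the failure is related to float range or precision (the traceback is not reproduced here, so this is only a sketch). `fits_exactly_in_float` is a hypothetical helper, and the 96-bit value is an assumption about what a tracker position could look like if it were derived from a 12-byte ObjectId rather than from the document count:

```python
import sys

# Document count reported in this issue.
COLLECTION_SIZE = 221418

# Every integer with magnitude up to 2**53 is exactly representable
# as an IEEE 754 double (Python's float).
MAX_EXACT_INT = 2 ** 53


def fits_exactly_in_float(n: int) -> bool:
    """Return True if n survives a round trip through float unchanged."""
    return abs(n) <= MAX_EXACT_INT and int(float(n)) == n


# The collection size itself is far below any float limit.
print(fits_exactly_in_float(COLLECTION_SIZE))  # True

# ASSUMPTION: a position derived from a 12-byte (96-bit) ObjectId.
# This is illustrative only and not confirmed as the cause of the failure.
objectid_like_position = 2 ** 96 - 1

# Such a value still fits in float *range*, but not in float *precision*.
print(objectid_like_position < sys.float_info.max)    # True
print(fits_exactly_in_float(objectid_like_position))  # False
```

If this reasoning holds, the document count alone cannot be what makes `ikey` too large; any oversized value would have to come from how positions are computed, not from the 221418 items.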
It looks like the RangeTracker that is used underneath makes the code fail (`ikey` is probably chosen as too large a number). The current collection size is 221418 elements, which should not be a problem for float capacity.
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components