Open jk47 opened 4 months ago
The main difficulty is to decide where you should use each class and call each method.
For example, consider a distributed system with one master node and several workers node. TableScan
should only be used in master, while TableRead
and TableWrite
should only be used in workers. Also you need to design how to distribute Split
s generated from TableScan
to the workers. You also need to be careful with TableCommit
because it can only run with 1 parallelism (otherwise the consistency guarantee is broken).
All in all, these things are exactly what you need to concern when designing a distributed system.
Search before asking
Motivation
https://paimon.apache.org/docs/0.8/program-api/java-api/ comes with a warning at the top
Can you elaborate on the difficulties that will be encountered?
Solution
No response
Anything else?
No response
Are you willing to submit a PR?