Open mvashishtha opened 1 year ago
In this case, asynchronous execution will degrade.
Why do you want to move the check to the main process?
@anmyachev we need a synchronous check for equality so we can throw a ValueError
if the indexes don't match on either axis:
import pandas as pd
pd.DataFrame([[1]]).compare(pd.DataFrame([[1, 2]]))
ValueError: Can only compare identically-labeled (both index and columns) DataFrame objects
We shouldn't require every query compiler to do this synchronous check.
Also, we should add this check to the API layer because the error message depends on the type of the API-layer object. For Series
, we have ValueError: Can only compare identically-labeled Series objects
, but for dataframe, we have ValueError: Can only compare identically-labeled (both index and columns) DataFrame objects
.
Modin version checks
[X] I have checked that this issue has not already been reported.
[X] I have confirmed this bug exists on the latest released version of Modin.
[X] I have confirmed this bug exists on the main branch of Modin. (In order to do this you can follow this guide.)
Reproducible Example
Issue Description
I get an error from a remote task that is doing a pandas
compare
.Expected Behavior
I want the API layer to catch the error here, but
compare_index
has no effect whenother
is a modin dataframe or series.Error Logs
Installed Versions