aws / aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
https://aws-sdk-pandas.readthedocs.io
Apache License 2.0

wr.lakeformation.read_sql_query() on table with deleted S3 files returns empty dataframe instead of error #626

Closed · fliverance closed this issue 2 years ago

fliverance commented 3 years ago

Describe the bug

When using Wrangler to write a Parquet file to a governed table, the upload can fail partway through (e.g. if the network is interrupted). This can leave the Lake Formation catalog referencing an S3 file that does not actually exist. When the table is subsequently read, Wrangler returns an empty DataFrame instead of throwing an exception, leading to silent failures, data corruption, etc. Wrangler should be able to identify that the Parquet file registered with Lake Formation was not actually found, fail fast, and throw an exception instead.
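A fail-fast check along these lines could live either in Wrangler or in caller code. The sketch below is hypothetical and not part of Wrangler's API; it assumes the governed table's object manifest is readable through the Lake Formation GetTableObjects API, and verifies each registered object with wr.s3.does_object_exist:

```python
import boto3
import awswrangler as wr


def validate_governed_table_objects(database: str, table: str) -> None:
    """Hypothetical helper: raise if the governed table's manifest
    references S3 objects that no longer exist."""
    lf = boto3.client("lakeformation")
    tx_id = lf.start_transaction(TransactionType="READ_ONLY")["TransactionId"]
    try:
        token = None
        while True:
            kwargs = {"DatabaseName": database, "TableName": table, "TransactionId": tx_id}
            if token:
                kwargs["NextToken"] = token
            page = lf.get_table_objects(**kwargs)
            # Each entry groups the table objects registered for one partition.
            for partition in page.get("Objects", []):
                for obj in partition.get("Objects", []):
                    if not wr.s3.does_object_exist(obj["Uri"]):
                        raise FileNotFoundError(
                            f"Catalog references a missing S3 object: {obj['Uri']}"
                        )
            token = page.get("NextToken")
            if not token:
                break
    finally:
        lf.commit_transaction(TransactionId=tx_id)
```

A check like this trades extra S3 HEAD requests for the fail-fast behavior described above.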

To Reproduce

VENV setup:

```bash
BRANCH=main-governed-tables

VENV=./venv
rm -rf $VENV
python3.9 -m venv $VENV && source $VENV/bin/activate
pip3.9 install git+https://github.com/awslabs/aws-data-wrangler@$BRANCH

pip3.9 install numpy pandas faker aiobotocore[boto3] fsspec s3fs

# Also remember to update botocore to match the main-governed-tables APIs
```

Example:

```python
import awswrangler as wr
import pandas as pd

table_name = "whatever"
database_name = "some_database"
s3_prefix = "s3://somewhere-good/"

# Any sample data will do.
some_df = pd.DataFrame({"name": ["a", "b"], "value": [1, 2]})

wr.s3.to_parquet(
    some_df,
    path=s3_prefix + table_name,
    dataset=True,
    mode="overwrite",
    database=database_name,
    table=table_name,
    table_type="GOVERNED",
)

# <go delete the created S3 file in the AWS console>

should_be_same_df = wr.lakeformation.read_sql_query(
    sql=f"SELECT * FROM {table_name}", database=database_name
)

# returns an empty DataFrame instead of a 'file not found' or some other error
```


jaidisido commented 3 years ago

After running some tests, it appears that the Lake Formation engine does not return work units for an empty Glue table.

To reproduce:

```python
import awswrangler as wr

wr.catalog.create_parquet_table(
    database="my_db",
    table="my_empty_table",
    path="s3://my_bucket/my_prefix",
    table_type="GOVERNED",
    columns_types={"name": "string", "value": "bigint"},
)
```

The above command effectively creates an empty table in the AWS Glue catalog.

Then, when running a Lake Formation query against this table and attempting to get work units from the engine, none are returned. Without work units, an Arrow table cannot be obtained and the schema cannot be inferred, so an empty list is returned instead of a DataFrame.
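This can be observed directly with boto3's Lake Formation client. A minimal sketch, assuming the governed-tables query planning APIs (start_query_planning, get_query_state, get_work_units) and the my_db / my_empty_table names from above:

```python
import time

import boto3

lf = boto3.client("lakeformation")

# Plan a query against the empty governed table.
query_id = lf.start_query_planning(
    QueryPlanningContext={"DatabaseName": "my_db"},
    QueryString="SELECT * FROM my_empty_table",
)["QueryId"]

# Wait for query planning to finish before asking for work units.
while lf.get_query_state(QueryId=query_id)["State"] == "PENDING":
    time.sleep(1)

# For an empty table the engine returns no work unit ranges, so there is
# nothing to fetch and no schema to infer from.
print(lf.get_work_units(QueryId=query_id)["WorkUnitRanges"])  # -> []
```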