unionai-oss / pandera

A light-weight, flexible, and expressive statistical data testing library
https://www.union.ai/pandera

str_length function not working in pa.Field for PySpark #1311

Open karutyunov opened 1 year ago

karutyunov commented 1 year ago

Describe the bug

When trying to use the str_length check in pa.Field to validate the length of a string column, we get a NotImplementedError every time. I tried passing the arguments in different ways, both as a dict of keyword arguments (as in the code sample below) and in the form str_length(1, 2); both options give the same error.
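For clarity, the two forms are roughly the following (the second line is presumably the explicit check-constructor equivalent of "str_length(1, 2)", not the literal code that was run):

import pandera.pyspark as pa

# keyword-dict form, as used in the full code sample below
pa.Field(str_length={"min_value": 1, "max_value": 2})

# explicit check constructor (presumably what "str_length(1, 2)" refers to)
pa.Check.str_length(1, 2)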


Code Sample, a copy-pastable example

import json

import pandera.pyspark as pa
import pyspark.sql.types as T

from decimal import Decimal
from pyspark.sql import SparkSession
from pandera.pyspark import DataFrameModel

spark = SparkSession.builder.getOrCreate()

class PanderaSchema(DataFrameModel):
    id: T.IntegerType() = pa.Field(gt=4)
    product_name: T.StringType() = pa.Field(str_length={"min_value": 1, "max_value": 2}, coerce=True)  # this is the check that raises NotImplementedError
    price: T.DecimalType(20, 5) = pa.Field()
    description: T.ArrayType(T.StringType()) = pa.Field()
    meta: T.MapType(T.StringType(), T.StringType()) = pa.Field()

data = [
    (5, "Bread", Decimal(44.4), ["description of product"], {"product_category": "dairy"}),
    (15, "Butter", Decimal(99.0), ["more details here"], {"product_category": "bakery"}),
]

spark_schema = T.StructType(
    [
        T.StructField("id", T.IntegerType(), False),
        T.StructField("product_name", T.StringType(), False),
        T.StructField("price", T.DecimalType(20, 5), False),
        T.StructField("description", T.ArrayType(T.StringType(), True), False),
        T.StructField(
            "meta", T.MapType(T.StringType(), T.StringType(), True), False
        ),
    ],
)
df = spark.createDataFrame(data, spark_schema)

df_out = PanderaSchema.validate(check_obj=df)

df_out_errors = df_out.pandera.errors
print(json.dumps(dict(df_out_errors), indent=4))

Expected behavior

We expect the str_length check either to pass or to fail with a validation error; instead, the check itself errors out with a NotImplementedError:

/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pyspark/pandas/__init__.py:48: UserWarning: 'PYARROW_IGNORE_TIMEZONE' environment variable was not set. It is required to set this environment variable to '1' in both driver and executor sides if you use pyarrow>=2.0.0. pandas-on-Spark will set it for you but it does not work if there is a Spark context already launched.
  "'PYARROW_IGNORE_TIMEZONE' environment variable was not set. It is required to "
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).
23/08/14 18:37:19 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
{
    "DATA": {
        "CHECK_ERROR": [
            {
                "schema": "PanderaSchema",
                "column": "product_name",
                "check": "str_length(1, 2)",
                "error": "Error while executing check function: NotImplementedError()\nTraceback (most recent call last):\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/backends/pyspark/components.py\", line 135, in run_checks\n    check_obj, schema, check, check_index, *check_args\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/backends/pyspark/base.py\", line 85, in run_check\n    check_result = check(check_obj, *args)\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/api/checks.py\", line 229, in __call__\n    return backend(check_obj, column)\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/backends/pyspark/checks.py\", line 110, in __call__\n    check_obj, key, self.check._check_kwargs\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/multimethod/__init__.py\", line 407, in __call__\n    return self[sig](*args, **kwargs)\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/backends/pyspark/checks.py\", line 79, in apply\n    return self.check._check_fn(check_obj_and_col_name, **kwargs)\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/multimethod/__init__.py\", line 371, in __call__\n    return func(*args, **kwargs)\n  File \"/Users/ka/change-devices-propensity/venv/lib/python3.7/site-packages/pandera/backends/base/builtin_checks.py\", line 92, in str_length\n    raise NotImplementedError\nNotImplementedError\n"
            }
        ]
    }
}


Additional context

As part of my testing I also tried the in_range check, since it uses the same argument-passing syntax; it works flawlessly.
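For illustration only (the class name and bounds here are made up, not the actual schema I ran), the same dict-style syntax with in_range validates without raising:

import pandera.pyspark as pa
import pyspark.sql.types as T
from pandera.pyspark import DataFrameModel

class InRangeExample(DataFrameModel):
    # identical argument-passing style to str_length, but this check is implemented
    id: T.IntegerType() = pa.Field(in_range={"min_value": 1, "max_value": 20})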

marrov commented 4 months ago

I've put together a quick workaround that solves this by registering str_length as a builtin check for the PySpark backend:

import pyspark.sql.types as T

from typing import Optional
from pyspark.sql import functions as F
from pandera.api.extensions import register_builtin_check
from pandera.backends.pyspark.utils import convert_to_list
from pandera.api.pyspark.types import PysparkDataframeColumnObject
from pandera.backends.pyspark.decorators import register_input_datatypes

@register_builtin_check(error="str_length({min_value}, {max_value})")
@register_input_datatypes(acceptable_datatypes=convert_to_list(T.StringType))
def str_length(
    data: PysparkDataframeColumnObject,
    min_value: Optional[int] = None,
    max_value: Optional[int] = None,
) -> bool:
    """Ensure that the length of strings in a column is within a specified range."""
    if min_value is None and max_value is None:
        raise ValueError("Must provide at least one of 'min_value' and 'max_value'")
    # build the length condition, only adding the bounds that were provided
    str_len = F.length(F.col(data.column_name))
    cond = F.lit(True)
    if min_value is not None:
        cond = cond & (str_len >= min_value)
    if max_value is not None:
        cond = cond & (str_len <= max_value)

    # the check passes only if no row violates the condition
    return data.dataframe.filter(~cond).limit(1).count() == 0

This should be added to the PySpark backend's builtin checks in pandera.
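For anyone who wants to use this before it lands upstream, a minimal usage sketch (assuming the registration code above has been executed first and behaves as described; the schema name here is my own and the dataframe is the one from the original report):

import pandera.pyspark as pa
import pyspark.sql.types as T
from pandera.pyspark import DataFrameModel

# the @register_builtin_check / @register_input_datatypes block above must have
# run already, e.g. by importing the module that defines it

class ProductSchema(DataFrameModel):
    product_name: T.StringType() = pa.Field(str_length={"min_value": 1, "max_value": 2})

df_out = ProductSchema.validate(check_obj=df)  # df built as in the original example
# the check now executes: failures show up as ordinary validation errors
# instead of a CHECK_ERROR wrapping a NotImplementedError
print(dict(df_out.pandera.errors))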