Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-49771][PYTHON] Improve Pandas Scalar Iter UDF error when outpu…
…t rows exceed input rows ### What changes were proposed in this pull request? This PR changes the `assert` error into a user-facing PySpark error when the pandas_iter UDF has more output rows than input rows. ### Why are the changes needed? To make the error message more user-friendly. After the PR, the error will be `pyspark.errors.exceptions.base.PySparkRuntimeError: [PANDAS_UDF_OUTPUT_EXCEEDS_INPUT_ROWS] The Pandas SCALAR_ITER UDF outputs more rows than input rows.` ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Existing tests ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#48231 from allisonwang-db/spark-49771-pd-iter-err. Authored-by: allisonwang-db <[email protected]> Signed-off-by: Hyukjin Kwon <[email protected]>
- Loading branch information