Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Spark: Better statistics estimation for Spark 2 Reader (apache#3134)
Follow-up to apache#3038. Use (estimated) row size * number of rows to estimate the size instead of adding up file sizes. The row size is estimated from the pruned schema if we prune columns.
- Loading branch information