[TASK] Benchmark PERFILE deletion vector reader #12299

Open
razajafri opened this issue Mar 7, 2025 · 0 comments
Labels
? - Needs Triage (Need team to review and classify), bug (Something isn't working)

Comments

@razajafri
Collaborator

We need to benchmark the PERFILE deletion vector reader and document whether it introduces any performance regression.

This involves several steps:

  • Convert the sf100 parquet data to Delta Lake format
  • Create two sets of tables, one with 2% of the data deleted, and the other with 50% of the data deleted
  • Benchmark the query against branch-25.04 and the feature branch
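
The table-preparation steps above could be scripted roughly as follows. This is a minimal sketch, not the procedure actually used for this task: the path, table name, and key column are hypothetical placeholders, and hashing a key column into 100 buckets removes only approximately the requested percentage of rows. The helper only builds Spark SQL strings, which could then be passed to `spark.sql(...)` in a session with Delta Lake configured.

```python
from typing import List


def deletion_predicate(key_column: str, percent: int) -> str:
    """Build a predicate that matches roughly `percent`% of rows."""
    if not 0 < percent < 100:
        raise ValueError("percent must be between 1 and 99")
    # pmod keeps the bucket non-negative even when hash() returns a negative value.
    return f"pmod(hash({key_column}), 100) < {percent}"


def prepare_statements(parquet_path: str, table: str,
                       key_column: str, percent: int) -> List[str]:
    """Return SQL that copies Parquet data into a Delta table with deletion
    vectors enabled, then deletes roughly `percent`% of its rows.

    All arguments are placeholders chosen by the caller (assumption: one
    Delta table is created per deletion ratio).
    """
    create = (
        f"CREATE TABLE {table} USING DELTA "
        "TBLPROPERTIES ('delta.enableDeletionVectors' = 'true') "
        f"AS SELECT * FROM parquet.`{parquet_path}`"
    )
    delete = f"DELETE FROM {table} WHERE {deletion_predicate(key_column, percent)}"
    return [create, delete]
```

Calling `prepare_statements` once with `percent=2` and once with `percent=50`, each with a different table name, would yield the two sets of tables described above.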

Once the results are documented, the above steps should be repeated with sf3k if needed.
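
One possible way to document a regression is to compare the median runtime of the same query on the two branches. The sketch below assumes the runtimes have already been collected as lists of seconds; the function name and the choice of the median are illustrative, not something specified in this issue.

```python
from statistics import median
from typing import Sequence


def relative_change(baseline_secs: Sequence[float],
                    candidate_secs: Sequence[float]) -> float:
    """Fractional change in median runtime of the candidate vs. the baseline.

    A positive value means the candidate (e.g. the feature branch) is slower
    than the baseline (e.g. branch-25.04); a negative value means it is faster.
    """
    if not baseline_secs or not candidate_secs:
        raise ValueError("both runs need at least one timing")
    base = median(baseline_secs)
    cand = median(candidate_secs)
    return (cand - base) / base
```

Using the median rather than the mean makes the comparison less sensitive to a single slow warm-up run.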

@razajafri added the "? - Needs Triage" and "bug" labels Mar 7, 2025
@razajafri self-assigned this Mar 7, 2025