Ensure 100% table coverage in BigQuery #36
Comments
I’ll work on this one.
Thanks Paul!
I extracted all of this data with a query that looks at row counts by table name in each dataset.
Here's a graphical summary of the gaps. In case the descriptions are difficult to read, here's a text summary of the gaps as well.
I'll work on the har, lighthouse, pages and requests datasets next.
The har, lighthouse, pages and requests datasets do not appear to have any gaps between the old and new datasets. There are a few interesting things to note:
This is awesome, thanks for compiling it, Paul. I'll work on rerunning the pipeline for any missing tables.
Cool, thanks. Let me know when it’s done and I can update the visualization with the latest table data to confirm.
https://discuss.httparchive.org/t/missing-2016-02-15-chrome-requests/1310 is a bug report that some 2016_02_15 tables are missing.
We should take inventory of all tables across all dates and reprocess anything that's missing.
This can be a good first bug for first-time contributors. Overview of the expected workflow:
Use the bq command line interface to list the contents of each dataset
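Once the table names for a dataset have been listed, the inventory step could be sketched roughly as below. This is only an illustration, not the project's actual tooling: the function name `find_missing_tables`, the `<date>_<client>` naming pattern, and the `desktop`/`mobile` suffixes are assumptions made for the example, and the table list is made up rather than real output from bq.

```python
from itertools import product


def find_missing_tables(existing, dates, clients):
    """Return expected table names that are absent from `existing`.

    Assumes (hypothetically) that tables are named "<date>_<client>",
    e.g. "2016_02_15_desktop".
    """
    present = set(existing)
    expected = (f"{date}_{client}" for date, client in product(dates, clients))
    return [name for name in expected if name not in present]


# Made-up listing for one dataset; in practice the names would come from
# listing the dataset's contents with the bq command line interface.
existing_tables = [
    "2016_02_01_desktop",
    "2016_02_01_mobile",
    "2016_02_15_mobile",
]

missing = find_missing_tables(
    existing_tables,
    dates=["2016_02_01", "2016_02_15"],
    clients=["desktop", "mobile"],
)
print(missing)  # ['2016_02_15_desktop']
```

Running the same check per dataset would give a list of tables to reprocess; the expected dates and suffixes would need to be adjusted to whatever each dataset actually uses.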