I guess I could just dump directly to CSVs and download but BQ is a nice convenient bottomless data bucket.
[1] https://gdal.org
Last year I found ClickHouse did H3 enrichment much faster than PG or BQ so it has become a go-between for larger workloads with simple enrichment requirements.
The dataset in the post is an example anyone can download. I handle 30B+ records at work.
You can just add feature requests like this to ClickHouse issue tracker. https://github.com/ClickHouse/ClickHouse/issues/45129
BigQuery also supports in-place querying of datasets on GCS (or S3/Azure using Omni) via external/BigLake tables. https://cloud.google.com/bigquery/docs/query-cloud-storage-u...
(i was the pm for bigquery federation with gcs and cloudsql)
"It would be great to see data vendors deliver data straight into the Cloud Databases of their customers. It would save a lot of client time that's spent converting and uploading files."
Hear hear! Shameless plug: this is exactly what we enable at prequel.co. If there are any data vendors reading this, or anyone who wants easier access to data from their vendor, we're here to help.
edit: quote fmt
Also, thanks for sharing S2! It'll be nice to look at.