Could someone share an example of a job config for uploading a newline-delimited JSON file to a new BigQuery table, please?

I've been trying to do this based on the Google docs, with no success so far.

1 Answer

This example from the GCP Python samples repository is a good one for loading data from GCS.

The only thing you have to adapt in that code is setting job.source_format to newline-delimited JSON, like so:

import time
import uuid

from google.cloud import bigquery

def load_data_from_gcs(dataset_name, table_name, source):
    bigquery_client = bigquery.Client()
    dataset = bigquery_client.dataset(dataset_name)
    table = dataset.table(table_name)
    job_name = str(uuid.uuid4())

    job = bigquery_client.load_table_from_storage(
        job_name, table, source)

    # The default source format is CSV; switch to newline-delimited JSON.
    job.source_format = 'NEWLINE_DELIMITED_JSON'
    job.begin()
    wait_for_job(job)

    print('Loaded {} rows into {}:{}.'.format(
        job.output_rows, dataset_name, table_name))

def wait_for_job(job):
    # Helper from the linked sample: poll until the job completes.
    while True:
        job.reload()
        if job.state == 'DONE':
            if job.error_result:
                raise RuntimeError(job.errors)
            return
        time.sleep(1)

(Ideally the source format would be received as a parameter to the function, but this works as an example.)
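For instance, a parameterized version might look like this (a minimal sketch against the same client-library version; the 'CSV' default mirrors the API's own default, and the dataset, table, and bucket names in the call are placeholders):

def load_data_from_gcs(dataset_name, table_name, source,
                       source_format='CSV'):
    bigquery_client = bigquery.Client()
    table = bigquery_client.dataset(dataset_name).table(table_name)
    job = bigquery_client.load_table_from_storage(
        str(uuid.uuid4()), table, source)
    job.source_format = source_format
    job.begin()
    wait_for_job(job)

load_data_from_gcs('my_dataset', 'my_table',
                   'gs://my-bucket/data.json',
                   source_format='NEWLINE_DELIMITED_JSON')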

Also, the table should already exist when you run this code (I looked for schema auto-detection in the Python API but it seems there isn't one yet).
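If you need to create the table first, a minimal sketch with this same client-library version (reusing the bigquery_client from above) would be something like the following; the schema fields here are placeholders, so replace them with fields matching your JSON:

table = bigquery_client.dataset('my_dataset').table('my_table')
table.schema = [
    bigquery.SchemaField('name', 'STRING'),
    bigquery.SchemaField('age', 'INTEGER'),
]
table.create()  # the load job can then write into the existing table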
