Importing external Apache Iceberg tables into Lakehouse for Apache Iceberg consolidates data from multiple catalogs. Within the Lakehouse runtime catalog, the Apache Iceberg REST catalog endpoint provides query engines with direct access to these imported tables.
Before you begin
-
Verify that billing is enabled for your Google Cloud project.
-
Enable the BigLake API.
Roles required to enable APIs
To enable APIs, you need the Service Usage Admin IAM role (
roles/serviceusage.serviceUsageAdmin), which contains theserviceusage.services.enablepermission. Learn how to grant roles.
Required roles
To get the permissions that you need to import tables in the Google Cloud console, ask your administrator to grant you the following IAM roles on your project:
- BigLake Admin (
roles/biglake.admin) - Storage Admin (
roles/storage.admin)
For more information about granting roles, see Manage access to projects, folders, and organizations.
You might also be able to get the required permissions through custom roles or other predefined roles.
Import tables
You can ingest data from external Apache Iceberg REST catalogs (IRC) directly into Google Cloud Lakehouse tables using Dataflow's job builder UI. For more information, see Import external Iceberg tables into Dataflow.