- 2.25.0 (latest)
- 2.24.0
- 2.23.0
- 2.22.0
- 2.21.0
- 2.20.0
- 2.19.0
- 2.18.0
- 2.17.0
- 2.16.0
- 2.15.0
- 2.14.0
- 2.13.0
- 2.12.0
- 2.11.0
- 2.10.0
- 2.9.0
- 2.8.0
- 2.7.0
- 2.6.0
- 2.5.0
- 2.4.0
- 2.3.0
- 2.2.0
- 1.36.0
- 1.35.0
- 1.34.0
- 1.33.0
- 1.32.0
- 1.31.0
- 1.30.0
- 1.29.0
- 1.28.0
- 1.27.0
- 1.26.0
- 1.25.0
- 1.24.0
- 1.22.0
- 1.21.0
- 1.20.0
- 1.19.0
- 1.18.0
- 1.17.0
- 1.16.0
- 1.15.0
- 1.14.0
- 1.13.0
- 1.12.0
- 1.11.1
- 1.10.0
- 1.9.0
- 1.8.0
- 1.7.0
- 1.6.0
- 1.5.0
- 1.4.0
- 1.3.0
- 1.2.0
- 1.1.0
- 1.0.0
- 0.26.0
- 0.25.0
- 0.24.0
- 0.23.0
- 0.22.0
- 0.21.0
- 0.20.1
- 0.19.2
- 0.18.0
- 0.17.0
- 0.16.0
- 0.15.0
- 0.14.1
- 0.13.0
- 0.12.0
- 0.11.0
- 0.10.0
- 0.9.0
- 0.8.0
- 0.7.0
- 0.6.0
- 0.5.0
- 0.4.0
- 0.3.0
- 0.2.0
StreamingDataFrame(df: bigframes.dataframe.DataFrame, *, create_key=0)Two-dimensional, size-mutable, potentially heterogeneous tabular data.
Data structure also contains labeled axes (rows and columns). Arithmetic operations align on both row and column labels. Can be thought of as a dict-like container for Series objects. The primary pandas data structure.
Methods
__getitem__
__getitem__(*args, **kwargs)Gets the specified column(s) from the StreamingDataFrame.
| Parameter | |
|---|---|
| Name | Description | 
| key | indexIndex or list of indices. It can be a column label, a list of column labels, a Series of booleans or a pandas Index of desired column labels | 
| Returns | |
|---|---|
| Type | Description | 
| bigframes.pandas.Series or Any | Value(s) at the requested index(es). | 
__repr__
__repr__(*args, **kwargs)Converts a StreamingDataFrame to a string. Calls to_pandas.
Only represents the first <xref uid="bigframes.options">bigframes.options</xref>.display.max_rows.
__setitem__
__setitem__(*args, **kwargs)Modify or insert a column into the StreamingDataFrame.
| Parameters | |
|---|---|
| Name | Description | 
| key | column indexIt can be a new column to be inserted, or an existing column to be modified. | 
| value | scalar, Sequence, StreamingDataFrame, or SeriesValue to be assigned to the column | 
rename
rename(*args, **kwargs)Rename columns.
Dict values must be unique (1-to-1). Labels not contained in a dict will be left as-is. Extra labels listed don't throw an error.
| Parameters | |
|---|---|
| Name | Description | 
| columns | MappingDict-like from old column labels to new column labels. | 
| inplace | boolDefault False. Whether to modify the StreamingDataFrame rather than creating a new one. | 
| Exceptions | |
|---|---|
| Type | Description | 
| KeyError | If any of the labels is not found. | 
| Returns | |
|---|---|
| Type | Description | 
| bigframes.pandas.DataFrame None | StreamingDataFrame with the renamed axis labels or None if inplace=True. | 
to_bigtable
to_bigtable(
    *,
    instance: str,
    table: str,
    service_account_email: typing.Optional[str] = None,
    app_profile: typing.Optional[str] = None,
    truncate: bool = False,
    overwrite: bool = False,
    auto_create_column_families: bool = False,
    bigtable_options: typing.Optional[dict] = None,
    job_id: typing.Optional[str] = None,
    job_id_prefix: typing.Optional[str] = None,
    start_timestamp: typing.Optional[
        typing.Union[int, float, str, datetime.datetime, datetime.date]
    ] = None,
    end_timestamp: typing.Optional[
        typing.Union[int, float, str, datetime.datetime, datetime.date]
    ] = None
) -> google.cloud.bigquery.job.query.QueryJobExport the StreamingDataFrame as a continue job and returns a QueryJob object for some management functionality.
This method requires an existing bigtable preconfigured to accept the continuous query export statement. For instructions on export to bigtable, see https://cloud.google.com/bigquery/docs/export-to-bigtable.
| Parameters | |
|---|---|
| Name | Description | 
| instance | strThe name of the bigtable instance to export to. | 
| table | strThe name of the bigtable table to export to. | 
| service_account_email | strFull name of the service account to run the continuous query. Example: accountname@projectname.gserviceaccounts.com If not provided, the user account will be used, but this limits the lifetime of the continuous query. | 
| app_profile | str, default NoneThe bigtable app profile to export to. If None, no app profile will be used. | 
| truncate | bool, default FalseThe export truncate option, see https://cloud.google.com/bigquery/docs/reference/standard-sql/other-statements#bigtable_export_option | 
| overwrite | bool, default FalseThe export overwrite option, see https://cloud.google.com/bigquery/docs/reference/standard-sql/other-statements#bigtable_export_option | 
| auto_create_column_families | bool, default FalseThe auto_create_column_families option, see https://cloud.google.com/bigquery/docs/reference/standard-sql/other-statements#bigtable_export_option | 
| bigtable_options | dict, default NoneThe bigtable options dict, which will be converted to JSON using json.dumps, see https://cloud.google.com/bigquery/docs/reference/standard-sql/other-statements#bigtable_export_option If None, no bigtable_options parameter will be passed. | 
| job_id | str, default NoneIf specified, replace the default job id for the query, see job_id parameter of https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.client.Client#google_cloud_bigquery_client_Client_query | 
| job_id_prefix | str, default NoneIf specified, a job id prefix for the query, see job_id_prefix parameter of https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.client.Client#google_cloud_bigquery_client_Client_query | 
| start_timestamp | int, float, str, datetime, date, default NoneThe starting timestamp for the query. Possible values are to 7 days in the past. If don't specify a timestamp (None), the query will default to the earliest possible time, 7 days ago. If provide a time-zone-naive timestamp, it will be treated as UTC. | 
| Returns | |
|---|---|
| Type | Description | 
| google.cloud.bigquery.QueryJob | See https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.job.QueryJob The ongoing query job can be managed using this object. For example, the job can be cancelled or its error status can be examined. | 
to_pubsub
to_pubsub(
    *,
    topic: str,
    service_account_email: str,
    job_id: typing.Optional[str] = None,
    job_id_prefix: typing.Optional[str] = None,
    start_timestamp: typing.Optional[
        typing.Union[int, float, str, datetime.datetime, datetime.date]
    ] = None
) -> google.cloud.bigquery.job.query.QueryJobExport the StreamingDataFrame as a continue job and returns a QueryJob object for some management functionality.
This method requires an existing pubsub topic. For instructions on creating a pubsub topic, see https://cloud.google.com/pubsub/docs/samples/pubsub-quickstart-create-topic?hl=en
Note that a service account is a requirement for continuous queries exporting to pubsub.
| Parameters | |
|---|---|
| Name | Description | 
| topic | strThe name of the pubsub topic to export to. For example: "taxi-rides" | 
| service_account_email | strFull name of the service account to run the continuous query. Example: accountname@projectname.gserviceaccounts.com | 
| job_id | str, default NoneIf specified, replace the default job id for the query, see job_id parameter of https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.client.Client#google_cloud_bigquery_client_Client_query | 
| job_id_prefix | str, default NoneIf specified, a job id prefix for the query, see job_id_prefix parameter of https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.client.Client#google_cloud_bigquery_client_Client_query | 
| start_timestamp | int, float, str, datetime, date, default NoneThe starting timestamp for the query. Possible values are to 7 days in the past. If don't specify a timestamp (None), the query will default to the earliest possible time, 7 days ago. If provide a time-zone-naive timestamp, it will be treated as UTC. | 
| Returns | |
|---|---|
| Type | Description | 
| google.cloud.bigquery.QueryJob | See https://cloud.google.com/python/docs/reference/bigquery/latest/google.cloud.bigquery.job.QueryJob The ongoing query job can be managed using this object. For example, the job can be cancelled or its error status can be examined. |