Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

BigQuery DataFrames (also known as BigFrames)

License

NotificationsYou must be signed in to change notification settings

googleapis/python-bigquery-dataframes

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
orphan:

BigQuery DataFrames (BigFrames)

GApypiversions

BigQuery DataFrames (also known as BigFrames) provides a Pythonic DataFrameand machine learning (ML) API powered by the BigQuery engine.

  • bigframes.pandas provides a pandas API for analytics. Many workloads can bemigrated from pandas to bigframes by just changing a few imports.
  • bigframes.ml provides a scikit-learn-like API for ML.

BigQuery DataFrames is an open-source package.

Version 2.0 introduces breaking changes for improved security and performance. See below for details.

Getting started with BigQuery DataFrames

The easiest way to get started is to try theBigFrames quickstartin anotebook in BigQuery Studio.

To use BigFrames in your local development environment,

  1. Runpip install --upgrade bigframes to install the latest version.
  2. SetupApplication default credentialsfor your local development environment enviroment.
  3. Create aGCP project with the BigQuery API enabled.
  4. Use thebigframes package to query data.
importbigframes.pandasasbpdbpd.options.bigquery.project=your_gcp_project_iddf=bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_2013")print(df.groupby("name")    .agg({"number":"sum"})    .sort_values("number",ascending=False)    .head(10)    .to_pandas())

Documentation

To learn more about BigQuery DataFrames, visit these pages

⚠️ Warning: Breaking Changes in BigQuery DataFrames v2.0

Version 2.0 introduces breaking changes for improved security and performance. Key default behaviors have changed, including

  • Large Results (>10GB): The default value forallow_large_results has changed toFalse.Methods liketo_pandas() will now fail if the query result's compressed data size exceeds 10GB,unless large results are explicitly permitted.
  • Remote Function Security: The library no longer automatically lets the Compute Engine default serviceaccount become the identity of the Cloud Run functions. If that is desired, it has to be indicated by passingcloud_function_service_account="default". And network ingress now defaults to"internal-only".
  • @remote_function Argument Passing: Arguments other thaninput_types,output_type, anddatasettoremote_function must now be passed using keyword syntax, as positional arguments are no longer supported.
  • @udf Argument Passing: Argumentsdataset andname toudf are now mandatory.
  • Endpoint Connections: Automatic fallback to locational endpoints in certain regions is removed.
  • LLM Updates (Gemini Integration): Integrations now default to thegemini-2.0-flash-001 model.PaLM2 support has been removed; please migrate any existing PaLM2 usage to Gemini.Note: The current defaultmodel will be removed in Version 3.0.

Important: If you are not ready to adapt to these changes, please pin your dependency to a version less than 2.0(e.g.,bigframes==1.42.0) to avoid disruption.

To learn about these changes and how to migrate to version 2.0, see theupdated introduction guide.

License

BigQuery DataFrames is distributed with theApache-2.0 license.

It also contains code derived from the following third-party packages:

For details, see thethird_partydirectory.

Contact Us

For further help and provide feedback, you can email us atbigframes-feedback@google.com.


[8]ページ先頭

©2009-2025 Movatter.jp