Stay organized with collections Save and categorize content based on your preferences.
Introduction to BigQuery DataFramesBigQuery DataFrames is a set of open source Python libraries that let you take advantage of BigQuery data processing by using familiar Python APIs. BigQuery DataFrames provides a Pythonic DataFrame powered by the BigQuery engine, and it implements the pandas and scikit-learn APIs by pushing the processing down to BigQuery through SQL conversion. This lets you use BigQuery to explore and process terabytes of data, and also train machine learning (ML) models, all with Python APIs.
The following diagram describes the workflow of BigQuery DataFrames:
Note: There are breaking changes to some default parameters in BigQuery DataFrames version 2.0. To learn about these changes and how to migrate to version 2.0, see Migrate to BigQuery DataFrames 2.0. BigQuery DataFrames benefitsBigQuery DataFrames does the following:
BigQuery DataFrames is distributed with the Apache-2.0 license.
BigQuery DataFrames also contains code derived from the following third-party packages:
For details, see the third_party/bigframes_vendored
directory in the BigQuery DataFrames GitHub repository.
_anonymous_
dataset in the Google Cloud project you specify in the bf.options.bigquery.project
option.dbt-bigquery
adapter.Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-07 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[[["BigQuery DataFrames are open-source Python libraries that enable users to leverage BigQuery's data processing power through familiar Python APIs."],["It offers over 750 implemented pandas and scikit-learn APIs by converting them transparently into SQL for BigQuery and BigQuery ML API processing."],["BigQuery DataFrames enhances performance by deferring query execution and allowing user-defined Python functions for data transformation, which are automatically deployed as BigQuery remote functions."],["The libraries integrate with Vertex AI for text generation with Gemini models, alongside other external packages like Ibis, pandas, and scikit-learn, and is distributed under the Apache-2.0 license."],["Users should be aware of BigQuery quotas, the subset of supported pandas and scikit-learn APIs, and that the usage of BigQuery, Cloud Run functions, and Vertex AI may incur additional costs."]]],[]]
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4