This article explains the benefits of running your workloads on the Photon query engine.
Photon is a high-performance Databricks-native vectorized query engine that runs your SQL workloads and DataFrame API calls faster to reduce your total cost per workload. Photon is compatible with Apache Spark APIs, so it works with your existing code.
Photon featuresâThe following are key features and advantages of using Photon.
UPDATE
, DELETE
, MERGE INTO
, INSERT
, and CREATE TABLE AS SELECT
, including wide tables that contain thousands of columns.Photon enablement varies by compute type:
To enable or disable Photon on all-purpose and jobs compute, select the Use Photon Acceleration checkbox in the Compute UI.
Photon is not enabled by default on any compute created with the Clusters API or Jobs API. To enable Photon, you must set the runtime_engine
attribute to PHOTON
.
Photon supports a number of instance types on the driver and worker nodes. Photon instance types consume DBUs at a different rate than the same instance type running the non-Photon runtime. For more information about Photon instances and DBU consumption, see the Databricks pricing page.
Supported operators, expressions, and data typesâThe following are the operators, expressions, and data types that Photon covers.
Operators
Expressions
Data types
The following are features that require Photon.
MERGE
, UPDATE
, and DELETE
statements. See Dynamic file pruning.If your workload hits an unsupported operation, the compute resource switches to the standard runtime engine for the remainder of the workload.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4