RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from https://docs.databricks.com/aws/en/archive/runtime-release-notes/9.1lts below:

Databricks Runtime 9.1 LTS | Databricks Documentation

Databricks Runtime 9.1 LTS

The following release notes provide information about Databricks Runtime 9.1 LTS and Databricks Runtime 9.1 LTS Photon, powered by Apache Spark 3.1.2. Databricks released this version in September 2021. Photon is in Public Preview.

New features and improvementsâ

Auto Loader schema hints now work with array and map typesâ

Array and map types are supported in Override schema inference with schema hints for Auto Loader.

Examples of schema hints for arrays include:

arr Array<TYPE> changes the array type.
arr.element TYPE changes the array type (by using the element keyword).
arr.element.x TYPE changes a nested field type in an array of structures.

The first two examples are hints to change the type of array arr to TYPE, but they use different syntax. In the third example, arr is an array of structures with a field x. This example shows how to change the type of x to TYPE.

Examples of schema hints for maps include:

m Map<KEY-TYPE, VALUE-TYPE> changes map key and value types.
m.key TYPE changes the type of map keys.
m.value TYPE changes the type of map values.
m.key.x TYPE changes the field type in a map key.
m.value.x TYPE changes the field type of a map value.

The first example changes both the key and value types of map m to KEY_TYPE and VALUE_TYPE respectively. The second and third examples can be used if only the key type or only the value type needs to be changed. In the fourth and fifth examples, m is a map with key and value of structure types with a field x. This example shows how to change the type of x to TYPE.

Avro file support for merge schemaâ

The Avro file format now supports the mergeSchema option when reading files. Setting mergeSchema to true when reading Avro files will infer a schema from a set of Avro files rather than from a single file. This improves usability by inferring a schema that may be able to read all files even if their individual schemas differ. See Configuration.

Auto Loader incremental listing support (Public Preview)â

In the case of lexicographically generated files, What is Auto Loader? now leverages lexical file ordering and existing optimized APIs to make the directory listing more efficient by listing from previously-ingested files rather than by listing the entire directory. Auto Loader automatically detects whether a given directory is suitable for incremental listing by default. To control this behavior explicitly, set the new cloudFiles.useIncrementalListing option to on (true), off (false), or automatic (auto). If you set this behavior to true, you can also set the cloudFiles.backfillInterval option to schedule regular backfills over your data, to make sure all of your data is completely ingested.

Delta now supports arbitrary replaceWhereâ

In combination with overwrite mode, the replaceWhere option can be used to simultaneously overwrite data that matches a predicate defined in the option. Previously, replaceWhere supported a predicate only over partition columns, but it can now be an arbitrary expression. See Write to a table.

Auto Loader for Google Cloud now supports file notifications (Public Preview)â

Auto Loader now supports file notification mode on Google Cloud. Set .option("cloudFiles.useNotifications", "true") to allow Auto Loader to automatically set up Google Cloud Pub/Sub resources for you. With file notification mode, new files are detected and ingested as they arrive without listing the input directory. See Configure Auto Loader streams in file notification mode.

CREATE FUNCTION now supports creating table functionsâ

In addition to creating a scalar function that returns a scalar value, you can now create a table function that returns a set of rows. See CREATE FUNCTION (SQL and Python).

Kafka Streaming Source now reports estimatedTotalBytesBehindLatest metricâ

The Kafka streaming source now reports an estimate of how many bytes the consumer is behind the latest available byte after every batch. You can use this metric to track stream progress. See Retrieve Kafka metrics.

Example metric output:

StreamingQueryProgress {
  "batchId": 0,
  .....
  "sources": [ {
    "description" : "KafkaV2[Subscribe[topic-0]]",
    "metrics":{
      "avgOffsetsBehindLatest" : "1.0",
      "estimatedTotalBytesBehindLatest" : "80.0", // new
      "maxOffsetsBehindLatest" : "1",
      "minOffsetsBehindLatest" : "1"
    } ],
  ....
 }

For structs inside of arrays, Delta MERGE INTO now resolves struct fields by name and evolves struct schemasâ

Delta MERGE INTO now supports resolution of struct fields by name and automatic schema evolution for arrays of structs. When automatic schema evolution is enabled by setting spark.databricks.delta.schema.autoMerge.enabled to true, UPDATE and INSERT clauses will resolve struct fields inside of an array by name, casting to the corresponding data type that is defined in the target array and filling additional or missing fields in the source or target with null values. When automatic schema evolution is disabled, UPDATE and INSERT clauses will resolve struct fields inside of an array by name but will not be able to evolve the additional fields. See Update Delta Lake table schema.

Bug fixesâ

Fixed a memory leak in the Amazon S3 connector that could happen in long running jobs or services, which was caused by JVM DeleteOnExit functionality.

Library upgradesâ

Upgraded Python libraries:
- plotly from 4.14.3 to 5.1.0
Upgraded R libraries:
- base from 4.1.0 to 4.1.1
- bslib from 0.2.5.1 to 0.3.0
- cachem from 1.0.5 to 1.0.6
- compiler from 4.1.0 to 4.1.1
- datasets from 4.1.0 to 4.1.1
- future from 1.21.0 to 1.22.1
- gert from 1.3.1 to 1.3.2
- graphics from 4.1.0 to 4.1.1
- grDevices from 4.1.0 to 4.1.1
- grid from 4.1.0 to 4.1.1
Upgraded Java libraries:
- org.eclipse.jetty from 9.4.36.v20210114 to 9.4.42.v20210604

Apache Sparkâ

Databricks Runtime 9.1 LTS includes Apache Spark 3.1.2. This release includes all Spark fixes and improvements included in Databricks Runtime 9.0 (EoS), as well as the following additional bug fixes and improvements made to Spark:

[SPARK-36674][SQL][CHERRY-PICK] Support ILIKE - case insensitive LIKE
[SPARK-36353][SQL][3.1] RemoveNoopOperators should keep output schema
[SPARK-35876][SQL][3.1] ArraysZip should retain field names to avoid being re-written by analyzer/optimizer
[SPARK-36398][SQL] Redact sensitive information in Spark Thrift Server log
[SPARK-36498][SQL] Reorder inner fields of the input query in byName V2 write
[SPARK-36614][CORE][UI] Correct executor loss reason caused by decommission in UI
[SPARK-36012][SQL] Add null flag in SHOW CREATE TABLE
[SPARK-36509][CORE] Fix the issue that executors are never re-scheduled if the worker stops with standalone cluster
[SPARK-36603][CORE] Use WeakReference not SoftReference in LevelDB
[SPARK-36564][CORE] Fix NullPointerException in LiveRDDDistribution.toApi
[SPARK-36086][SQL][3.1] CollapseProject project replace alias should use origin column name
[SPARK-33527][SQL] Extend the function of decode so as consistent with mainstream databases
[SPARK-36400][SPARK-36398][SQL][WEBUI] Make ThriftServer recognize spark.sql.redaction.string.regex
[SPARK-34054][CORE] BlockManagerDecommissioner code cleanup
[SPARK-36500][CORE] Fix temp_shuffle file leaking when a task is interrupted
[SPARK-36489][SQL] Aggregate functions over no grouping keys, on tables with a single bucket, return multiple rows
[SPARK-36464][CORE] Fix Underlying Size Variable Initialization in ChunkedByteBufferOutputStream for Writing Over 2GB Data
[SPARK-36339][SQL][3.0] References to grouping that not part of aggregation should be replaced
[SPARK-36354][CORE] EventLogFileReader should skip rolling event log directories with no logs
[SPARK-36242][CORE][3.1] Ensure spill file closed before set success = true in ExternalSorter.spillMemoryIteratorToDisk method
[SPARK-36211][PYTHON] Correct typing of udf return value
[SPARK-34222][SQL] Enhance boolean simplification rule
[SPARK-35027][CORE] Close the inputStream in FileAppender when writinâ¦
[SPARK-36269][SQL] Fix only set data columns to Hive column names config
[SPARK-36213][SQL] Normalize PartitionSpec for Describe Table Command with PartitionSpec
[SPARK-36210][SQL] Preserve column insertion order in Dataset.withColumns
[SPARK-36079][SQL] Null-based filter estimate should always be in the range [0, 1]
[SPARK-28266][SQL] convertToLogicalRelation should not interpret path property when reading Hive tables

Maintenance updatesâ

See Databricks Runtime 9.1 LTS maintenance updates.

System environmentâ

Operating System: Ubuntu 20.04.4 LTS
Java: Zulu 8.56.0.21-CA-linux64
Scala: 2.12.10
Python: 3.8.10
R: 4.1.1
Delta Lake: 1.0.0

Installed Python librariesâ Installed R librariesâ

R libraries are installed from the Microsoft CRAN snapshot on 2021-09-08. The snapshot is no longer available.

Installed Java and Scala libraries (Scala 2.12 cluster version)â

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4