RetroSearch Browse

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Showing content from http://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/functions.html below:

functions (Spark 4.0.0 JavaDoc)

org.apache.spark.sql.functions

public class functions extends Object

Commonly used functions available for DataFrame operations. Using functions defined here provides a little bit more compile-time safety to make sure the function exists.

You can call the functions defined here by two ways: _FUNC_(...) and functions.expr("_FUNC_(...)").

As an example, regr_count is a function that is defined here. You can use regr_count(col("yCol", col("xCol"))) to invoke the regr_count function. This way the programming language's compiler ensures regr_count exists and is of the proper form. You can also use expr("regr_count(yCol, xCol)") function to invoke the same function. In this case, Spark itself will ensure regr_count exists when it analyzes the query.

You can find the entire list of functions at SQL API documentation of your Spark version, see also the latest list

This function APIs usually have methods with Column signature only because it can support not only Column but also other types such as a native string. The other variants currently exist for historical reasons.

Since:: 1.3.0

Nested Class Summary
Nested Classes
Constructor Summary
Constructors
Method Summary
Computes the absolute value of a numeric value.

Returns the date that is numMonths after startDate.

Returns the date that is numMonths after startDate.

Returns a decrypted value of input.

Returns a decrypted value of input.

Returns a decrypted value of input.

Returns a decrypted value of input using AES in mode with padding.

Returns an encrypted value of input.

Returns an encrypted value of input.

Returns an encrypted value of input.

Returns an encrypted value of input.

Returns an encrypted value of input using AES in given mode with the specified padding.

Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.

Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.

Aggregate function: returns true if at least one value of e is true.

Aggregate function: returns some value of e for a group of rows.

Aggregate function: returns some value of e for a group of rows.

Aggregate function: returns the approximate number of distinct items in a group.

Aggregate function: returns the approximate number of distinct items in a group.

Aggregate function: returns the approximate number of distinct items in a group.

Aggregate function: returns the approximate number of distinct items in a group.

Aggregate function: returns the approximate percentile of the numeric column col which is the smallest value in the ordered col values (sorted from least to greatest) such that no more than percentage of col values is less than the value or equal to that value.

Creates a new array column.

Creates a new array column.

Creates a new array column.

Creates a new array column.

Aggregate function: returns a list of objects with duplicates.

Returns an ARRAY containing all elements from the source ARRAY as well as the new element.

Remove all null elements from the given array.

Returns null if the array is null, true if the array contains value, and false otherwise.

Removes duplicate values from the array.

Returns an array of the elements in the first array but not in the second array, without duplicates.

Adds an item into a given array at a specified position

Returns an array of the elements in the intersection of the given two arrays, without duplicates.

Concatenates the elements of column using the delimiter.

Concatenates the elements of column using the delimiter.

Returns the maximum value in the array.

Returns the minimum value in the array.

Locates the position of the first occurrence of the value in the given array as long.

Returns an array containing value as well as all elements from array.

Remove all elements that equal to element from the given array.

Creates an array containing the left argument repeated the number of times given by the right argument.

Creates an array containing the left argument repeated the number of times given by the right argument.

Returns the total number of elements in the array.

Sorts the input array in ascending order.

Sorts the input array based on the given comparator function.

Returns an array of the elements in the union of the given two arrays, without duplicates.

Returns true if a1 and a2 have at least one non-null element in common.

Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.

Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.

Returns a sort expression based on ascending order of the column.

Returns a sort expression based on ascending order of the column, and null values return before non-null values.

Returns a sort expression based on ascending order of the column, and null values appear after non-null values.

Computes the numeric value of the first character of the string column, and returns the result as an int column.

Returns null if the condition is true, and throws an exception otherwise.

Returns null if the condition is true; throws an exception with the error message otherwise.

Aggregate function: returns the average of the values in a group.

Aggregate function: returns the average of the values in a group.

Computes the BASE64 encoding of a binary column and returns it as a string column.

An expression that returns the string representation of the binary value of the given long column.

An expression that returns the string representation of the binary value of the given long column.

Aggregate function: returns the bitwise AND of all non-null input values, or null if none.

Returns the number of bits that are set in the argument expr as an unsigned 64-bit integer, or NULL if the argument is NULL.

Returns the value of the bit (0 or 1) at the specified position.

Calculates the bit length for the specified string column.

Aggregate function: returns the bitwise OR of all non-null input values, or null if none.

Aggregate function: returns the bitwise XOR of all non-null input values, or null if none.

Returns the bucket number for the given input column.

Returns the bit position for the given input column.

Returns a bitmap with the positions of the bits set from all the values from the input column.

Returns the number of set bits in the input bitmap.

Returns a bitmap that is the bitwise OR of all of the bitmaps from the input column.

Computes bitwise NOT (~) of a number.

Aggregate function: returns true if all values of e are true.

Aggregate function: returns true if at least one value of e is true.

Marks a DataFrame as small enough for use in broadcast joins.

Returns the value of the column e rounded to 0 decimal places with HALF_EVEN round mode.

Round the value of e to scale decimal places with HALF_EVEN round mode if scale is greater than or equal to 0 or at integral part when scale is less than 0.

Round the value of e to scale decimal places with HALF_EVEN round mode if scale is greater than or equal to 0 or at integral part when scale is less than 0.

Removes the leading and trailing space characters from str.

Remove the leading and trailing trim characters from str.

(Java-specific) A transform for any type that partitions by a hash of the input column.

(Java-specific) A transform for any type that partitions by a hash of the input column.

Call an user-defined function.

Call an user-defined function.

Call an user-defined function.

Returns length of array or map.

Computes the cube-root of the given column.

Computes the cube-root of the given value.

Computes the ceiling of the given value of e to 0 decimal places.

Computes the ceiling of the given value of e to 0 decimal places.

Computes the ceiling of the given value of e to scale decimal places.

Computes the ceiling of the given value of e to 0 decimal places.

Computes the ceiling of the given value of e to scale decimal places.

Returns the character length of string data or number of bytes of binary data.

Returns the character length of string data or number of bytes of binary data.

Returns the ASCII character having the binary equivalent to n.

Returns the first column that is not null, or null if all inputs are null.

Returns the first column that is not null, or null if all inputs are null.

Returns a
Column
based on the given column name.

Marks a given column with specified collation.

Returns the collation name of a given column.

Aggregate function: returns a list of objects with duplicates.

Aggregate function: returns a list of objects with duplicates.

Aggregate function: returns a set of objects with duplicate elements eliminated.

Aggregate function: returns a set of objects with duplicate elements eliminated.

Returns a
Column
based on the given column name.

Concatenates multiple input columns together into a single column.

Concatenates multiple input columns together into a single column.

Concatenates multiple input string columns together into a single string column, using the given separator.

Concatenates multiple input string columns together into a single string column, using the given separator.

Convert a number in a string column from one base to another.

Converts the timestamp without time zone sourceTs from the current time zone to targetTz.

Converts the timestamp without time zone sourceTs from the sourceTz time zone to targetTz.

Aggregate function: returns the Pearson Correlation Coefficient for two columns.

Aggregate function: returns the Pearson Correlation Coefficient for two columns.

Aggregate function: returns the number of items in a group.

Aggregate function: returns the number of items in a group.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the number of TRUE values for the expression.

Returns a count-min sketch of a column with the given esp, confidence and seed.

Returns a count-min sketch of a column with the given esp, confidence and seed.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the number of distinct items in a group.

Aggregate function: returns the population covariance for two columns.

Aggregate function: returns the population covariance for two columns.

Aggregate function: returns the sample covariance for two columns.

Aggregate function: returns the sample covariance for two columns.

Calculates the cyclic redundancy check value (CRC32) of a binary column and returns the value as a bigint.

Window function: returns the cumulative distribution of values within a window partition, i.e.

Returns the current date at the start of query evaluation as a date column.

Returns the current catalog.

Returns the current database.

Returns the current date at the start of query evaluation as a date column.

Returns the current schema.

Returns the current timestamp at the start of query evaluation as a timestamp column.

Returns the current session local timezone.

Returns the user name of current execution context.

Returns the date that is days days after start

Returns the date that is days days after start

Returns the number of days from start to end.

Converts a date/timestamp/string to a value of string in the format specified by the date format given by the second argument.

Create date from the number of days since 1970-01-01.

Extracts a part of the date/timestamp or interval source.

Returns the date that is days days before start

Returns the date that is days days before start

Returns timestamp truncated to the unit specified by the format.

Returns the date that is days days after start

Returns the number of days from start to end.

Extracts a part of the date/timestamp or interval source.

Extracts the day of the month as an integer from a given date/timestamp/string.

Extracts the three-letter abbreviated day name from a given date/timestamp/string.

Extracts the day of the month as an integer from a given date/timestamp/string.

Extracts the day of the week as an integer from a given date/timestamp/string.

Extracts the day of the year as an integer from a given date/timestamp/string.

(Java-specific) A transform for timestamps and dates to partition data into days.

Computes the first argument into a string from a binary using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16', 'UTF-32').

Converts an angle measured in radians to an approximately equivalent angle measured in degrees.

Converts an angle measured in radians to an approximately equivalent angle measured in degrees.

Window function: returns the rank of rows within a window partition, without any gaps.

Returns a sort expression based on the descending order of the column.

Returns a sort expression based on the descending order of the column, and null values appear before non-null values.

Returns a sort expression based on the descending order of the column, and null values appear after non-null values.

Returns element of array at given index in value if column is array.

Returns the n-th input, e.g., returns input2 when n is 2.
elt(scala.collection.immutable.Seq<Column> inputs)
Returns the n-th input, e.g., returns input2 when n is 2.

Computes the first argument into a binary from a string using the provided character set (one of 'US-ASCII', 'ISO-8859-1', 'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-16', 'UTF-32').

Returns same result as the EQUAL(=) operator for non-null operands, but returns true if both are null, false if one of the them is null.

Aggregate function: returns true if all values of e are true.

Returns whether a predicate holds for one or more elements in the array.

Computes the exponential of the given column.

Computes the exponential of the given value.

Creates a new row for each element in the given array or map column.

Creates a new row for each element in the given array or map column.

Computes the exponential of the given column minus one.

Computes the exponential of the given value minus one.

Extracts a part of the date/timestamp or interval source.

Computes the factorial of the given value.

Returns an array of elements for which a predicate holds in a given array.

Returns an array of elements for which a predicate holds in a given array.

Returns the index (1-based) of the given string (str) in the comma-delimited list (strArray).

Aggregate function: returns the first value of a column in a group.

Aggregate function: returns the first value of a column in a group.

Aggregate function: returns the first value in a group.

Aggregate function: returns the first value in a group.

Aggregate function: returns the first value in a group.

Aggregate function: returns the first value in a group.

Creates a single array from an array of arrays.

Computes the floor of the given column value to 0 decimal places.

Computes the floor of the given value of e to 0 decimal places.

Computes the floor of the given value of e to scale decimal places.

Returns whether a predicate holds for every element in the array.

Formats numeric column x to a format like '#,###,###.##', rounded to d decimal places with HALF_EVEN round mode, and returns the result as a string column.

Formats the arguments in printf-style and returns the result as a string column.

Formats the arguments in printf-style and returns the result as a string column.

(Java-specific) Parses a column containing a CSV string into a StructType with the specified schema.

Parses a column containing a CSV string into a StructType with the specified schema.

(Java-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType with the specified schema.

(Scala-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType with the specified schema.

(Scala-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType of StructTypes with the specified schema.

(Java-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType of StructTypes with the specified schema.

Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType with the specified schema.

(Java-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType with the specified schema.

(Scala-specific) Parses a column containing a JSON string into a MapType with StringType as keys type, StructType or ArrayType with the specified schema.

Parses a column containing a JSON string into a StructType with the specified schema.

(Java-specific) Parses a column containing a JSON string into a StructType with the specified schema.

(Scala-specific) Parses a column containing a JSON string into a StructType with the specified schema.

Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string representing the timestamp of that moment in the current system time zone in the yyyy-MM-dd HH:mm:ss format.

Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string representing the timestamp of that moment in the current system time zone in the given format.

Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in UTC, and renders that time as a timestamp in the given time zone.

Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in UTC, and renders that time as a timestamp in the given time zone.

(Java-specific) Parses a column containing a XML string into a StructType with the specified schema.

(Java-specific) Parses a column containing a XML string into a StructType with the specified schema.

(Java-specific) Parses a column containing a XML string into a StructType with the specified schema.

Parses a column containing a XML string into the data type corresponding to the specified schema.

Parses a column containing a XML string into the data type corresponding to the specified schema.

Returns element of array at given (0-based) index.

Extracts json object from a json string based on json path specified, and returns json string of the extracted json object.

Returns the value of the bit (0 or 1) at the specified position.

Returns the greatest value of the list of column names, skipping null values.

Returns the greatest value of the list of column names, skipping null values.

Returns the greatest value of the list of values, skipping null values.

Returns the greatest value of the list of values, skipping null values.

Aggregate function: indicates whether a specified column in a GROUP BY list is aggregated or not, returns 1 for aggregated or 0 for not aggregated in the result set.

Aggregate function: indicates whether a specified column in a GROUP BY list is aggregated or not, returns 1 for aggregated or 0 for not aggregated in the result set.

Aggregate function: returns the level of grouping, equals to

Aggregate function: returns the level of grouping, equals to

Aggregate function: returns the level of grouping, equals to

Aggregate function: returns the level of grouping, equals to

Calculates the hash code of given columns, and returns the result as an int column.

Calculates the hash code of given columns, and returns the result as an int column.

Computes hex value of the given column.

Aggregate function: computes a histogram on numeric 'expr' using nb bins.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch configured with default lgConfigK value.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch configured with lgConfigK arg.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch configured with default lgConfigK value.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch configured with lgConfigK arg.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch configured with lgConfigK arg.

Returns the estimated number of unique values given the binary representation of a Datasketches HllSketch.

Returns the estimated number of unique values given the binary representation of a Datasketches HllSketch.

Merges two binary representations of Datasketches HllSketch objects, using a Datasketches Union object.

Merges two binary representations of Datasketches HllSketch objects, using a Datasketches Union object.

Merges two binary representations of Datasketches HllSketch objects, using a Datasketches Union object.

Merges two binary representations of Datasketches HllSketch objects, using a Datasketches Union object.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch, generated by merging previously created Datasketches HllSketch instances via a Datasketches Union instance.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch, generated by merging previously created Datasketches HllSketch instances via a Datasketches Union instance.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch, generated by merging previously created Datasketches HllSketch instances via a Datasketches Union instance.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch, generated by merging previously created Datasketches HllSketch instances via a Datasketches Union instance.

Aggregate function: returns the updatable binary representation of the Datasketches HllSketch, generated by merging previously created Datasketches HllSketch instances via a Datasketches Union instance.

Extracts the hours as an integer from a given date/timestamp/string.

(Java-specific) A transform for timestamps to partition data into hours.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Computes sqrt(a^2^ + b^2^) without intermediate overflow or underflow.

Returns col2 if col1 is null, or col1 otherwise.

Returns true if str matches pattern with escapeChar('\') case-insensitively, null if any arguments are null, false otherwise.

Returns true if str matches pattern with escapeChar case-insensitively, null if any arguments are null, false otherwise.

Returns a new string column by converting the first letter of each word to uppercase.

Creates a new row for each element in the given array of structs.

Creates a new row for each element in the given array of structs.

Returns the length of the block being read, or -1 if not available.

Returns the start offset of the block being read, or -1 if not available.

Creates a string column for the file name of the current Spark task.

Locate the position of the first occurrence of substr column in the given string.

Locate the position of the first occurrence of substr column in the given string.

Returns true if the input is a valid UTF-8 string, otherwise returns false.

Check if a variant value is a variant null.

Return true iff the column is NaN.

Returns true if col is not null, or false otherwise.

Return true iff the column is null.

Calls a method with reflection.

Calls a method with reflection.

Returns the number of elements in the outermost JSON array.

Returns all the keys of the outermost JSON object as an array.

Creates a new row for a json column according to the given field names.

Creates a new row for a json column according to the given field names.

Aggregate function: returns the kurtosis of the values in a group.

Aggregate function: returns the kurtosis of the values in a group.

Window function: returns the value that is offset rows before the current row, and null if there is less than offset rows before the current row.

Window function: returns the value that is offset rows before the current row, and defaultValue if there is less than offset rows before the current row.

Window function: returns the value that is offset rows before the current row, and null if there is less than offset rows before the current row.

Window function: returns the value that is offset rows before the current row, and defaultValue if there is less than offset rows before the current row.

Window function: returns the value that is offset rows before the current row, and defaultValue if there is less than offset rows before the current row.

Aggregate function: returns the last value of the column in a group.

Aggregate function: returns the last value of the column in a group.

Aggregate function: returns the last value in a group.

Aggregate function: returns the last value in a group.

Returns the last day of the month which the given date belongs to.

Aggregate function: returns the last value in a group.

Aggregate function: returns the last value in a group.

Returns str with all characters changed to lowercase.

Window function: returns the value that is offset rows after the current row, and null if there is less than offset rows after the current row.

Window function: returns the value that is offset rows after the current row, and defaultValue if there is less than offset rows after the current row.

Window function: returns the value that is offset rows after the current row, and null if there is less than offset rows after the current row.

Window function: returns the value that is offset rows after the current row, and defaultValue if there is less than offset rows after the current row.

Window function: returns the value that is offset rows after the current row, and defaultValue if there is less than offset rows after the current row.

Returns the least value of the list of column names, skipping null values.

Returns the least value of the list of column names, skipping null values.

Returns the least value of the list of values, skipping null values.

Returns the least value of the list of values, skipping null values.

Returns the leftmost len(len can be string type) characters from the string str, if len is less or equal than 0 the result is an empty string.

Computes the character length of a given string or number of bytes of a binary string.

Computes the character length of a given string or number of bytes of a binary string.

Computes the Levenshtein distance of the two given string columns.

Computes the Levenshtein distance of the two given string columns if it's less than or equal to a given threshold.

Returns true if str matches pattern with escapeChar('\'), null if any arguments are null, false otherwise.

Returns true if str matches pattern with escapeChar, null if any arguments are null, false otherwise.

Aggregate function: returns the concatenation of non-null input values.

Aggregate function: returns the concatenation of non-null input values, separated by the delimiter.

Aggregate function: returns the concatenation of distinct non-null input values.

Aggregate function: returns the concatenation of distinct non-null input values, separated by the delimiter.

Creates a
Column
of literal value.

Computes the natural logarithm of the given value.

Returns the current timestamp without time zone at the start of query evaluation as a timestamp without time zone column.

Locate the position of the first occurrence of substr.

Locate the position of the first occurrence of substr in a string column, after position pos.

Returns the first argument-base logarithm of the second argument.

Returns the first argument-base logarithm of the second argument.

Computes the natural logarithm of the given column.

Computes the natural logarithm of the given value.

Computes the logarithm of the given value in base 10.

Computes the logarithm of the given value in base 10.

Computes the natural logarithm of the given column plus one.

Computes the natural logarithm of the given value plus one.

Computes the logarithm of the given value in base 2.

Computes the logarithm of the given column in base 2.

Converts a string column to lower case.

Left-pad the binary column with pad to a byte length of len.

Left-pad the string column with pad to a length of len.

Left-pad the string column with pad to a length of len.

Trim the spaces from left end for the specified string value.

Trim the specified character string from left end for the specified string column.

Trim the specified character string from left end for the specified string column.

Make DayTimeIntervalType duration.

Make DayTimeIntervalType duration from days.

Make DayTimeIntervalType duration from days and hours.

Make DayTimeIntervalType duration from days, hours and mins.

Make DayTimeIntervalType duration from days, hours, mins and secs.

Make interval from years.

Make interval from years and months.

Make interval from years, months and weeks.

Make interval from years, months, weeks and days.

Make interval from years, months, weeks, days and hours.

Make interval from years, months, weeks, days, hours and mins.

Make interval from years, months, weeks, days, hours, mins and secs.

Create timestamp from years, months, days, hours, mins and secs fields.

Create timestamp from years, months, days, hours, mins, secs and timezone fields.

Create the current timestamp with local time zone from years, months, days, hours, mins and secs fields.

Create the current timestamp with local time zone from years, months, days, hours, mins, secs and timezone fields.

Create local date-time from years, months, days, hours, mins, secs fields.

Returns a new string in which all invalid UTF-8 byte sequences, if any, are replaced by the Unicode replacement character (U+FFFD).

Make year-month interval.

Make year-month interval from years.

Make year-month interval from years, months.

Creates a new map column.
map(scala.collection.immutable.Seq<Column> cols)
Creates a new map column.

Returns the union of all the given maps.

Returns the union of all the given maps.

Returns true if the map contains the key.

Returns an unordered array of all entries in the given map.

Returns a map whose key-value pairs satisfy a predicate.

Creates a new map column.

Returns a map created from the given array of entries.

Returns an unordered array containing the keys of the map.

Returns an unordered array containing the values of the map.

Merge two given maps, key-wise into a single map using a function.

Masks the given string value.

Masks the given string value.

Masks the given string value.

Masks the given string value.

Masks the given string value.

Aggregate function: returns the maximum value of the column in a group.

Aggregate function: returns the maximum value of the expression in a group.

Aggregate function: returns the value associated with the maximum value of ord.

Calculates the MD5 digest of a binary column and returns the value as a 32 character hex string.

Aggregate function: returns the average of the values in a group.

Aggregate function: returns the average of the values in a group.

Aggregate function: returns the median of the values in a group.

Aggregate function: returns the minimum value of the column in a group.

Aggregate function: returns the minimum value of the expression in a group.

Aggregate function: returns the value associated with the minimum value of ord.

Extracts the minutes as an integer from a given date/timestamp/string.

Aggregate function: returns the most frequent value in a group.

Aggregate function: returns the most frequent value in a group.

A column expression that generates monotonically increasing 64-bit integers.

Extracts the month as an integer from a given date/timestamp/string.

Extracts the three-letter abbreviated month name from a given date/timestamp/string.

(Java-specific) A transform for timestamps and dates to partition data into months.

Returns number of months between dates start and end.

Returns number of months between dates end and start.

Creates a struct with the given field names and values.

Creates a struct with the given field names and values.

Returns col1 if it is not NaN, or col2 if col1 is NaN.

Returns the negated value.

Returns the first date which is later than the value of the date column that is on the specified day of the week.

Returns the first date which is later than the value of the date column that is on the specified day of the week.

Inversion of boolean expression, i.e.

Returns the current timestamp at the start of query evaluation.

Window function: returns the value that is the offsetth row of the window frame (counting from 1), and null if the size of window frame is less than offset rows.

Window function: returns the value that is the offsetth row of the window frame (counting from 1), and null if the size of window frame is less than offset rows.

Window function: returns the ntile group id (from 1 to n inclusive) in an ordered window partition.

Returns null if col1 equals to col2, or col1 otherwise.

Returns null if col is equal to zero, or col otherwise.

Returns col2 if col1 is null, or col1 otherwise.

Returns col2 if col1 is not null, or col3 otherwise.

Calculates the byte length for the specified string column.

Overlay the specified portion of src with replace, starting from byte position pos of src.

Overlay the specified portion of src with replace, starting from byte position pos of src and proceeding for len bytes.

Parses a JSON string and constructs a Variant value.

Extracts a part from a URL.

Extracts a part from a URL.

Window function: returns the relative rank (i.e.

Aggregate function: returns the exact percentile(s) of numeric column expr at the given percentage(s) with value range in [0.0, 1.0].

Aggregate function: returns the exact percentile(s) of numeric column expr at the given percentage(s) with value range in [0.0, 1.0].

Aggregate function: returns the approximate percentile of the numeric column col which is the smallest value in the ordered col values (sorted from least to greatest) such that no more than percentage of col values is less than the value or equal to that value.

Returns the positive value of dividend mod divisor.

Creates a new row for each element with position in the given array or map column.

Creates a new row for each element with position in the given array or map column.

Returns the position of the first occurrence of substr in str after position 1.

Returns the position of the first occurrence of substr in str after position start.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Returns the value of the first argument raised to the power of the second argument.

Formats the arguments in printf-style and returns the result as a string column.

Formats the arguments in printf-style and returns the result as a string column.

Aggregate function: returns the product of all numerical elements in a group.

Extracts the quarter as an integer from a given date/timestamp/string.

Converts an angle measured in degrees to an approximately equivalent angle measured in radians.

Converts an angle measured in degrees to an approximately equivalent angle measured in radians.

Throws an exception with the provided error message.

Generate a random column with independent and identically distributed (i.i.d.) samples uniformly distributed in [0.0, 1.0).

Generate a random column with independent and identically distributed (i.i.d.) samples uniformly distributed in [0.0, 1.0).

Generate a column with independent and identically distributed (i.i.d.) samples from the standard normal distribution.

Generate a column with independent and identically distributed (i.i.d.) samples from the standard normal distribution.

Returns a random value with independent and identically distributed (i.i.d.) uniformly distributed values in [0, 1).

Returns a random value with independent and identically distributed (i.i.d.) uniformly distributed values in [0, 1).

Returns a string of the specified length whose characters are chosen uniformly at random from the following pool of characters: 0-9, a-z, A-Z.

Returns a string of the specified length whose characters are chosen uniformly at random from the following pool of characters: 0-9, a-z, A-Z, with the chosen random seed.

Window function: returns the rank of rows within a window partition.

Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.

Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.

Calls a method with reflection.

Calls a method with reflection.

Returns true if str matches regexp, or false otherwise.

Returns a count of the number of times that the regular expression pattern regexp is matched in the string str.

Extract a specific group matched by a Java regex, from the specified string column.

Extract all strings in the str that match the regexp expression and corresponding to the first regex group index.

Extract all strings in the str that match the regexp expression and corresponding to the regex group index.

Searches a string for a regular expression and returns an integer that indicates the beginning position of the matched substring.

Searches a string for a regular expression and returns an integer that indicates the beginning position of the matched substring.

Returns true if str matches regexp, or false otherwise.

Replace all substrings of the specified string value that match regexp with rep.

Replace all substrings of the specified string value that match regexp with rep.

Returns the substring that matches the regular expression regexp within the string str.

Aggregate function: returns the average of the independent variable for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns the average of the independent variable for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns the number of non-null number pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns the intercept of the univariate linear regression line for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns the coefficient of determination for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns the slope of the linear regression line for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(x) for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns REGR_COUNT(y, x) * COVAR_POP(y, x) for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(y) for non-null pairs in a group, where y is the dependent variable and x is the independent variable.

Repeats a string column n times, and returns it as a new string column.

Repeats a string column n times, and returns it as a new string column.

Replaces all occurrences of search with replace.

Replaces all occurrences of search with replace.

Returns a reversed string or an array with reverse order of elements.

Returns the rightmost len(len can be string type) characters from the string str, if len is less or equal than 0 the result is an empty string.

Returns the double value that is closest in value to the argument and is equal to a mathematical integer.

Returns the double value that is closest in value to the argument and is equal to a mathematical integer.

Returns true if str matches regexp, or false otherwise.

Returns the value of the column e rounded to 0 decimal places with HALF_UP round mode.

Round the value of e to scale decimal places with HALF_UP round mode if scale is greater than or equal to 0 or at integral part when scale is less than 0.

Round the value of e to scale decimal places with HALF_UP round mode if scale is greater than or equal to 0 or at integral part when scale is less than 0.

Window function: returns a sequential number starting at 1 within a window partition.

Right-pad the binary column with pad to a byte length of len.

Right-pad the string column with pad to a length of len.

Right-pad the string column with pad to a length of len.

Trim the spaces from right end for the specified string value.

Trim the specified character string from right end for the specified string column.

Trim the specified character string from right end for the specified string column.

Parses a CSV string and infers its schema in DDL format.

Parses a CSV string and infers its schema in DDL format.

Parses a CSV string and infers its schema in DDL format using options.

Parses a JSON string and infers its schema in DDL format.

Parses a JSON string and infers its schema in DDL format.

Parses a JSON string and infers its schema in DDL format using options.

Returns schema in the SQL format of a variant.

Returns the merged schema in the SQL format of a variant column.

Parses a XML string and infers its schema in DDL format.

Parses a XML string and infers its schema in DDL format.

Parses a XML string and infers its schema in DDL format using options.

Extracts the seconds as an integer from a given date/timestamp/string.

Splits a string into arrays of sentences, where each sentence is an array of words.

Splits a string into arrays of sentences, where each sentence is an array of words.

Splits a string into arrays of sentences, where each sentence is an array of words.

Generate a sequence of integers from start to stop, incrementing by 1 if start is less than or equal to stop, otherwise -1.

Generate a sequence of integers from start to stop, incrementing by step.

Returns the user name of current execution context.

Generates session window given a timestamp specifying column.

Generates session window given a timestamp specifying column.

Returns a sha1 hash value as a hex string of the col.

Calculates the SHA-1 digest of a binary column and returns the value as a 40 character hex string.

Calculates the SHA-2 family of hash functions of a binary column and returns the value as a hex string.

Shift the given value numBits left.

(Signed) shift the given value numBits right.

Unsigned shift the given value numBits right.

Returns a random permutation of the given array.

Returns a random permutation of the given array.

Computes the signum of the given value.

Computes the signum of the given column.

Computes the signum of the given value.

Returns length of array or map.

Aggregate function: returns the skewness of the values in a group.

Aggregate function: returns the skewness of the values in a group.

Returns an array containing all the elements in x from index start (or starting from the end if start is negative) with the specified length.

Returns an array containing all the elements in x from index start (or starting from the end if start is negative) with the specified length.

Aggregate function: returns true if at least one value of e is true.

Sorts the input array for the given column in ascending order, according to the natural ordering of the array elements.

Sorts the input array for the given column in ascending or descending order, according to the natural ordering of the array elements.

Returns the soundex code for the specified expression.

Splits str around matches of the given pattern.

Splits str around matches of the given pattern.

Splits str around matches of the given pattern.

Splits str around matches of the given pattern.

Splits str by delimiter and return requested part of the split (1-based).

Computes the square root of the specified float value.

Computes the square root of the specified float value.

Separates col1, ..., colk into n rows.

Separates col1, ..., colk into n rows.

Aggregate function: alias for stddev_samp.

Aggregate function: alias for stddev_samp.

Aggregate function: alias for stddev_samp.

Aggregate function: returns the population standard deviation of the expression in a group.

Aggregate function: returns the population standard deviation of the expression in a group.

Aggregate function: returns the sample standard deviation of the expression in a group.

Aggregate function: returns the sample standard deviation of the expression in a group.

Creates a map after splitting the text into key/value pairs using delimiters.

Creates a map after splitting the text into key/value pairs using delimiters.

Creates a map after splitting the text into key/value pairs using delimiters.

Aggregate function: returns the concatenation of non-null input values.

Aggregate function: returns the concatenation of non-null input values, separated by the delimiter.

Aggregate function: returns the concatenation of distinct non-null input values.

Aggregate function: returns the concatenation of distinct non-null input values, separated by the delimiter.

Creates a new struct column that composes multiple input columns.

Creates a new struct column that composes multiple input columns.

Creates a new struct column.

Creates a new struct column.

Returns the substring of str that starts at pos, or the slice of byte array that starts at pos.

Returns the substring of str that starts at pos and is of length len, or the slice of byte array that starts at pos and is of length len.

Substring starts at pos and is of length len when str is String type or returns the slice of byte array that starts at pos in byte and is of length len when str is Binary type

Substring starts at pos and is of length len when str is String type or returns the slice of byte array that starts at pos in byte and is of length len when str is Binary type

Returns the substring from string str before count occurrences of the delimiter delim.

Aggregate function: returns the sum of all values in the given column.

Aggregate function: returns the sum of all values in the expression.

Aggregate function: returns the sum of distinct values in the expression.

Adds the specified number of units to the given timestamp.

Gets the difference between the timestamps in the specified units by truncating the fraction part.

Creates timestamp from the number of microseconds since UTC epoch.

Creates timestamp from the number of milliseconds since UTC epoch.

Converts the number of seconds from the Unix epoch (1970-01-01T00:00:00Z) to a timestamp.

Converts the input e to a binary value based on the default format "hex".

Converts the input e to a binary value based on the supplied format.

Convert e to a string based on the format.

Converts a column containing a StructType into a CSV string with the specified schema.

(Java-specific) Converts a column containing a StructType into a CSV string with the specified schema.

Converts the column into DateType by casting rules to DateType.

Converts the column into a DateType with a specified format

Converts a column containing a StructType, ArrayType or a MapType into a JSON string with the specified schema.

(Java-specific) Converts a column containing a StructType, ArrayType or a MapType into a JSON string with the specified schema.

(Scala-specific) Converts a column containing a StructType, ArrayType or a MapType into a JSON string with the specified schema.

Convert string 'e' to a number based on the string format 'format'.

Converts to a timestamp by casting rules to TimestampType.

Converts time string with the given pattern to timestamp.

Parses the timestamp expression with the default format to a timestamp without time zone.

Parses the timestamp expression with the format expression to a timestamp without time zone.

Parses the timestamp expression with the default format to a timestamp without time zone.

Parses the timestamp_str expression with the format expression to a timestamp without time zone.

Returns the UNIX timestamp of the given time.

Returns the UNIX timestamp of the given time.

Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in the given time zone, and renders that time as a timestamp in UTC.

Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in the given time zone, and renders that time as a timestamp in UTC.

Convert e to a string based on the format.

Converts a column containing nested inputs (array/map/struct) into a variants where maps and structs are converted to variant objects which are unordered unlike SQL structs.

Converts a column containing a StructType into a XML string with the specified schema.

(Java-specific) Converts a column containing a StructType into a XML string with the specified schema.

Returns an array of elements after applying a transformation to each element in the input array.

Returns an array of elements after applying a transformation to each element in the input array.

Applies a function to every key-value pair in a map and returns a map with the results of those applications as the new keys for the pairs.

Applies a function to every key-value pair in a map and returns a map with the results of those applications as the new values for the pairs.

Translate any character in the src by a character in replaceString.

Trim the spaces from both ends for the specified string column.

Trim the specified character from both ends for the specified string column.

Trim the specified character from both ends for the specified string column.

Returns date truncated to the unit specified by the format.

Returns the sum of left and right and the result is null on overflow.

Returns a decrypted value of input.

Returns a decrypted value of input.

Returns a decrypted value of input.

This is a special version of aes_decrypt that performs the same operation, but returns a NULL value instead of raising an error if the decryption cannot be performed.

Returns the mean calculated from values of a group and the result is null on overflow.

Returns dividend/divisor.

(array, index) - Returns element of array at given (1-based) index.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

This is a special version of make_interval that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.

Try to create a timestamp from years, months, days, hours, mins, and secs fields.

Try to create a timestamp from years, months, days, hours, mins, secs and timezone fields.

Try to create the current timestamp with local time zone from years, months, days, hours, mins and secs fields.

Try to create the current timestamp with local time zone from years, months, days, hours, mins, secs and timezone fields.

Try to create a local date-time from years, months, days, hours, mins, secs fields.

Returns the remainder of dividend/divisor.

Returns left*right and the result is null on overflow.

Parses a JSON string and constructs a Variant value.

Extracts a part from a URL.

Extracts a part from a URL.

This is a special version of reflect that performs the same operation, but returns a NULL value instead of raising an error if the invoke method thrown exception.

This is a special version of reflect that performs the same operation, but returns a NULL value instead of raising an error if the invoke method thrown exception.

Returns left-right and the result is null on overflow.

Returns the sum calculated from values of a group and the result is null on overflow.

This is a special version of to_binary that performs the same operation, but returns a NULL value instead of raising an error if the conversion cannot be performed.

This is a special version of to_binary that performs the same operation, but returns a NULL value instead of raising an error if the conversion cannot be performed.

Convert string e to a number based on the string format format.

Parses the s to a timestamp.

Parses the s with the format to a timestamp.

This is a special version of url_decode that performs the same operation, but returns a NULL value instead of raising an error if the decoding cannot be performed.

Returns the input value if it corresponds to a valid UTF-8 string, or NULL otherwise.

Extracts a sub-variant from v according to path string, and then cast the sub-variant to targetType.

Extracts a sub-variant from v according to path column, and then cast the sub-variant to targetType.
typedlit(T literal, scala.reflect.api.TypeTags.TypeTag<T> evidence$2)
Creates a
Column
of literal value.
typedLit(T literal, scala.reflect.api.TypeTags.TypeTag<T> evidence$1)
Creates a
Column
of literal value.

Return DDL-formatted type string for the data type of the input.

Returns str with all characters changed to uppercase.

Obtains a UserDefinedFunction that wraps the given Aggregator so that it may be used with untyped Data Frames.
udaf(Aggregator<IN,BUF,OUT> agg, scala.reflect.api.TypeTags.TypeTag<IN> evidence$3)
Obtains a UserDefinedFunction that wraps the given Aggregator so that it may be used with untyped Data Frames.

Defines a Java UDF0 instance as user-defined function (UDF).

Defines a Java UDF1 instance as user-defined function (UDF).
udf(UDF10<?,?,?,?,?,?,?,?,?,?,?> f, DataType returnType)
Defines a Java UDF10 instance as user-defined function (UDF).

Defines a Java UDF2 instance as user-defined function (UDF).

Defines a Java UDF3 instance as user-defined function (UDF).

Defines a Java UDF4 instance as user-defined function (UDF).

Defines a Java UDF5 instance as user-defined function (UDF).

Defines a Java UDF6 instance as user-defined function (UDF).

Defines a Java UDF7 instance as user-defined function (UDF).

Defines a Java UDF8 instance as user-defined function (UDF).
udf(UDF9<?,?,?,?,?,?,?,?,?,?> f, DataType returnType)
Defines a Java UDF9 instance as user-defined function (UDF).
udf(scala.Function0<RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$4)
Defines a Scala closure of 0 arguments as user-defined function (UDF).
udf(scala.Function1<A1,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$5, scala.reflect.api.TypeTags.TypeTag<A1> evidence$6)
Defines a Scala closure of 1 arguments as user-defined function (UDF).
static <RT, A1, A2, A3, A4, A5, A6, A7, A8, A9, A10> UserDefinedFunction udf(scala.Function10<A1,A2,A3,A4,A5,A6,A7,A8,A9,A10,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$59, scala.reflect.api.TypeTags.TypeTag<A1> evidence$60, scala.reflect.api.TypeTags.TypeTag<A2> evidence$61, scala.reflect.api.TypeTags.TypeTag<A3> evidence$62, scala.reflect.api.TypeTags.TypeTag<A4> evidence$63, scala.reflect.api.TypeTags.TypeTag<A5> evidence$64, scala.reflect.api.TypeTags.TypeTag<A6> evidence$65, scala.reflect.api.TypeTags.TypeTag<A7> evidence$66, scala.reflect.api.TypeTags.TypeTag<A8> evidence$67, scala.reflect.api.TypeTags.TypeTag<A9> evidence$68, scala.reflect.api.TypeTags.TypeTag<A10> evidence$69)
Defines a Scala closure of 10 arguments as user-defined function (UDF).
udf(scala.Function2<A1,A2,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$7, scala.reflect.api.TypeTags.TypeTag<A1> evidence$8, scala.reflect.api.TypeTags.TypeTag<A2> evidence$9)
Defines a Scala closure of 2 arguments as user-defined function (UDF).
udf(scala.Function3<A1,A2,A3,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$10, scala.reflect.api.TypeTags.TypeTag<A1> evidence$11, scala.reflect.api.TypeTags.TypeTag<A2> evidence$12, scala.reflect.api.TypeTags.TypeTag<A3> evidence$13)
Defines a Scala closure of 3 arguments as user-defined function (UDF).
udf(scala.Function4<A1,A2,A3,A4,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$14, scala.reflect.api.TypeTags.TypeTag<A1> evidence$15, scala.reflect.api.TypeTags.TypeTag<A2> evidence$16, scala.reflect.api.TypeTags.TypeTag<A3> evidence$17, scala.reflect.api.TypeTags.TypeTag<A4> evidence$18)
Defines a Scala closure of 4 arguments as user-defined function (UDF).
udf(scala.Function5<A1,A2,A3,A4,A5,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$19, scala.reflect.api.TypeTags.TypeTag<A1> evidence$20, scala.reflect.api.TypeTags.TypeTag<A2> evidence$21, scala.reflect.api.TypeTags.TypeTag<A3> evidence$22, scala.reflect.api.TypeTags.TypeTag<A4> evidence$23, scala.reflect.api.TypeTags.TypeTag<A5> evidence$24)
Defines a Scala closure of 5 arguments as user-defined function (UDF).
udf(scala.Function6<A1,A2,A3,A4,A5,A6,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$25, scala.reflect.api.TypeTags.TypeTag<A1> evidence$26, scala.reflect.api.TypeTags.TypeTag<A2> evidence$27, scala.reflect.api.TypeTags.TypeTag<A3> evidence$28, scala.reflect.api.TypeTags.TypeTag<A4> evidence$29, scala.reflect.api.TypeTags.TypeTag<A5> evidence$30, scala.reflect.api.TypeTags.TypeTag<A6> evidence$31)
Defines a Scala closure of 6 arguments as user-defined function (UDF).
udf(scala.Function7<A1,A2,A3,A4,A5,A6,A7,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$32, scala.reflect.api.TypeTags.TypeTag<A1> evidence$33, scala.reflect.api.TypeTags.TypeTag<A2> evidence$34, scala.reflect.api.TypeTags.TypeTag<A3> evidence$35, scala.reflect.api.TypeTags.TypeTag<A4> evidence$36, scala.reflect.api.TypeTags.TypeTag<A5> evidence$37, scala.reflect.api.TypeTags.TypeTag<A6> evidence$38, scala.reflect.api.TypeTags.TypeTag<A7> evidence$39)
Defines a Scala closure of 7 arguments as user-defined function (UDF).
udf(scala.Function8<A1,A2,A3,A4,A5,A6,A7,A8,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$40, scala.reflect.api.TypeTags.TypeTag<A1> evidence$41, scala.reflect.api.TypeTags.TypeTag<A2> evidence$42, scala.reflect.api.TypeTags.TypeTag<A3> evidence$43, scala.reflect.api.TypeTags.TypeTag<A4> evidence$44, scala.reflect.api.TypeTags.TypeTag<A5> evidence$45, scala.reflect.api.TypeTags.TypeTag<A6> evidence$46, scala.reflect.api.TypeTags.TypeTag<A7> evidence$47, scala.reflect.api.TypeTags.TypeTag<A8> evidence$48)
Defines a Scala closure of 8 arguments as user-defined function (UDF).
udf(scala.Function9<A1,A2,A3,A4,A5,A6,A7,A8,A9,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$49, scala.reflect.api.TypeTags.TypeTag<A1> evidence$50, scala.reflect.api.TypeTags.TypeTag<A2> evidence$51, scala.reflect.api.TypeTags.TypeTag<A3> evidence$52, scala.reflect.api.TypeTags.TypeTag<A4> evidence$53, scala.reflect.api.TypeTags.TypeTag<A5> evidence$54, scala.reflect.api.TypeTags.TypeTag<A6> evidence$55, scala.reflect.api.TypeTags.TypeTag<A7> evidence$56, scala.reflect.api.TypeTags.TypeTag<A8> evidence$57, scala.reflect.api.TypeTags.TypeTag<A9> evidence$58)
Defines a Scala closure of 9 arguments as user-defined function (UDF).

Decodes a BASE64 encoded string column and returns it as a binary column.

Returns a random value with independent and identically distributed (i.i.d.) values with the specified range of numbers.

Returns a random value with independent and identically distributed (i.i.d.) values with the specified range of numbers, with the chosen random seed.

Returns the number of days since 1970-01-01.

Returns the number of microseconds since 1970-01-01 00:00:00 UTC.

Returns the number of milliseconds since 1970-01-01 00:00:00 UTC.

Returns the number of seconds since 1970-01-01 00:00:00 UTC.

Returns the current Unix timestamp (in seconds) as a long.

Converts time string in format yyyy-MM-dd HH:mm:ss to Unix timestamp (in seconds), using the default timezone and the default locale.

Converts time string with given pattern to Unix timestamp (in seconds).

Unwrap UDT data type column into its underlying type.

Converts a string column to upper case.

Decodes a str in 'application/x-www-form-urlencoded' format using a specific encoding scheme.

Translates a string into 'application/x-www-form-urlencoded' format using a specific encoding scheme.

Returns the user name of current execution context.

Returns an universally unique identifier (UUID) string.

Returns the input value if it corresponds to a valid UTF-8 string, or emits a SparkIllegalArgumentException exception otherwise.

Aggregate function: returns the population variance of the values in a group.

Aggregate function: returns the population variance of the values in a group.

Aggregate function: returns the unbiased variance of the values in a group.

Aggregate function: returns the unbiased variance of the values in a group.

Aggregate function: alias for var_samp.

Aggregate function: alias for var_samp.

Extracts a sub-variant from v according to path string, and then cast the sub-variant to targetType.

Extracts a sub-variant from v according to path column, and then cast the sub-variant to targetType.

Returns the Spark version.

Returns the day of the week for date/timestamp (0 = Monday, 1 = Tuesday, ..., 6 = Sunday).

Extracts the week number as an integer from a given date/timestamp/string.

Evaluates a list of conditions and returns one of multiple possible result expressions.

Returns the bucket number into which the value of this expression would fall after being evaluated.

Generates tumbling time windows given a timestamp specifying column.

Bucketize rows into one or more time windows given a timestamp specifying column.

Bucketize rows into one or more time windows given a timestamp specifying column.

Extracts the event time from the window column.

Returns a string array of values within the nodes of xml that match the XPath expression.

Returns true if the XPath expression evaluates to true, or if a matching node is found.

Returns a double value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.

Returns a float value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.

Returns an integer value, or the value zero if no match is found, or a match is found but the value is non-numeric.

Returns a long integer value, or the value zero if no match is found, or a match is found but the value is non-numeric.

Returns a double value, the value zero if no match is found, or NaN if a match is found but the value is non-numeric.

Returns a short integer value, or the value zero if no match is found, or a match is found but the value is non-numeric.

Returns the text contents of the first xml node that matches the XPath expression.

Calculates the hash code of given columns using the 64-bit variant of the xxHash algorithm, and returns the result as a long column.

Calculates the hash code of given columns using the 64-bit variant of the xxHash algorithm, and returns the result as a long column.

Extracts the year as an integer from a given date/timestamp/string.

(Java-specific) A transform for timestamps and dates to partition data into years.

Returns zero if col is null, or col otherwise.

Merge two given arrays, element-wise, into a single array using a function.

Constructor Details
- functions
  public functions()
Method Details
- countDistinct
  Aggregate function: returns the number of distinct items in a group.
  
  An alias of count_distinct, and it is encouraged to use count_distinct directly.
  
  Parameters:
  
  expr - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- countDistinct
  Aggregate function: returns the number of distinct items in a group.
  
  An alias of count_distinct, and it is encouraged to use count_distinct directly.
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- count_distinct
  
  Parameters:
  
  expr - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- grouping_id
  Aggregate function: returns the level of grouping, equals to
```
   (grouping(c1) <<; (n-1)) + (grouping(c2) <<; (n-2)) + ... + grouping(cn)
 
```
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The list of columns should match with grouping columns exactly, or empty (means all the grouping columns).
- grouping_id
  Aggregate function: returns the level of grouping, equals to
```
   (grouping(c1) <<; (n-1)) + (grouping(c2) <<; (n-2)) + ... + grouping(cn)
 
```
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The list of columns should match with grouping columns exactly.
- array
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- array
  
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- map
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0
- named_struct
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- coalesce
  Returns the first column that is not null, or null if all inputs are null.
  
  For example, coalesce(a, b, c) will return a if a is not null, or b if a is null and b is not null, or c if both a and b are null but c is not null.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- struct
  Creates a new struct column. If the input column is a column in a
  DataFrame
  , or a derived column expression that is named (i.e. aliased), its name would be retained as the StructField's name, otherwise, the newly generated StructField's name would be auto generated as
  col
  with a suffix
  index + 1
  , i.e. col1, col2, col3, ...
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- struct
  
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- greatest
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- greatest
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- least
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- least
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- hash
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- xxhash64
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- reflect
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- java_method
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_reflect
  This is a special version of
  reflect
  that performs the same operation, but returns a NULL value instead of raising an error if the invoke method thrown exception.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- stack
  Separates
  col1
  , ...,
  colk
  into
  n
  rows. Uses column names col0, col1, etc. by default unless specified otherwise.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- concat_ws
  
  Parameters:
  
  sep - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  Input strings which are null are skipped.
- format_string
  
  Parameters:
  
  format - (undocumented)
  
  arguments - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- printf
  
  Parameters:
  
  format - (undocumented)
  
  arguments - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- elt
  Returns the
  n
  -th input, e.g., returns
  input2
  when
  n
  is 2. The function returns NULL if the index exceeds the length of the array and
  spark.sql.ansi.enabled
  is set to false. If
  spark.sql.ansi.enabled
  is set to true, it throws ArrayIndexOutOfBoundsException for invalid indices.
  
  Parameters:
  
  inputs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- concat
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  Returns null if any of the input columns are null.
- json_tuple
  
  Parameters:
  
  json - (undocumented)
  
  fields - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- arrays_zip
  Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- map_concat
  Returns the union of all the given maps.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- callUDF
  
  Parameters:
  
  udfName - (undocumented)
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- call_udf
  Call an user-defined function. Example:
```
  import org.apache.spark.sql._

  val df = Seq(("id1", 1), ("id2", 4), ("id3", 5)).toDF("id", "value")
  val spark = df.sparkSession
  spark.udf.register("simpleUDF", (v: Int) => v * v)
  df.select($"id", call_udf("simpleUDF", $"value"))
 
```
  Parameters:
  
  udfName - (undocumented)
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- call_function
  
  Parameters:
  
  funcName - function name that follows the SQL identifier syntax (can be quoted, can be qualified)
  
  cols - the expression parameters of function
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- col
  Returns a
  Column
  based on the given column name.
  
  Parameters:
  
  colName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- column
  
  Parameters:
  
  colName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- lit
  Creates a
  Column
  of literal value.
  
  The passed in object is returned directly if it is already a Column. If the object is a Scala Symbol, it is converted into a Column also. Otherwise, a new Column is created to represent the literal value.
  
  Parameters:
  
  literal - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- typedLit public static <T> Column typedLit(T literal, scala.reflect.api.TypeTags.TypeTag<T> evidence$1)
  Creates a
  Column
  of literal value.
  
  An alias of typedlit, and it is encouraged to use typedlit directly.
  
  Parameters:
  
  literal - (undocumented)
  
  evidence$1 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- typedlit public static <T> Column typedlit(T literal, scala.reflect.api.TypeTags.TypeTag<T> evidence$2)
  Creates a
  Column
  of literal value.
  
  The passed in object is returned directly if it is already a Column. If the object is a Scala Symbol, it is converted into a Column also. Otherwise, a new Column is created to represent the literal value. The difference between this function and lit(java.lang.Object) is that this function can handle parameterized scala types e.g.: List, Seq and Map.
  
  Parameters:
  
  literal - (undocumented)
  
  evidence$2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
  
  Note:
  
  typedlit will call expensive Scala reflection APIs. lit is preferred if parameterized Scala types are not used.
- asc
  Returns a sort expression based on ascending order of the column.
```
   df.sort(asc("dept"), desc("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- asc_nulls_first
  Returns a sort expression based on ascending order of the column, and null values return before non-null values.
```
   df.sort(asc_nulls_first("dept"), desc("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- asc_nulls_last
  Returns a sort expression based on ascending order of the column, and null values appear after non-null values.
```
   df.sort(asc_nulls_last("dept"), desc("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- desc
  Returns a sort expression based on the descending order of the column.
```
   df.sort(asc("dept"), desc("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- desc_nulls_first
  Returns a sort expression based on the descending order of the column, and null values appear before non-null values.
```
   df.sort(asc("dept"), desc_nulls_first("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- desc_nulls_last
  Returns a sort expression based on the descending order of the column, and null values appear after non-null values.
```
   df.sort(asc("dept"), desc_nulls_last("age"))
 
```
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- approxCountDistinct
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- approxCountDistinct
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- approxCountDistinct public static Column approxCountDistinct(Column e, double rsd)
  
  Parameters:
  
  e - (undocumented)
  
  rsd - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- approxCountDistinct public static Column approxCountDistinct(String columnName, double rsd)
  
  Parameters:
  
  columnName - (undocumented)
  
  rsd - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- approx_count_distinct
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- approx_count_distinct public static Column approx_count_distinct(String columnName)
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- approx_count_distinct public static Column approx_count_distinct(Column e, double rsd)
  
  Parameters:
  
  rsd - maximum relative standard deviation allowed (default = 0.05)
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- approx_count_distinct public static Column approx_count_distinct(String columnName, double rsd)
  
  Parameters:
  
  rsd - maximum relative standard deviation allowed (default = 0.05)
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- avg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- avg
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- collect_list
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
  
  Note:
  
  The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.
- collect_list
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
  
  Note:
  
  The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.
- collect_set
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
  
  Note:
  
  The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.
- collect_set
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
  
  Note:
  
  The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.
- count_min_sketch
  Returns a count-min sketch of a column with the given esp, confidence and seed. The result is an array of bytes, which can be deserialized to a
  CountMinSketch
  before usage. Count-min sketch is a probabilistic data structure used for cardinality estimation using sub-linear space.
  
  Parameters:
  
  e - (undocumented)
  
  eps - (undocumented)
  
  confidence - (undocumented)
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- count_min_sketch
  Returns a count-min sketch of a column with the given esp, confidence and seed. The result is an array of bytes, which can be deserialized to a
  CountMinSketch
  before usage. Count-min sketch is a probabilistic data structure used for cardinality estimation using sub-linear space.
  
  Parameters:
  
  e - (undocumented)
  
  eps - (undocumented)
  
  confidence - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- corr
  
  Parameters:
  
  column1 - (undocumented)
  
  column2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- corr
  
  Parameters:
  
  columnName1 - (undocumented)
  
  columnName2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- count
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- count
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- countDistinct public static Column countDistinct(Column expr, scala.collection.immutable.Seq<Column> exprs)
  Aggregate function: returns the number of distinct items in a group.
  
  An alias of count_distinct, and it is encouraged to use count_distinct directly.
  
  Parameters:
  
  expr - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- countDistinct public static Column countDistinct(String columnName, scala.collection.immutable.Seq<String> columnNames)
  Aggregate function: returns the number of distinct items in a group.
  
  An alias of count_distinct, and it is encouraged to use count_distinct directly.
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- count_distinct public static Column count_distinct(Column expr, scala.collection.immutable.Seq<Column> exprs)
  
  Parameters:
  
  expr - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- covar_pop
  
  Parameters:
  
  column1 - (undocumented)
  
  column2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- covar_pop
  
  Parameters:
  
  columnName1 - (undocumented)
  
  columnName2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- covar_samp
  
  Parameters:
  
  column1 - (undocumented)
  
  column2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- covar_samp
  
  Parameters:
  
  columnName1 - (undocumented)
  
  columnName2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- first
  Aggregate function: returns the first value in a group.
  
  The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- first public static Column first(String columnName, boolean ignoreNulls)
  Aggregate function: returns the first value of a column in a group.
  
  The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  columnName - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- first
  Aggregate function: returns the first value in a group.
  
  The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- first
  Aggregate function: returns the first value of a column in a group.
  
  The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- first_value
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- first_value
  Aggregate function: returns the first value in a group.
  
  The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- grouping
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- grouping
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- grouping_id public static Column grouping_id(scala.collection.immutable.Seq<Column> cols)
  Aggregate function: returns the level of grouping, equals to
```
   (grouping(c1) <<; (n-1)) + (grouping(c2) <<; (n-2)) + ... + grouping(cn)
 
```
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The list of columns should match with grouping columns exactly, or empty (means all the grouping columns).
- grouping_id public static Column grouping_id(String colName, scala.collection.immutable.Seq<String> colNames)
  Aggregate function: returns the level of grouping, equals to
```
   (grouping(c1) <<; (n-1)) + (grouping(c2) <<; (n-2)) + ... + grouping(cn)
 
```
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The list of columns should match with grouping columns exactly.
- hll_sketch_agg
  
  Parameters:
  
  e - (undocumented)
  
  lgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_sketch_agg public static Column hll_sketch_agg(Column e, int lgConfigK)
  
  Parameters:
  
  e - (undocumented)
  
  lgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_sketch_agg public static Column hll_sketch_agg(String columnName, int lgConfigK)
  
  Parameters:
  
  columnName - (undocumented)
  
  lgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_sketch_agg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_sketch_agg
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union_agg
  
  Parameters:
  
  e - (undocumented)
  
  allowDifferentLgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union_agg public static Column hll_union_agg(Column e, boolean allowDifferentLgConfigK)
  
  Parameters:
  
  e - (undocumented)
  
  allowDifferentLgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union_agg public static Column hll_union_agg(String columnName, boolean allowDifferentLgConfigK)
  
  Parameters:
  
  columnName - (undocumented)
  
  allowDifferentLgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union_agg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union_agg
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- kurtosis
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- kurtosis
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- last
  Aggregate function: returns the last value in a group.
  
  The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- last public static Column last(String columnName, boolean ignoreNulls)
  Aggregate function: returns the last value of the column in a group.
  
  The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  columnName - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- last
  Aggregate function: returns the last value in a group.
  
  The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- last
  Aggregate function: returns the last value of the column in a group.
  
  The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- last_value
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- last_value
  Aggregate function: returns the last value in a group.
  
  The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  Parameters:
  
  e - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  Note:
  
  The function is non-deterministic because its results depends on the order of the rows which may be non-deterministic after a shuffle.
- mode
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- mode
  Aggregate function: returns the most frequent value in a group.
  
  When multiple values have the same greatest frequency then either any of values is returned if deterministic is false or is not defined, or the lowest value is returned if deterministic is true.
  
  Parameters:
  
  e - (undocumented)
  
  deterministic - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- max
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- max
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- max_by
  
  Parameters:
  
  e - (undocumented)
  
  ord - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
  
  Note:
  
  The function is non-deterministic so the output order can be different for those associated the same values of e.
- mean
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- mean
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- median
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- min
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- min
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- min_by
  
  Parameters:
  
  e - (undocumented)
  
  ord - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
  
  Note:
  
  The function is non-deterministic so the output order can be different for those associated the same values of e.
- percentile
  Aggregate function: returns the exact percentile(s) of numeric column
  expr
  at the given percentage(s) with value range in [0.0, 1.0].
  
  Parameters:
  
  e - (undocumented)
  
  percentage - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- percentile
  Aggregate function: returns the exact percentile(s) of numeric column
  expr
  at the given percentage(s) with value range in [0.0, 1.0].
  
  Parameters:
  
  e - (undocumented)
  
  percentage - (undocumented)
  
  frequency - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- percentile_approx
  Aggregate function: returns the approximate
  percentile
  of the numeric column
  col
  which is the smallest value in the ordered
  col
  values (sorted from least to greatest) such that no more than
  percentage
  of
  col
  values is less than the value or equal to that value.
  
  If percentage is an array, each value must be between 0.0 and 1.0. If it is a single floating point value, it must be between 0.0 and 1.0.
  
  The accuracy parameter is a positive numeric literal which controls approximation accuracy at the cost of memory. Higher value of accuracy yields better accuracy, 1.0/accuracy is the relative error of the approximation.
  
  Parameters:
  
  e - (undocumented)
  
  percentage - (undocumented)
  
  accuracy - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- approx_percentile
  Aggregate function: returns the approximate
  percentile
  of the numeric column
  col
  which is the smallest value in the ordered
  col
  values (sorted from least to greatest) such that no more than
  percentage
  of
  col
  values is less than the value or equal to that value.
  
  If percentage is an array, each value must be between 0.0 and 1.0. If it is a single floating point value, it must be between 0.0 and 1.0.
  
  The accuracy parameter is a positive numeric literal which controls approximation accuracy at the cost of memory. Higher value of accuracy yields better accuracy, 1.0/accuracy is the relative error of the approximation.
  
  Parameters:
  
  e - (undocumented)
  
  percentage - (undocumented)
  
  accuracy - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- product
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- skewness
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- skewness
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- std
  Aggregate function: alias for
  stddev_samp
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- stddev
  Aggregate function: alias for
  stddev_samp
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- stddev
  Aggregate function: alias for
  stddev_samp
  .
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- stddev_samp
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- stddev_samp
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- stddev_pop
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- stddev_pop
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- sum
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- sum
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- sumDistinct
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- sumDistinct
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- sum_distinct
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- listagg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- listagg
  
  Parameters:
  
  e - (undocumented)
  
  delimiter - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- listagg_distinct
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- listagg_distinct
  
  Parameters:
  
  e - (undocumented)
  
  delimiter - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- string_agg
  Aggregate function: returns the concatenation of non-null input values. Alias for
  listagg
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- string_agg
  Aggregate function: returns the concatenation of non-null input values, separated by the delimiter. Alias for
  listagg
  .
  
  Parameters:
  
  e - (undocumented)
  
  delimiter - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- string_agg_distinct
  Aggregate function: returns the concatenation of distinct non-null input values. Alias for
  listagg
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- string_agg_distinct
  Aggregate function: returns the concatenation of distinct non-null input values, separated by the delimiter. Alias for
  listagg
  .
  
  Parameters:
  
  e - (undocumented)
  
  delimiter - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- variance
  Aggregate function: alias for
  var_samp
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- variance
  Aggregate function: alias for
  var_samp
  .
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- var_samp
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- var_samp
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- var_pop
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- var_pop
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- regr_avgx
  Aggregate function: returns the average of the independent variable for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_avgy
  Aggregate function: returns the average of the independent variable for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_count
  Aggregate function: returns the number of non-null number pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_intercept
  Aggregate function: returns the intercept of the univariate linear regression line for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_r2
  Aggregate function: returns the coefficient of determination for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_slope
  Aggregate function: returns the slope of the linear regression line for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_sxx
  Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(x) for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_sxy
  Aggregate function: returns REGR_COUNT(y, x) * COVAR_POP(y, x) for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regr_syy
  Aggregate function: returns REGR_COUNT(y, x) * VAR_POP(y) for non-null pairs in a group, where
  y
  is the dependent variable and
  x
  is the independent variable.
  
  Parameters:
  
  y - (undocumented)
  
  x - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- any_value
  Aggregate function: returns some value of
  e
  for a group of rows.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- any_value
  Aggregate function: returns some value of
  e
  for a group of rows. If
  isIgnoreNull
  is true, returns only non-null values.
  
  Parameters:
  
  e - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- count_if
  Aggregate function: returns the number of
  TRUE
  values for the expression.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- histogram_numeric
  
  Parameters:
  
  e - (undocumented)
  
  nBins - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- every
  Aggregate function: returns true if all values of
  e
  are true.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bool_and
  Aggregate function: returns true if all values of
  e
  are true.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- some
  Aggregate function: returns true if at least one value of
  e
  is true.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- any
  Aggregate function: returns true if at least one value of
  e
  is true.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bool_or
  Aggregate function: returns true if at least one value of
  e
  is true.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bit_and
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bit_or
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bit_xor
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- cume_dist public static Column cume_dist
  ()
  
  Window function: returns the cumulative distribution of values within a window partition, i.e. the fraction of rows that are below the current row.
```
   N = total number of rows in the partition
   cumeDist(x) = number of values before (and including) x / N
 
```
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- dense_rank public static Column dense_rank
  ()
  
  Window function: returns the rank of rows within a window partition, without any gaps.
  
  The difference between rank and dense_rank is that denseRank leaves no gaps in ranking sequence when there are ties. That is, if you were ranking a competition using dense_rank and had three people tie for second place, you would say that all three were in second place and that the next person came in third. Rank would give me sequential numbers, making the person that came in third place (after the ties) would register as coming in fifth.
  
  This is equivalent to the DENSE_RANK function in SQL.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- lag
  Window function: returns the value that is
  offset
  rows before the current row, and
  null
  if there is less than
  offset
  rows before the current row. For example, an
  offset
  of one will return the previous row at any given point in the window partition.
  
  This is equivalent to the LAG function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lag
  Window function: returns the value that is
  offset
  rows before the current row, and
  null
  if there is less than
  offset
  rows before the current row. For example, an
  offset
  of one will return the previous row at any given point in the window partition.
  
  This is equivalent to the LAG function in SQL.
  
  Parameters:
  
  columnName - (undocumented)
  
  offset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lag
  Window function: returns the value that is
  offset
  rows before the current row, and
  defaultValue
  if there is less than
  offset
  rows before the current row. For example, an
  offset
  of one will return the previous row at any given point in the window partition.
  
  This is equivalent to the LAG function in SQL.
  
  Parameters:
  
  columnName - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lag
  Window function: returns the value that is
  offset
  rows before the current row, and
  defaultValue
  if there is less than
  offset
  rows before the current row. For example, an
  offset
  of one will return the previous row at any given point in the window partition.
  
  This is equivalent to the LAG function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lag
  Window function: returns the value that is
  offset
  rows before the current row, and
  defaultValue
  if there is less than
  offset
  rows before the current row.
  ignoreNulls
  determines whether null values of row are included in or eliminated from the calculation. For example, an
  offset
  of one will return the previous row at any given point in the window partition.
  
  This is equivalent to the LAG function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- lead
  Window function: returns the value that is
  offset
  rows after the current row, and
  null
  if there is less than
  offset
  rows after the current row. For example, an
  offset
  of one will return the next row at any given point in the window partition.
  
  This is equivalent to the LEAD function in SQL.
  
  Parameters:
  
  columnName - (undocumented)
  
  offset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lead
  Window function: returns the value that is
  offset
  rows after the current row, and
  null
  if there is less than
  offset
  rows after the current row. For example, an
  offset
  of one will return the next row at any given point in the window partition.
  
  This is equivalent to the LEAD function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lead
  Window function: returns the value that is
  offset
  rows after the current row, and
  defaultValue
  if there is less than
  offset
  rows after the current row. For example, an
  offset
  of one will return the next row at any given point in the window partition.
  
  This is equivalent to the LEAD function in SQL.
  
  Parameters:
  
  columnName - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lead
  Window function: returns the value that is
  offset
  rows after the current row, and
  defaultValue
  if there is less than
  offset
  rows after the current row. For example, an
  offset
  of one will return the next row at any given point in the window partition.
  
  This is equivalent to the LEAD function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- lead
  Window function: returns the value that is
  offset
  rows after the current row, and
  defaultValue
  if there is less than
  offset
  rows after the current row.
  ignoreNulls
  determines whether null values of row are included in or eliminated from the calculation. The default value of
  ignoreNulls
  is false. For example, an
  offset
  of one will return the next row at any given point in the window partition.
  
  This is equivalent to the LEAD function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  defaultValue - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- nth_value public static Column nth_value(Column e, int offset, boolean ignoreNulls)
  Window function: returns the value that is the
  offset
  th row of the window frame (counting from 1), and
  null
  if the size of window frame is less than
  offset
  rows.
  
  It will return the offsetth non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
  
  This is equivalent to the nth_value function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  ignoreNulls - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- nth_value
  Window function: returns the value that is the
  offset
  th row of the window frame (counting from 1), and
  null
  if the size of window frame is less than
  offset
  rows.
  
  This is equivalent to the nth_value function in SQL.
  
  Parameters:
  
  e - (undocumented)
  
  offset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- ntile public static Column ntile(int n)
  Window function: returns the ntile group id (from 1 to
  n
  inclusive) in an ordered window partition. For example, if
  n
  is 4, the first quarter of the rows will get value 1, the second quarter will get 2, the third quarter will get 3, and the last quarter will get 4.
  
  This is equivalent to the NTILE function in SQL.
  
  Parameters:
  
  n - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- percent_rank public static Column percent_rank
  ()
  
  Window function: returns the relative rank (i.e. percentile) of rows within a window partition.
  
  This is computed by:
```
   (rank of row in its partition - 1) / (number of rows in the partition - 1)
 
```
  This is equivalent to the PERCENT_RANK function in SQL.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- rank
  Window function: returns the rank of rows within a window partition.
  
  The difference between rank and dense_rank is that dense_rank leaves no gaps in ranking sequence when there are ties. That is, if you were ranking a competition using dense_rank and had three people tie for second place, you would say that all three were in second place and that the next person came in third. Rank would give me sequential numbers, making the person that came in third place (after the ties) would register as coming in fifth.
  
  This is equivalent to the RANK function in SQL.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- row_number public static Column row_number
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- array public static Column array(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- array public static Column array(String colName, scala.collection.immutable.Seq<String> colNames)
  
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- map public static Column map(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0
- named_struct public static Column named_struct(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- map_from_arrays
  
  Parameters:
  
  keys - (undocumented)
  
  values - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4
- str_to_map
  Creates a map after splitting the text into key/value pairs using delimiters. Both
  pairDelim
  and
  keyValueDelim
  are treated as regular expressions.
  
  Parameters:
  
  text - (undocumented)
  
  pairDelim - (undocumented)
  
  keyValueDelim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- str_to_map
  Creates a map after splitting the text into key/value pairs using delimiters. The
  pairDelim
  is treated as regular expressions.
  
  Parameters:
  
  text - (undocumented)
  
  pairDelim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- str_to_map
  
  Parameters:
  
  text - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- broadcast
  Marks a DataFrame as small enough for use in broadcast joins.
  
  The following example marks the right DataFrame for broadcast hash join using joinKey.
```
   // left and right are DataFrames
   left.join(broadcast(right), "joinKey")
 
```
  Parameters:
  
  df - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- coalesce public static Column coalesce(scala.collection.immutable.Seq<Column> e)
  Returns the first column that is not null, or null if all inputs are null.
  
  For example, coalesce(a, b, c) will return a if a is not null, or b if a is null and b is not null, or c if both a and b are null but c is not null.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- input_file_name public static Column input_file_name
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- isnan
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- isnull
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- monotonicallyIncreasingId public static Column monotonicallyIncreasingId
  ()
  
  A column expression that generates monotonically increasing 64-bit integers.
  
  The generated ID is guaranteed to be monotonically increasing and unique, but not consecutive. The current implementation puts the partition ID in the upper 31 bits, and the record number within each partition in the lower 33 bits. The assumption is that the data frame has less than 1 billion partitions, and each partition has less than 8 billion records.
  
  As an example, consider a DataFrame with two partitions, each with 3 records. This expression would return the following IDs:
```
 0, 1, 2, 8589934592 (1L << 33), 8589934593, 8589934594.
 
```
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- monotonically_increasing_id public static Column monotonically_increasing_id
  ()
  
  A column expression that generates monotonically increasing 64-bit integers.
  
  The generated ID is guaranteed to be monotonically increasing and unique, but not consecutive. The current implementation puts the partition ID in the upper 31 bits, and the record number within each partition in the lower 33 bits. The assumption is that the data frame has less than 1 billion partitions, and each partition has less than 8 billion records.
  
  As an example, consider a DataFrame with two partitions, each with 3 records. This expression would return the following IDs:
```
 0, 1, 2, 8589934592 (1L << 33), 8589934593, 8589934594.
 
```
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- nanvl
  Returns col1 if it is not NaN, or col2 if col1 is NaN.
  
  Both inputs should be floating point columns (DoubleType or FloatType).
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- negate
  Unary minus, i.e. negate the expression.
```
   // Select the amount column and negates all values.
   // Scala:
   df.select( -df("amount") )

   // Java:
   df.select( negate(df.col("amount")) );
 
```
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- not
  Inversion of boolean expression, i.e. NOT.
```
   // Scala: select rows that are not active (isActive === false)
   df.filter( !df("isActive") )

   // Java:
   df.filter( not(df.col("isActive")) );
 
```
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- rand public static Column rand(long seed)
  
  Parameters:
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
  
  Note:
  
  The function is non-deterministic in general case.
- rand
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
  
  Note:
  
  The function is non-deterministic in general case.
- randn public static Column randn(long seed)
  
  Parameters:
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
  
  Note:
  
  The function is non-deterministic in general case.
- randn
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
  
  Note:
  
  The function is non-deterministic in general case.
- randstr
  
  Parameters:
  
  length - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- randstr
  
  Parameters:
  
  length - (undocumented)
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- spark_partition_id public static Column spark_partition_id
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
  
  Note:
  
  This is non-deterministic because it depends on data partitioning and task scheduling.
- sqrt
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- sqrt
  
  Parameters:
  
  colName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- try_add
  Returns the sum of
  left
  and
  right
  and the result is null on overflow. The acceptable input types are the same with the
  +
  operator.
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_avg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_divide
  Returns
  dividend/divisor
  . It always performs floating point division. Its result is always null if
  divisor
  is 0.
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_mod
  Returns the remainder of
  dividend/divisor
  . Its result is always null if
  divisor
  is 0.
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_multiply
  Returns
  left*right
  and the result is null on overflow. The acceptable input types are the same with the
  *
  operator.
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_subtract
  Returns
  left-right
  and the result is null on overflow. The acceptable input types are the same with the
  -
  operator.
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_sum
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- struct public static Column struct(scala.collection.immutable.Seq<Column> cols)
  Creates a new struct column. If the input column is a column in a
  DataFrame
  , or a derived column expression that is named (i.e. aliased), its name would be retained as the StructField's name, otherwise, the newly generated StructField's name would be auto generated as
  col
  with a suffix
  index + 1
  , i.e. col1, col2, col3, ...
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- struct public static Column struct(String colName, scala.collection.immutable.Seq<String> colNames)
  
  Parameters:
  
  colName - (undocumented)
  
  colNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- when
  Evaluates a list of conditions and returns one of multiple possible result expressions. If otherwise is not defined at the end, null is returned for unmatched conditions.
```
   // Example: encoding gender string column into integer.

   // Scala:
   people.select(when(people("gender") === "male", 0)
     .when(people("gender") === "female", 1)
     .otherwise(2))

   // Java:
   people.select(when(col("gender").equalTo("male"), 0)
     .when(col("gender").equalTo("female"), 1)
     .otherwise(2))
 
```
  Parameters:
  
  condition - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- bitwiseNOT
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- bitwise_not
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- bit_count
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bit_get
  
  Parameters:
  
  e - (undocumented)
  
  pos - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- getbit
  
  Parameters:
  
  e - (undocumented)
  
  pos - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- expr
  Parses the expression string into the column that it represents, similar to
  Dataset.selectExpr(java.lang.String...)
  .
```
   // get the number of words of each length
   df.groupBy(expr("length(word)")).count()
 
```
  Parameters:
  
  expr - (undocumented)
  
  Returns:
  
  (undocumented)
- abs
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- acos
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse cosine of e in radians, as if computed by java.lang.Math.acos
  
  Since:
  
  1.4.0
- acos
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse cosine of columnName, as if computed by java.lang.Math.acos
  
  Since:
  
  1.4.0
- acosh
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse hyperbolic cosine of e
  
  Since:
  
  3.1.0
- acosh
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse hyperbolic cosine of columnName
  
  Since:
  
  3.1.0
- asin
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse sine of e in radians, as if computed by java.lang.Math.asin
  
  Since:
  
  1.4.0
- asin
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse sine of columnName, as if computed by java.lang.Math.asin
  
  Since:
  
  1.4.0
- asinh
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse hyperbolic sine of e
  
  Since:
  
  3.1.0
- asinh
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse hyperbolic sine of columnName
  
  Since:
  
  3.1.0
- atan
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse tangent of e as if computed by java.lang.Math.atan
  
  Since:
  
  1.4.0
- atan
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse tangent of columnName, as if computed by java.lang.Math.atan
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  y - coordinate on y-axis
  
  x - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  y - coordinate on y-axis
  
  xName - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  yName - coordinate on y-axis
  
  x - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  yName - coordinate on y-axis
  
  xName - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  y - coordinate on y-axis
  
  xValue - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  yName - coordinate on y-axis
  
  xValue - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  yValue - coordinate on y-axis
  
  x - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atan2
  
  Parameters:
  
  yValue - coordinate on y-axis
  
  xName - coordinate on x-axis
  
  Returns:
  
  the theta component of the point (r, theta) in polar coordinates that corresponds to the point (x, y) in Cartesian coordinates, as if computed by java.lang.Math.atan2
  
  Since:
  
  1.4.0
- atanh
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  inverse hyperbolic tangent of e
  
  Since:
  
  3.1.0
- atanh
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  inverse hyperbolic tangent of columnName
  
  Since:
  
  3.1.0
- bin
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- bin
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- cbrt
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- cbrt
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- ceil
  Computes the ceiling of the given value of
  e
  to
  scale
  decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- ceil
  Computes the ceiling of the given value of
  e
  to 0 decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- ceil
  Computes the ceiling of the given value of
  e
  to 0 decimal places.
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- ceiling
  Computes the ceiling of the given value of
  e
  to
  scale
  decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- ceiling
  Computes the ceiling of the given value of
  e
  to 0 decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- conv public static Column conv(Column num, int fromBase, int toBase)
  
  Parameters:
  
  num - (undocumented)
  
  fromBase - (undocumented)
  
  toBase - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- cos
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  cosine of the angle, as if computed by java.lang.Math.cos
  
  Since:
  
  1.4.0
- cos
  
  Parameters:
  
  columnName - angle in radians
  
  Returns:
  
  cosine of the angle, as if computed by java.lang.Math.cos
  
  Since:
  
  1.4.0
- cosh
  
  Parameters:
  
  e - hyperbolic angle
  
  Returns:
  
  hyperbolic cosine of the angle, as if computed by java.lang.Math.cosh
  
  Since:
  
  1.4.0
- cosh
  
  Parameters:
  
  columnName - hyperbolic angle
  
  Returns:
  
  hyperbolic cosine of the angle, as if computed by java.lang.Math.cosh
  
  Since:
  
  1.4.0
- cot
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  cotangent of the angle
  
  Since:
  
  3.3.0
- csc
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  cosecant of the angle
  
  Since:
  
  3.3.0
- e
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- exp
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- exp
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- expm1
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- expm1
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- factorial
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- floor
  Computes the floor of the given value of
  e
  to
  scale
  decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- floor
  Computes the floor of the given value of
  e
  to 0 decimal places.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- floor
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- greatest public static Column greatest(scala.collection.immutable.Seq<Column> exprs)
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- greatest public static Column greatest(String columnName, scala.collection.immutable.Seq<String> columnNames)
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- hex
  
  Parameters:
  
  column - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- unhex
  
  Parameters:
  
  column - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  l - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  leftName - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  leftName - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  leftName - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- hypot
  Computes
  sqrt(a^2^ + b^2^)
  without intermediate overflow or underflow.
  
  Parameters:
  
  l - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- least public static Column least(scala.collection.immutable.Seq<Column> exprs)
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- least public static Column least(String columnName, scala.collection.immutable.Seq<String> columnNames)
  
  Parameters:
  
  columnName - (undocumented)
  
  columnNames - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- ln
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- log
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log
  
  Parameters:
  
  base - (undocumented)
  
  a - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log
  
  Parameters:
  
  base - (undocumented)
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log10
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log10
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log1p
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log1p
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- log2
  
  Parameters:
  
  expr - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- log2
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- negative
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- pi
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- positive
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- pow
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  l - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  leftName - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  leftName - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  leftName - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- pow
  
  Parameters:
  
  l - (undocumented)
  
  rightName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- power
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- pmod
  
  Parameters:
  
  dividend - (undocumented)
  
  divisor - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- rint
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- rint
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- round
  Returns the value of the column
  e
  rounded to 0 decimal places with HALF_UP round mode.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- round
  Round the value of
  e
  to
  scale
  decimal places with HALF_UP round mode if
  scale
  is greater than or equal to 0 or at integral part when
  scale
  is less than 0.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- round
  Round the value of
  e
  to
  scale
  decimal places with HALF_UP round mode if
  scale
  is greater than or equal to 0 or at integral part when
  scale
  is less than 0.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- bround
  Returns the value of the column
  e
  rounded to 0 decimal places with HALF_EVEN round mode.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- bround
  Round the value of
  e
  to
  scale
  decimal places with HALF_EVEN round mode if
  scale
  is greater than or equal to 0 or at integral part when
  scale
  is less than 0.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- bround
  Round the value of
  e
  to
  scale
  decimal places with HALF_EVEN round mode if
  scale
  is greater than or equal to 0 or at integral part when
  scale
  is less than 0.
  
  Parameters:
  
  e - (undocumented)
  
  scale - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- sec
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  secant of the angle
  
  Since:
  
  3.3.0
- shiftLeft
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- shiftleft
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- shiftRight
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- shiftright
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- shiftRightUnsigned public static Column shiftRightUnsigned(Column e, int numBits)
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- shiftrightunsigned public static Column shiftrightunsigned(Column e, int numBits)
  
  Parameters:
  
  e - (undocumented)
  
  numBits - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- sign
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- signum
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- signum
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- sin
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  sine of the angle, as if computed by java.lang.Math.sin
  
  Since:
  
  1.4.0
- sin
  
  Parameters:
  
  columnName - angle in radians
  
  Returns:
  
  sine of the angle, as if computed by java.lang.Math.sin
  
  Since:
  
  1.4.0
- sinh
  
  Parameters:
  
  e - hyperbolic angle
  
  Returns:
  
  hyperbolic sine of the given value, as if computed by java.lang.Math.sinh
  
  Since:
  
  1.4.0
- sinh
  
  Parameters:
  
  columnName - hyperbolic angle
  
  Returns:
  
  hyperbolic sine of the given value, as if computed by java.lang.Math.sinh
  
  Since:
  
  1.4.0
- tan
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  tangent of the given value, as if computed by java.lang.Math.tan
  
  Since:
  
  1.4.0
- tan
  
  Parameters:
  
  columnName - angle in radians
  
  Returns:
  
  tangent of the given value, as if computed by java.lang.Math.tan
  
  Since:
  
  1.4.0
- tanh
  
  Parameters:
  
  e - hyperbolic angle
  
  Returns:
  
  hyperbolic tangent of the given value, as if computed by java.lang.Math.tanh
  
  Since:
  
  1.4.0
- tanh
  
  Parameters:
  
  columnName - hyperbolic angle
  
  Returns:
  
  hyperbolic tangent of the given value, as if computed by java.lang.Math.tanh
  
  Since:
  
  1.4.0
- toDegrees
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- toDegrees
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- degrees
  
  Parameters:
  
  e - angle in radians
  
  Returns:
  
  angle in degrees, as if computed by java.lang.Math.toDegrees
  
  Since:
  
  2.1.0
- degrees
  
  Parameters:
  
  columnName - angle in radians
  
  Returns:
  
  angle in degrees, as if computed by java.lang.Math.toDegrees
  
  Since:
  
  2.1.0
- toRadians
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- toRadians
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.4.0
- radians
  
  Parameters:
  
  e - angle in degrees
  
  Returns:
  
  angle in radians, as if computed by java.lang.Math.toRadians
  
  Since:
  
  2.1.0
- radians
  
  Parameters:
  
  columnName - angle in degrees
  
  Returns:
  
  angle in radians, as if computed by java.lang.Math.toRadians
  
  Since:
  
  2.1.0
- width_bucket
  
  Parameters:
  
  v - value to compute a bucket number in the histogram
  
  min - minimum value of the histogram
  
  max - maximum value of the histogram
  
  numBucket - the number of buckets
  
  Returns:
  
  the bucket number into which the value would fall after being evaluated
  
  Since:
  
  3.5.0
- current_catalog public static Column current_catalog
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- current_database public static Column current_database
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- current_schema public static Column current_schema
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- current_user public static Column current_user
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- md5
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- sha1
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- sha2
  
  Parameters:
  
  e - column to compute SHA-2 on.
  
  numBits - one of 224, 256, 384, or 512.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- crc32
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- hash public static Column hash(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- xxhash64 public static Column xxhash64(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- assert_true
  
  Parameters:
  
  c - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- assert_true
  
  Parameters:
  
  c - (undocumented)
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- raise_error
  
  Parameters:
  
  c - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- hll_sketch_estimate
  
  Parameters:
  
  c - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_sketch_estimate
  
  Parameters:
  
  columnName - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union
  
  Parameters:
  
  c1 - (undocumented)
  
  c2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union
  
  Parameters:
  
  columnName1 - (undocumented)
  
  columnName2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union
  
  Parameters:
  
  c1 - (undocumented)
  
  c2 - (undocumented)
  
  allowDifferentLgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hll_union public static Column hll_union(String columnName1, String columnName2, boolean allowDifferentLgConfigK)
  
  Parameters:
  
  columnName1 - (undocumented)
  
  columnName2 - (undocumented)
  
  allowDifferentLgConfigK - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- user
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- session_user public static Column session_user
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- uuid
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- aes_encrypt
  Returns an encrypted value of
  input
  using AES in given
  mode
  with the specified
  padding
  . Key lengths of 16, 24 and 32 bits are supported. Supported combinations of (
  mode
  ,
  padding
  ) are ('ECB', 'PKCS'), ('GCM', 'NONE') and ('CBC', 'PKCS'). Optional initialization vectors (IVs) are only supported for CBC and GCM modes. These must be 16 bytes for CBC and 12 bytes for GCM. If not provided, a random vector will be generated and prepended to the output. Optional additional authenticated data (AAD) is only supported for GCM. If provided for encryption, the identical AAD value must be provided for decryption. The default mode is GCM.
  
  Parameters:
  
  input - The binary value to encrypt.
  
  key - The passphrase to use to encrypt the data.
  
  mode - Specifies which block cipher mode should be used to encrypt messages. Valid modes: ECB, GCM, CBC.
  
  padding - Specifies how to pad messages whose length is not a multiple of the block size. Valid values: PKCS, NONE, DEFAULT. The DEFAULT padding means PKCS for ECB, NONE for GCM and PKCS for CBC.
  
  iv - Optional initialization vector. Only supported for CBC and GCM modes. Valid values: None or "". 16-byte array for CBC mode. 12-byte array for GCM mode.
  
  aad - Optional additional authenticated data. Only supported for GCM mode. This can be any free-form input and must be provided for both encryption and decryption.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- aes_encrypt
  Returns an encrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  padding - (undocumented)
  
  iv - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_encrypt(Column, Column, Column, Column, Column, Column)
- aes_encrypt
  Returns an encrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  padding - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_encrypt(Column, Column, Column, Column, Column, Column)
- aes_encrypt
  Returns an encrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_encrypt(Column, Column, Column, Column, Column, Column)
- aes_encrypt
  Returns an encrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_encrypt(Column, Column, Column, Column, Column, Column)
- aes_decrypt
  Returns a decrypted value of
  input
  using AES in
  mode
  with
  padding
  . Key lengths of 16, 24 and 32 bits are supported. Supported combinations of (
  mode
  ,
  padding
  ) are ('ECB', 'PKCS'), ('GCM', 'NONE') and ('CBC', 'PKCS'). Optional additional authenticated data (AAD) is only supported for GCM. If provided for encryption, the identical AAD value must be provided for decryption. The default mode is GCM.
  
  Parameters:
  
  input - The binary value to decrypt.
  
  key - The passphrase to use to decrypt the data.
  
  mode - Specifies which block cipher mode should be used to decrypt messages. Valid modes: ECB, GCM, CBC.
  
  padding - Specifies how to pad messages whose length is not a multiple of the block size. Valid values: PKCS, NONE, DEFAULT. The DEFAULT padding means PKCS for ECB, NONE for GCM and PKCS for CBC.
  
  aad - Optional additional authenticated data. Only supported for GCM mode. This can be any free-form input and must be provided for both encryption and decryption.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  padding - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_decrypt(Column, Column, Column, Column, Column)
- aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_decrypt(Column, Column, Column, Column, Column)
- aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.aes_decrypt(Column, Column, Column, Column, Column)
- try_aes_decrypt
  This is a special version of
  aes_decrypt
  that performs the same operation, but returns a NULL value instead of raising an error if the decryption cannot be performed.
  
  Parameters:
  
  input - The binary value to decrypt.
  
  key - The passphrase to use to decrypt the data.
  
  mode - Specifies which block cipher mode should be used to decrypt messages. Valid modes: ECB, GCM, CBC.
  
  padding - Specifies how to pad messages whose length is not a multiple of the block size. Valid values: PKCS, NONE, DEFAULT. The DEFAULT padding means PKCS for ECB, NONE for GCM and PKCS for CBC.
  
  aad - Optional additional authenticated data. Only supported for GCM mode. This can be any free-form input and must be provided for both encryption and decryption.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  padding - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.try_aes_decrypt(Column, Column, Column, Column, Column)
- try_aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  mode - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.try_aes_decrypt(Column, Column, Column, Column, Column)
- try_aes_decrypt
  Returns a decrypted value of
  input
  .
  Parameters:
  
  input - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  See Also:
  - org.apache.spark.sql.functions.try_aes_decrypt(Column, Column, Column, Column, Column)
- sha
  Returns a sha1 hash value as a hex string of the
  col
  .
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- input_file_block_length public static Column input_file_block_length
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- input_file_block_start public static Column input_file_block_start
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- reflect public static Column reflect(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- java_method public static Column java_method(scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_reflect public static Column try_reflect(scala.collection.immutable.Seq<Column> cols)
  This is a special version of
  reflect
  that performs the same operation, but returns a NULL value instead of raising an error if the invoke method thrown exception.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- version public static Column version
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- typeof
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- stack public static Column stack(scala.collection.immutable.Seq<Column> cols)
  Separates
  col1
  , ...,
  colk
  into
  n
  rows. Uses column names col0, col1, etc. by default unless specified otherwise.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- uniform
  
  Parameters:
  
  min - (undocumented)
  
  max - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- uniform
  
  Parameters:
  
  min - (undocumented)
  
  max - (undocumented)
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- random
  
  Parameters:
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- random public static Column random
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bitmap_bit_position
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bitmap_bucket_number
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bitmap_construct_agg
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bitmap_count
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bitmap_or_agg
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- ascii
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- base64
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- bit_length
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- concat_ws
  
  Parameters:
  
  sep - (undocumented)
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  Input strings which are null are skipped.
- decode
  
  Parameters:
  
  value - (undocumented)
  
  charset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- encode
  
  Parameters:
  
  value - (undocumented)
  
  charset - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- is_valid_utf8
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_valid_utf8
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- validate_utf8
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_validate_utf8
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- format_number
  Formats numeric column x to a format like '#,###,###.##', rounded to d decimal places with HALF_EVEN round mode, and returns the result as a string column.
  
  If d is 0, the result has no decimal point or fractional part. If d is less than 0, the result will be null.
  
  Parameters:
  
  x - (undocumented)
  
  d - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- format_string public static Column format_string(String format, scala.collection.immutable.Seq<Column> arguments)
  
  Parameters:
  
  format - (undocumented)
  
  arguments - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- initcap
  Returns a new string column by converting the first letter of each word to uppercase. Words are delimited by whitespace.
  
  For example, "hello world" will become "Hello World".
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- instr
  
  Parameters:
  
  str - (undocumented)
  
  substring - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  The position is not zero based, but 1 based index. Returns 0 if substr could not be found in str.
- instr
  
  Parameters:
  
  str - (undocumented)
  
  substring - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
  
  Note:
  
  The position is not zero based, but 1 based index. Returns 0 if substr could not be found in str.
- length
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- len
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- lower
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- levenshtein
  Computes the Levenshtein distance of the two given string columns if it's less than or equal to a given threshold.
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  threshold - (undocumented)
  
  Returns:
  
  result distance, or -1
  
  Since:
  
  3.5.0
- levenshtein
  Computes the Levenshtein distance of the two given string columns.
  
  Parameters:
  
  l - (undocumented)
  
  r - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- locate
  
  Parameters:
  
  substr - (undocumented)
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  The position is not zero based, but 1 based index. Returns 0 if substr could not be found in str.
- locate
  
  Parameters:
  
  substr - (undocumented)
  
  str - (undocumented)
  
  pos - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  The position is not zero based, but 1 based index. returns 0 if substr could not be found in str.
- lpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- lpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- lpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- ltrim
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- ltrim
  Trim the specified character string from left end for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trimString - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- ltrim
  Trim the specified character string from left end for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- octet_length
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- collate
  
  Parameters:
  
  e - (undocumented)
  
  collation - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- collation
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- rlike
  Returns true if
  str
  matches
  regexp
  , or false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp
  Returns true if
  str
  matches
  regexp
  , or false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_like
  Returns true if
  str
  matches
  regexp
  , or false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_count
  Returns a count of the number of times that the regular expression pattern
  regexp
  is matched in the string
  str
  .
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_extract
  
  Parameters:
  
  e - (undocumented)
  
  exp - (undocumented)
  
  groupIdx - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- regexp_extract_all
  Extract all strings in the
  str
  that match the
  regexp
  expression and corresponding to the first regex group index.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_extract_all
  Extract all strings in the
  str
  that match the
  regexp
  expression and corresponding to the regex group index.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  idx - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_replace
  
  Parameters:
  
  e - (undocumented)
  
  pattern - (undocumented)
  
  replacement - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- regexp_replace
  
  Parameters:
  
  e - (undocumented)
  
  pattern - (undocumented)
  
  replacement - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- regexp_substr
  Returns the substring that matches the regular expression
  regexp
  within the string
  str
  . If the regular expression is not found, the result is null.
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_instr
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- regexp_instr
  
  Parameters:
  
  str - (undocumented)
  
  regexp - (undocumented)
  
  idx - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- unbase64
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- rpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- rpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- rpad
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  pad - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- repeat
  
  Parameters:
  
  str - (undocumented)
  
  n - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- repeat
  
  Parameters:
  
  str - (undocumented)
  
  n - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- rtrim
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- rtrim
  Trim the specified character string from right end for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trimString - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- rtrim
  Trim the specified character string from right end for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- soundex
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- split
  
  Parameters:
  
  str - a string expression to split
  
  pattern - a string representing a regular expression. The regex string should be a Java regular expression.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- split
  
  Parameters:
  
  str - a string expression to split
  
  pattern - a column of string representing a regular expression. The regex string should be a Java regular expression.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- split
  Parameters:
  
  str - a string expression to split
  
  pattern - a string representing a regular expression. The regex string should be a Java regular expression.
  limit - an integer expression which controls the number of times the regex is applied.
  - limit greater than 0: The resulting array's length will not be more than limit, and the resulting array's last entry will contain all input beyond the last matched regex.
  - limit less than or equal to 0: regex will be applied as many times as possible, and the resulting array can be of any size.
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- split
  Parameters:
  
  str - a string expression to split
  
  pattern - a column of string representing a regular expression. The regex string should be a Java regular expression.
  limit - a column of integer expression which controls the number of times the regex is applied.
  - limit greater than 0: The resulting array's length will not be more than limit, and the resulting array's last entry will contain all input beyond the last matched regex.
  - limit less than or equal to 0: regex will be applied as many times as possible, and the resulting array can be of any size.
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- substring public static Column substring(Column str, int pos, int len)
  Substring starts at
  pos
  and is of length
  len
  when str is String type or returns the slice of byte array that starts at
  pos
  in byte and is of length
  len
  when str is Binary type
  
  Parameters:
  
  str - (undocumented)
  
  pos - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  The position is not zero based, but 1 based index.
- substring
  Substring starts at
  pos
  and is of length
  len
  when str is String type or returns the slice of byte array that starts at
  pos
  in byte and is of length
  len
  when str is Binary type
  
  Parameters:
  
  str - (undocumented)
  
  pos - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
  
  Note:
  
  The position is not zero based, but 1 based index.
- substring_index
  
  Parameters:
  
  str - (undocumented)
  
  delim - (undocumented)
  
  count - (undocumented)
  
  Returns:
  
  (undocumented)
- overlay
  Overlay the specified portion of
  src
  with
  replace
  , starting from byte position
  pos
  of
  src
  and proceeding for
  len
  bytes.
  
  Parameters:
  
  src - (undocumented)
  
  replace - (undocumented)
  
  pos - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- overlay
  Overlay the specified portion of
  src
  with
  replace
  , starting from byte position
  pos
  of
  src
  .
  
  Parameters:
  
  src - (undocumented)
  
  replace - (undocumented)
  
  pos - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- sentences
  Splits a string into arrays of sentences, where each sentence is an array of words.
  
  Parameters:
  
  string - (undocumented)
  
  language - (undocumented)
  
  country - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- sentences
  Splits a string into arrays of sentences, where each sentence is an array of words. The default country('') is used.
  
  Parameters:
  
  string - (undocumented)
  
  language - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- sentences
  Splits a string into arrays of sentences, where each sentence is an array of words. The default locale is used.
  
  Parameters:
  
  string - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- translate
  Translate any character in the src by a character in replaceString. The characters in replaceString correspond to the characters in matchingString. The translate will happen when any character in the string matches the character in the
  matchingString
  .
  
  Parameters:
  
  src - (undocumented)
  
  matchingString - (undocumented)
  
  replaceString - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- trim
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- trim
  Trim the specified character from both ends for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trimString - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- trim
  Trim the specified character from both ends for the specified string column.
  
  Parameters:
  
  e - (undocumented)
  
  trim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- upper
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- to_binary
  Converts the input
  e
  to a binary value based on the supplied
  format
  . The
  format
  can be a case-insensitive string literal of "hex", "utf-8", "utf8", or "base64". By default, the binary format for conversion is "hex" if
  format
  is omitted. The function returns NULL if at least one of the input parameters is NULL.
  
  Parameters:
  
  e - (undocumented)
  
  f - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_binary
  Converts the input
  e
  to a binary value based on the default format "hex". The function returns NULL if at least one of the input parameters is NULL.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_char
  Convert
  e
  to a string based on the
  format
  . Throws an exception if the conversion fails. The format can consist of the following characters, case insensitive: '0' or '9': Specifies an expected digit between 0 and 9. A sequence of 0 or 9 in the format string matches a sequence of digits in the input value, generating a result string of the same length as the corresponding sequence in the format string. The result string is left-padded with zeros if the 0/9 sequence comprises more digits than the matching part of the decimal value, starts with 0, and is before the decimal point. Otherwise, it is padded with spaces. '.' or 'D': Specifies the position of the decimal point (optional, only allowed once). ',' or 'G': Specifies the position of the grouping (thousands) separator (,). There must be a 0 or 9 to the left and right of each grouping separator. '$': Specifies the location of the $ currency sign. This character may only be specified once. 'S' or 'MI': Specifies the position of a '-' or '+' sign (optional, only allowed once at the beginning or end of the format string). Note that 'S' prints '+' for positive values but 'MI' prints a space. 'PR': Only allowed at the end of the format string; specifies that the result string will be wrapped by angle brackets if the input value is negative.
  
  If e is a datetime, format shall be a valid datetime pattern, see Datetime Patterns. If e is a binary, it is converted to a string in one of the formats: 'base64': a base 64 string. 'hex': a string in the hexadecimal format. 'utf-8': the input binary is decoded to UTF-8 string.
  
  Parameters:
  
  e - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_varchar
  Convert
  e
  to a string based on the
  format
  . Throws an exception if the conversion fails. The format can consist of the following characters, case insensitive: '0' or '9': Specifies an expected digit between 0 and 9. A sequence of 0 or 9 in the format string matches a sequence of digits in the input value, generating a result string of the same length as the corresponding sequence in the format string. The result string is left-padded with zeros if the 0/9 sequence comprises more digits than the matching part of the decimal value, starts with 0, and is before the decimal point. Otherwise, it is padded with spaces. '.' or 'D': Specifies the position of the decimal point (optional, only allowed once). ',' or 'G': Specifies the position of the grouping (thousands) separator (,). There must be a 0 or 9 to the left and right of each grouping separator. '$': Specifies the location of the $ currency sign. This character may only be specified once. 'S' or 'MI': Specifies the position of a '-' or '+' sign (optional, only allowed once at the beginning or end of the format string). Note that 'S' prints '+' for positive values but 'MI' prints a space. 'PR': Only allowed at the end of the format string; specifies that the result string will be wrapped by angle brackets if the input value is negative.
  
  If e is a datetime, format shall be a valid datetime pattern, see Datetime Patterns. If e is a binary, it is converted to a string in one of the formats: 'base64': a base 64 string. 'hex': a string in the hexadecimal format. 'utf-8': the input binary is decoded to UTF-8 string.
  
  Parameters:
  
  e - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_number
  
  Parameters:
  
  e - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- replace
  Replaces all occurrences of
  search
  with
  replace
  .
  
  Parameters:
  
  src - A column of string to be replaced
  
  search - A column of string, If search is not found in str, str is returned unchanged.
  
  replace - A column of string, If replace is not specified or is an empty string, nothing replaces the string that is removed from str.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- replace
  Replaces all occurrences of
  search
  with
  replace
  .
  
  Parameters:
  
  src - A column of string to be replaced
  
  search - A column of string, If search is not found in src, src is returned unchanged.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- split_part
  Splits
  str
  by delimiter and return requested part of the split (1-based). If any input is null, returns null. if
  partNum
  is out of range of split parts, returns empty string. If
  partNum
  is 0, throws an error. If
  partNum
  is negative, the parts are counted backward from the end of the string. If the
  delimiter
  is an empty string, the
  str
  is not split.
  
  Parameters:
  
  str - (undocumented)
  
  delimiter - (undocumented)
  
  partNum - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- substr
  Returns the substring of
  str
  that starts at
  pos
  and is of length
  len
  , or the slice of byte array that starts at
  pos
  and is of length
  len
  .
  
  Parameters:
  
  str - (undocumented)
  
  pos - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- substr
  Returns the substring of
  str
  that starts at
  pos
  , or the slice of byte array that starts at
  pos
  .
  
  Parameters:
  
  str - (undocumented)
  
  pos - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_parse_url
  
  Parameters:
  
  url - (undocumented)
  
  partToExtract - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_parse_url
  
  Parameters:
  
  url - (undocumented)
  
  partToExtract - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- parse_url
  
  Parameters:
  
  url - (undocumented)
  
  partToExtract - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- parse_url
  
  Parameters:
  
  url - (undocumented)
  
  partToExtract - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- printf public static Column printf(Column format, scala.collection.immutable.Seq<Column> arguments)
  
  Parameters:
  
  format - (undocumented)
  
  arguments - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- url_decode
  Decodes a
  str
  in 'application/x-www-form-urlencoded' format using a specific encoding scheme.
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_url_decode
  This is a special version of
  url_decode
  that performs the same operation, but returns a NULL value instead of raising an error if the decoding cannot be performed.
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- url_encode
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- position
  Returns the position of the first occurrence of
  substr
  in
  str
  after position
  start
  . The given
  start
  and return value are 1-based.
  
  Parameters:
  
  substr - (undocumented)
  
  str - (undocumented)
  
  start - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- position
  Returns the position of the first occurrence of
  substr
  in
  str
  after position
  1
  . The return value are 1-based.
  
  Parameters:
  
  substr - (undocumented)
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- endswith
  
  Parameters:
  
  str - (undocumented)
  
  suffix - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- startswith
  
  Parameters:
  
  str - (undocumented)
  
  prefix - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- btrim
  Removes the leading and trailing space characters from
  str
  .
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- btrim
  Remove the leading and trailing
  trim
  characters from
  str
  .
  
  Parameters:
  
  str - (undocumented)
  
  trim - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_to_binary
  This is a special version of
  to_binary
  that performs the same operation, but returns a NULL value instead of raising an error if the conversion cannot be performed.
  
  Parameters:
  
  e - (undocumented)
  
  f - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_to_binary
  This is a special version of
  to_binary
  that performs the same operation, but returns a NULL value instead of raising an error if the conversion cannot be performed.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_to_number
  Convert string
  e
  to a number based on the string format
  format
  . Returns NULL if the string
  e
  does not match the expected format. The format follows the same semantics as the to_number function.
  
  Parameters:
  
  e - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- char_length
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- character_length
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- chr
  Returns the ASCII character having the binary equivalent to
  n
  . If n is larger than 256 the result is equivalent to chr(n % 256)
  
  Parameters:
  
  n - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- contains
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- elt public static Column elt(scala.collection.immutable.Seq<Column> inputs)
  Returns the
  n
  -th input, e.g., returns
  input2
  when
  n
  is 2. The function returns NULL if the index exceeds the length of the array and
  spark.sql.ansi.enabled
  is set to false. If
  spark.sql.ansi.enabled
  is set to true, it throws ArrayIndexOutOfBoundsException for invalid indices.
  
  Parameters:
  
  inputs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- find_in_set
  Returns the index (1-based) of the given string (
  str
  ) in the comma-delimited list (
  strArray
  ). Returns 0, if the string was not found or if the given string (
  str
  ) contains a comma.
  
  Parameters:
  
  str - (undocumented)
  
  strArray - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- like
  Returns true if str matches
  pattern
  with
  escapeChar
  , null if any arguments are null, false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  pattern - (undocumented)
  
  escapeChar - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- like
  Returns true if str matches
  pattern
  with
  escapeChar
  ('\'), null if any arguments are null, false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  pattern - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- ilike
  Returns true if str matches
  pattern
  with
  escapeChar
  case-insensitively, null if any arguments are null, false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  pattern - (undocumented)
  
  escapeChar - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- ilike
  Returns true if str matches
  pattern
  with
  escapeChar
  ('\') case-insensitively, null if any arguments are null, false otherwise.
  
  Parameters:
  
  str - (undocumented)
  
  pattern - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- lcase
  Returns
  str
  with all characters changed to lowercase.
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- ucase
  Returns
  str
  with all characters changed to uppercase.
  
  Parameters:
  
  str - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- left
  Returns the leftmost
  len
  (
  len
  can be string type) characters from the string
  str
  , if
  len
  is less or equal than 0 the result is an empty string.
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- right
  Returns the rightmost
  len
  (
  len
  can be string type) characters from the string
  str
  , if
  len
  is less or equal than 0 the result is an empty string.
  
  Parameters:
  
  str - (undocumented)
  
  len - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- add_months public static Column add_months(Column startDate, int numMonths)
  Returns the date that is
  numMonths
  after
  startDate
  .
  
  Parameters:
  
  startDate - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  numMonths - The number of months to add to startDate, can be negative to subtract months
  
  Returns:
  
  A date, or null if startDate was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- add_months
  Returns the date that is
  numMonths
  after
  startDate
  .
  
  Parameters:
  
  startDate - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  numMonths - A column of the number of months to add to startDate, can be negative to subtract months
  
  Returns:
  
  A date, or null if startDate was a string that could not be cast to a date
  
  Since:
  
  3.0.0
- curdate public static Column curdate
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- current_date public static Column current_date
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- current_timezone public static Column current_timezone
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- current_timestamp public static Column current_timestamp
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- now
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- localtimestamp public static Column localtimestamp
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- date_format
  Converts a date/timestamp/string to a value of string in the format specified by the date format given by the second argument.
  
  See Datetime Patterns for valid date and time format patterns
  
  Parameters:
  
  dateExpr - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  format - A pattern dd.MM.yyyy would return a string like 18.03.1993
  
  Returns:
  
  A string, or null if dateExpr was a string that could not be cast to a timestamp
  
  Throws:
  
  IllegalArgumentException - if the format pattern is invalid
  
  Since:
  
  1.5.0
  
  Note:
  
  Use specialized functions like year(org.apache.spark.sql.Column) whenever possible as they benefit from a specialized implementation.
- date_add
  Returns the date that is
  days
  days after
  start
  
  Parameters:
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  days - The number of days to add to start, can be negative to subtract days
  
  Returns:
  
  A date, or null if start was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- date_add
  Returns the date that is
  days
  days after
  start
  
  Parameters:
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  days - A column of the number of days to add to start, can be negative to subtract days
  
  Returns:
  
  A date, or null if start was a string that could not be cast to a date
  
  Since:
  
  3.0.0
- dateadd
  Returns the date that is
  days
  days after
  start
  
  Parameters:
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  days - A column of the number of days to add to start, can be negative to subtract days
  
  Returns:
  
  A date, or null if start was a string that could not be cast to a date
  
  Since:
  
  3.5.0
- date_sub
  Returns the date that is
  days
  days before
  start
  
  Parameters:
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  days - The number of days to subtract from start, can be negative to add days
  
  Returns:
  
  A date, or null if start was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- date_sub
  Returns the date that is
  days
  days before
  start
  
  Parameters:
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  days - A column of the number of days to subtract from start, can be negative to add days
  
  Returns:
  
  A date, or null if start was a string that could not be cast to a date
  
  Since:
  
  3.0.0
- datediff
  Returns the number of days from
  start
  to
  end
  .
  
  Only considers the date part of the input. For example:
```
 dateddiff("2018-01-10 00:00:00", "2018-01-09 23:59:59")
 // returns 1
 
```
  Parameters:
  
  end - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  An integer, or null if either end or start were strings that could not be cast to a date. Negative if end is before start
  
  Since:
  
  1.5.0
- date_diff
  Returns the number of days from
  start
  to
  end
  .
  
  Only considers the date part of the input. For example:
```
 dateddiff("2018-01-10 00:00:00", "2018-01-09 23:59:59")
 // returns 1
 
```
  Parameters:
  
  end - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  start - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  An integer, or null if either end or start were strings that could not be cast to a date. Negative if end is before start
  
  Since:
  
  3.5.0
- date_from_unix_date
  Create date from the number of
  days
  since 1970-01-01.
  
  Parameters:
  
  days - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- year
  Extracts the year as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- quarter
  Extracts the quarter as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- month
  Extracts the month as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- dayofweek
  Extracts the day of the week as an integer from a given date/timestamp/string. Ranges from 1 for a Sunday through to 7 for a Saturday
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  2.3.0
- dayofmonth
  Extracts the day of the month as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- day
  Extracts the day of the month as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  3.5.0
- dayofyear
  Extracts the day of the year as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- hour
  Extracts the hours as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- extract
  
  Parameters:
  
  field - selects which part of the source should be extracted.
  
  source - a date/timestamp or interval column from where field should be extracted.
  
  Returns:
  
  a part of the date/timestamp or interval source
  
  Since:
  
  3.5.0
- date_part
  
  Parameters:
  
  field - selects which part of the source should be extracted, and supported string values are as same as the fields of the equivalent function extract.
  
  source - a date/timestamp or interval column from where field should be extracted.
  
  Returns:
  
  a part of the date/timestamp or interval source
  
  Since:
  
  3.5.0
- datepart
  
  Parameters:
  
  field - selects which part of the source should be extracted, and supported string values are as same as the fields of the equivalent function EXTRACT.
  
  source - a date/timestamp or interval column from where field should be extracted.
  
  Returns:
  
  a part of the date/timestamp or interval source
  
  Since:
  
  3.5.0
- last_day
  
  Parameters:
  
  e - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  A date, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- minute
  Extracts the minutes as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- weekday
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_date
  
  Parameters:
  
  year - (undocumented)
  
  month - (undocumented)
  
  day - (undocumented)
  
  Returns:
  
  A date created from year, month and day fields.
  
  Since:
  
  3.3.0
- months_between
  Returns number of months between dates
  start
  and
  end
  .
  
  A whole number is returned if both inputs have the same day of month or both are the last day of their respective months. Otherwise, the difference is calculated assuming 31 days per month.
  
  For example:
```
 months_between("2017-11-14", "2017-07-14")  // returns 4.0
 months_between("2017-01-01", "2017-01-10")  // returns 0.29032258
 months_between("2017-06-01", "2017-06-16 12:00:00")  // returns -0.5
 
```
  Parameters:
  
  end - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  start - A date, timestamp or string. If a string, the data must be in a format that can cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  A double, or null if either end or start were strings that could not be cast to a timestamp. Negative if end is before start
  
  Since:
  
  1.5.0
- months_between
  Returns number of months between dates end and start. If roundOff is set to true, the result is rounded off to 8 digits; it is not rounded otherwise.
  
  Parameters:
  
  end - (undocumented)
  
  start - (undocumented)
  
  roundOff - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- next_day
  Returns the first date which is later than the value of the
  date
  column that is on the specified day of the week.
  
  For example, next_day('2015-07-27', "Sunday") returns 2015-08-02 because that is the first Sunday after 2015-07-27.
  
  Parameters:
  
  date - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  dayOfWeek - Case insensitive, and accepts: "Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"
  
  Returns:
  
  A date, or null if date was a string that could not be cast to a date or if dayOfWeek was an invalid value
  
  Since:
  
  1.5.0
- next_day
  Returns the first date which is later than the value of the
  date
  column that is on the specified day of the week.
  
  For example, next_day('2015-07-27', "Sunday") returns 2015-08-02 because that is the first Sunday after 2015-07-27.
  
  Parameters:
  
  date - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  dayOfWeek - A column of the day of week. Case insensitive, and accepts: "Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"
  
  Returns:
  
  A date, or null if date was a string that could not be cast to a date or if dayOfWeek was an invalid value
  
  Since:
  
  3.2.0
- second
  Extracts the seconds as an integer from a given date/timestamp/string.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a timestamp
  
  Since:
  
  1.5.0
- weekofyear
  Extracts the week number as an integer from a given date/timestamp/string.
  
  A week is considered to start on a Monday and week 1 is the first week with more than 3 days, as defined by ISO 8601
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  An integer, or null if the input was a string that could not be cast to a date
  
  Since:
  
  1.5.0
- from_unixtime
  
  Parameters:
  
  ut - A number of a type that is castable to a long, such as string or integer. Can be negative for timestamps before the unix epoch
  
  Returns:
  
  A string, or null if the input was a string that could not be cast to a long
  
  Since:
  
  1.5.0
- from_unixtime
  Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string representing the timestamp of that moment in the current system time zone in the given format.
  
  See Datetime Patterns for valid date and time format patterns
  
  Parameters:
  
  ut - A number of a type that is castable to a long, such as string or integer. Can be negative for timestamps before the unix epoch
  
  f - A date time pattern that the input will be formatted to
  
  Returns:
  
  A string, or null if ut was a string that could not be cast to a long or f was an invalid date time pattern
  
  Since:
  
  1.5.0
- unix_timestamp public static Column unix_timestamp
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  All calls of unix_timestamp within the same query return the same value (i.e. the current timestamp is calculated at the start of query evaluation).
- unix_timestamp
  
  Parameters:
  
  s - A date, timestamp or string. If a string, the data must be in the yyyy-MM-dd HH:mm:ss format
  
  Returns:
  
  A long, or null if the input was a string not of the correct format
  
  Since:
  
  1.5.0
- unix_timestamp
  Converts time string with given pattern to Unix timestamp (in seconds).
  
  See Datetime Patterns for valid date and time format patterns
  
  Parameters:
  
  s - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  p - A date time pattern detailing the format of s when s is a string
  
  Returns:
  
  A long, or null if s was a string that could not be cast to a date or p was an invalid format
  
  Since:
  
  1.5.0
- to_timestamp
  Converts to a timestamp by casting rules to
  TimestampType
  .
  
  Parameters:
  
  s - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  A timestamp, or null if the input was a string that could not be cast to a timestamp
  
  Since:
  
  2.2.0
- to_timestamp
  Converts time string with the given pattern to timestamp.
  
  See Datetime Patterns for valid date and time format patterns
  
  Parameters:
  
  s - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  fmt - A date time pattern detailing the format of s when s is a string
  
  Returns:
  
  A timestamp, or null if s was a string that could not be cast to a timestamp or fmt was an invalid format
  
  Since:
  
  2.2.0
- try_to_timestamp
  Parses the
  s
  with the
  format
  to a timestamp. The function always returns null on an invalid input with
  /
  without ANSI SQL mode enabled. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  .
  
  Parameters:
  
  s - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_to_timestamp
  Parses the
  s
  to a timestamp. The function always returns null on an invalid input with
  /
  without ANSI SQL mode enabled. It follows casting rules to a timestamp. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  .
  
  Parameters:
  
  s - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_date
  Converts the column into
  DateType
  by casting rules to
  DateType
  .
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- to_date
  Converts the column into a
  DateType
  with a specified format
  
  See Datetime Patterns for valid date and time format patterns
  
  Parameters:
  
  e - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  fmt - A date time pattern detailing the format of e when eis a string
  
  Returns:
  
  A date, or null if e was a string that could not be cast to a date or fmt was an invalid format
  
  Since:
  
  2.2.0
- unix_date
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- unix_micros
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- unix_millis
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- unix_seconds
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- trunc
  Returns date truncated to the unit specified by the format.
  
  For example, trunc("2018-11-19 12:01:19", "year") returns 2018-01-01
  
  Parameters:
  
  date - A date, timestamp or string. If a string, the data must be in a format that can be cast to a date, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  format - : 'year', 'yyyy', 'yy' to truncate by year, or 'month', 'mon', 'mm' to truncate by month Other options are: 'week', 'quarter'
  
  Returns:
  
  A date, or null if date was a string that could not be cast to a date or format was an invalid value
  
  Since:
  
  1.5.0
- date_trunc
  Returns timestamp truncated to the unit specified by the format.
  
  For example, date_trunc("year", "2018-11-19 12:01:19") returns 2018-01-01 00:00:00
  
  Parameters:
  
  format - : 'year', 'yyyy', 'yy' to truncate by year, 'month', 'mon', 'mm' to truncate by month, 'day', 'dd' to truncate by day, Other options are: 'microsecond', 'millisecond', 'second', 'minute', 'hour', 'week', 'quarter'
  
  timestamp - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  Returns:
  
  A timestamp, or null if timestamp was a string that could not be cast to a timestamp or format was an invalid value
  
  Since:
  
  2.3.0
- from_utc_timestamp
  
  Parameters:
  
  ts - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  tz - A string detailing the time zone ID that the input should be adjusted to. It should be in the format of either region-based zone IDs or zone offsets. Region IDs must have the form 'area/city', such as 'America/Los_Angeles'. Zone offsets must be in the format '(+|-)HH:mm', for example '-08:00' or '+01:00'. Also 'UTC' and 'Z' are supported as aliases of '+00:00'. Other short names are not recommended to use because they can be ambiguous.
  
  Returns:
  
  A timestamp, or null if ts was a string that could not be cast to a timestamp or tz was an invalid value
  
  Since:
  
  1.5.0
- from_utc_timestamp
  Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in UTC, and renders that time as a timestamp in the given time zone. For example, 'GMT+1' would yield '2017-07-14 03:40:00.0'.
  
  Parameters:
  
  ts - (undocumented)
  
  tz - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- to_utc_timestamp
  
  Parameters:
  
  ts - A date, timestamp or string. If a string, the data must be in a format that can be cast to a timestamp, such as yyyy-MM-dd or yyyy-MM-dd HH:mm:ss.SSSS
  
  tz - A string detailing the time zone ID that the input should be adjusted to. It should be in the format of either region-based zone IDs or zone offsets. Region IDs must have the form 'area/city', such as 'America/Los_Angeles'. Zone offsets must be in the format '(+|-)HH:mm', for example '-08:00' or '+01:00'. Also 'UTC' and 'Z' are supported as aliases of '+00:00'. Other short names are not recommended to use because they can be ambiguous.
  
  Returns:
  
  A timestamp, or null if ts was a string that could not be cast to a timestamp or tz was an invalid value
  
  Since:
  
  1.5.0
- to_utc_timestamp
  Given a timestamp like '2017-07-14 02:40:00.0', interprets it as a time in the given time zone, and renders that time as a timestamp in UTC. For example, 'GMT+1' would yield '2017-07-14 01:40:00.0'.
  
  Parameters:
  
  ts - (undocumented)
  
  tz - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- window
  Bucketize rows into one or more time windows given a timestamp specifying column. Window starts are inclusive but the window ends are exclusive, e.g. 12:05 will be in the window [12:05,12:10) but not in [12:00,12:05). Windows can support microsecond precision. Windows in the order of months are not supported. The following example takes the average stock price for a one minute window every 10 seconds starting 5 seconds after the hour:
```
   val df = ... // schema => timestamp: TimestampType, stockId: StringType, price: DoubleType
   df.groupBy(window($"timestamp", "1 minute", "10 seconds", "5 seconds"), $"stockId")
     .agg(mean("price"))
 
```
  The windows will look like:
```
   09:00:05-09:01:05
   09:00:15-09:01:15
   09:00:25-09:01:25 ...
 
```
  For a streaming query, you may use the function current_timestamp to generate windows on processing time.
  
  Parameters:
  
  timeColumn - The column or the expression to use as the timestamp for windowing by time. The time column must be of TimestampType or TimestampNTZType.
  
  windowDuration - A string specifying the width of the window, e.g. 10 minutes, 1 second. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers. Note that the duration is a fixed length of time, and does not vary over time according to a calendar. For example, 1 day always means 86,400,000 milliseconds, not a calendar day.
  
  slideDuration - A string specifying the sliding interval of the window, e.g. 1 minute. A new window will be generated every slideDuration. Must be less than or equal to the windowDuration. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers. This duration is likewise absolute, and does not vary according to a calendar.
  
  startTime - The offset with respect to 1970-01-01 00:00:00 UTC with which to start window intervals. For example, in order to have hourly tumbling windows that start 15 minutes past the hour, e.g. 12:15-13:15, 13:15-14:15... provide startTime as 15 minutes.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- window
  Bucketize rows into one or more time windows given a timestamp specifying column. Window starts are inclusive but the window ends are exclusive, e.g. 12:05 will be in the window [12:05,12:10) but not in [12:00,12:05). Windows can support microsecond precision. Windows in the order of months are not supported. The windows start beginning at 1970-01-01 00:00:00 UTC. The following example takes the average stock price for a one minute window every 10 seconds:
```
   val df = ... // schema => timestamp: TimestampType, stockId: StringType, price: DoubleType
   df.groupBy(window($"timestamp", "1 minute", "10 seconds"), $"stockId")
     .agg(mean("price"))
 
```
  The windows will look like:
```
   09:00:00-09:01:00
   09:00:10-09:01:10
   09:00:20-09:01:20 ...
 
```
  For a streaming query, you may use the function current_timestamp to generate windows on processing time.
  
  Parameters:
  
  timeColumn - The column or the expression to use as the timestamp for windowing by time. The time column must be of TimestampType or TimestampNTZType.
  
  windowDuration - A string specifying the width of the window, e.g. 10 minutes, 1 second. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers. Note that the duration is a fixed length of time, and does not vary over time according to a calendar. For example, 1 day always means 86,400,000 milliseconds, not a calendar day.
  
  slideDuration - A string specifying the sliding interval of the window, e.g. 1 minute. A new window will be generated every slideDuration. Must be less than or equal to the windowDuration. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers. This duration is likewise absolute, and does not vary according to a calendar.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- window
  Generates tumbling time windows given a timestamp specifying column. Window starts are inclusive but the window ends are exclusive, e.g. 12:05 will be in the window [12:05,12:10) but not in [12:00,12:05). Windows can support microsecond precision. Windows in the order of months are not supported. The windows start beginning at 1970-01-01 00:00:00 UTC. The following example takes the average stock price for a one minute tumbling window:
```
   val df = ... // schema => timestamp: TimestampType, stockId: StringType, price: DoubleType
   df.groupBy(window($"timestamp", "1 minute"), $"stockId")
     .agg(mean("price"))
 
```
  The windows will look like:
```
   09:00:00-09:01:00
   09:01:00-09:02:00
   09:02:00-09:03:00 ...
 
```
  For a streaming query, you may use the function current_timestamp to generate windows on processing time.
  
  Parameters:
  
  timeColumn - The column or the expression to use as the timestamp for windowing by time. The time column must be of TimestampType or TimestampNTZType.
  
  windowDuration - A string specifying the width of the window, e.g. 10 minutes, 1 second. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- window_time
  Extracts the event time from the window column.
  
  The window column is of StructType { start: Timestamp, end: Timestamp } where start is inclusive and end is exclusive. Since event time can support microsecond precision, window_time(window) = window.end - 1 microsecond.
  
  Parameters:
  
  windowColumn - The window column (typically produced by window aggregation) of type StructType { start: Timestamp, end: Timestamp }
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- session_window
  Generates session window given a timestamp specifying column.
  
  Session window is one of dynamic windows, which means the length of window is varying according to the given inputs. The length of session window is defined as "the timestamp of latest input of the session + gap duration", so when the new inputs are bound to the current session window, the end time of session window can be expanded according to the new inputs.
  
  Windows can support microsecond precision. gapDuration in the order of months are not supported.
  
  For a streaming query, you may use the function current_timestamp to generate windows on processing time.
  
  Parameters:
  
  timeColumn - The column or the expression to use as the timestamp for windowing by time. The time column must be of TimestampType or TimestampNTZType.
  
  gapDuration - A string specifying the timeout of the session, e.g. 10 minutes, 1 second. Check org.apache.spark.unsafe.types.CalendarInterval for valid duration identifiers.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- session_window
  Generates session window given a timestamp specifying column.
  
  Session window is one of dynamic windows, which means the length of window is varying according to the given inputs. For static gap duration, the length of session window is defined as "the timestamp of latest input of the session + gap duration", so when the new inputs are bound to the current session window, the end time of session window can be expanded according to the new inputs.
  
  Besides a static gap duration value, users can also provide an expression to specify gap duration dynamically based on the input row. With dynamic gap duration, the closing of a session window does not depend on the latest input anymore. A session window's range is the union of all events' ranges which are determined by event start time and evaluated gap duration during the query execution. Note that the rows with negative or zero gap duration will be filtered out from the aggregation.
  
  Windows can support microsecond precision. gapDuration in the order of months are not supported.
  
  For a streaming query, you may use the function current_timestamp to generate windows on processing time.
  
  Parameters:
  
  timeColumn - The column or the expression to use as the timestamp for windowing by time. The time column must be of TimestampType or TimestampNTZType.
  
  gapDuration - A column specifying the timeout of the session. It could be static value, e.g. 10 minutes, 1 second, or an expression/UDF that specifies gap duration dynamically based on the input row.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- timestamp_seconds
  Converts the number of seconds from the Unix epoch (1970-01-01T00:00:00Z) to a timestamp.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- timestamp_millis
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- timestamp_micros
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- timestamp_diff
  
  Parameters:
  
  unit - (undocumented)
  
  start - (undocumented)
  
  end - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- timestamp_add
  
  Parameters:
  
  unit - (undocumented)
  
  quantity - (undocumented)
  
  ts - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- to_timestamp_ltz
  Parses the
  timestamp
  expression with the
  format
  expression to a timestamp without time zone. Returns null with invalid input.
  
  Parameters:
  
  timestamp - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_timestamp_ltz
  Parses the
  timestamp
  expression with the default format to a timestamp without time zone. The default format follows casting rules to a timestamp. Returns null with invalid input.
  
  Parameters:
  
  timestamp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_timestamp_ntz
  Parses the
  timestamp_str
  expression with the
  format
  expression to a timestamp without time zone. Returns null with invalid input.
  
  Parameters:
  
  timestamp - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_timestamp_ntz
  Parses the
  timestamp
  expression with the default format to a timestamp without time zone. The default format follows casting rules to a timestamp. Returns null with invalid input.
  
  Parameters:
  
  timestamp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_unix_timestamp
  
  Parameters:
  
  timeExp - (undocumented)
  
  format - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_unix_timestamp
  
  Parameters:
  
  timeExp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- monthname
  
  Parameters:
  
  timeExp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- dayname
  
  Parameters:
  
  timeExp - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- array_contains
  Returns null if the array is null, true if the array contains value, and false otherwise.
  
  Parameters:
  
  column - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- array_append
  
  Parameters:
  
  column - (undocumented)
  
  element - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- arrays_overlap
  Returns true if a1 and a2 have at least one non-null element in common. If not and both the arrays are non-empty and any of them contains a null, it returns null. It returns false otherwise.
  
  Parameters:
  
  a1 - (undocumented)
  
  a2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- slice
  Returns an array containing all the elements in
  x
  from index
  start
  (or starting from the end if
  start
  is negative) with the specified
  length
  .
  
  Parameters:
  
  x - the array column to be sliced
  
  start - the starting index
  
  length - the length of the slice
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- slice
  Returns an array containing all the elements in
  x
  from index
  start
  (or starting from the end if
  start
  is negative) with the specified
  length
  .
  
  Parameters:
  
  x - the array column to be sliced
  
  start - the starting index
  
  length - the length of the slice
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.1.0
- array_join
  Concatenates the elements of column using the delimiter. Null values are replaced with nullReplacement.
  
  Parameters:
  
  column - (undocumented)
  
  delimiter - (undocumented)
  
  nullReplacement - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_join
  Concatenates the elements of column using the delimiter.
  
  Parameters:
  
  column - (undocumented)
  
  delimiter - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- concat public static Column concat(scala.collection.immutable.Seq<Column> exprs)
  
  Parameters:
  
  exprs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
  
  Note:
  
  Returns null if any of the input columns are null.
- array_position
  
  Parameters:
  
  column - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
  
  Note:
  
  The position is not zero based, but 1 based index. Returns 0 if value could not be found in array.
- element_at
  
  Parameters:
  
  column - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- try_element_at
  (array, index) - Returns element of array at given (1-based) index. If Index is 0, Spark will throw an error. If index < 0, accesses elements from the last to the first. The function always returns NULL if the index exceeds the length of the array.
  
  (map, key) - Returns value for given key. The function always returns NULL if the key is not contained in the map.
  
  Parameters:
  
  column - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- get
  
  Parameters:
  
  column - (undocumented)
  
  index - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- array_sort
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_sort
  
  Parameters:
  
  e - (undocumented)
  
  comparator - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- array_remove
  
  Parameters:
  
  column - (undocumented)
  
  element - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_compact
  
  Parameters:
  
  column - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- array_prepend
  
  Parameters:
  
  column - (undocumented)
  
  element - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- array_distinct
  Removes duplicate values from the array.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_intersect
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_insert
  
  Parameters:
  
  arr - (undocumented)
  
  pos - (undocumented)
  
  value - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- array_union
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_except
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- transform
  Returns an array of elements after applying a transformation to each element in the input array.
```
   df.select(transform(col("i"), x => x + 1))
 
```
  Parameters:
  
  column - the input array column
  
  f - col => transformed_col, the lambda function to transform the input column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- transform
  Returns an array of elements after applying a transformation to each element in the input array.
```
   df.select(transform(col("i"), (x, i) => x + i))
 
```
  Parameters:
  
  column - the input array column
  
  f - (col, index) => transformed_col, the lambda function to transform the input column given the index. Indices start at 0.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- exists
  Returns whether a predicate holds for one or more elements in the array.
```
   df.select(exists(col("i"), _ % 2 === 0))
 
```
  Parameters:
  
  column - the input array column
  
  f - col => predicate, the Boolean predicate to check the input column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- forall
  Returns whether a predicate holds for every element in the array.
```
   df.select(forall(col("i"), x => x % 2 === 0))
 
```
  Parameters:
  
  column - the input array column
  
  f - col => predicate, the Boolean predicate to check the input column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- filter
  Returns an array of elements for which a predicate holds in a given array.
```
   df.select(filter(col("s"), x => x % 2 === 0))
 
```
  Parameters:
  
  column - the input array column
  
  f - col => predicate, the Boolean predicate to filter the input column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- filter
  Returns an array of elements for which a predicate holds in a given array.
```
   df.select(filter(col("s"), (x, i) => i % 2 === 0))
 
```
  Parameters:
  
  column - the input array column
  
  f - (col, index) => predicate, the Boolean predicate to filter the input column given the index. Indices start at 0.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- aggregate
  Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state. The final state is converted into the final result by applying a finish function.
```
   df.select(aggregate(col("i"), lit(0), (acc, x) => acc + x, _ * 10))
 
```
  Parameters:
  
  expr - the input array column
  
  initialValue - the initial value
  
  merge - (combined_value, input_value) => combined_value, the merge function to merge an input value to the combined_value
  
  finish - combined_value => final_value, the lambda function to convert the combined value of all inputs to final result
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- aggregate
  Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.
```
   df.select(aggregate(col("i"), lit(0), (acc, x) => acc + x))
 
```
  Parameters:
  
  expr - the input array column
  
  initialValue - the initial value
  
  merge - (combined_value, input_value) => combined_value, the merge function to merge an input value to the combined_value
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- reduce
  Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state. The final state is converted into the final result by applying a finish function.
```
   df.select(aggregate(col("i"), lit(0), (acc, x) => acc + x, _ * 10))
 
```
  Parameters:
  
  expr - the input array column
  
  initialValue - the initial value
  
  merge - (combined_value, input_value) => combined_value, the merge function to merge an input value to the combined_value
  
  finish - combined_value => final_value, the lambda function to convert the combined value of all inputs to final result
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- reduce
  Applies a binary operator to an initial state and all elements in the array, and reduces this to a single state.
```
   df.select(aggregate(col("i"), lit(0), (acc, x) => acc + x))
 
```
  Parameters:
  
  expr - the input array column
  
  initialValue - the initial value
  
  merge - (combined_value, input_value) => combined_value, the merge function to merge an input value to the combined_value
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- zip_with
  Merge two given arrays, element-wise, into a single array using a function. If one array is shorter, nulls are appended at the end to match the length of the longer array, before applying the function.
```
   df.select(zip_with(df1("val1"), df1("val2"), (x, y) => x + y))
 
```
  Parameters:
  
  left - the left input array column
  
  right - the right input array column
  
  f - (lCol, rCol) => col, the lambda function to merge two input columns into one column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- transform_keys
  Applies a function to every key-value pair in a map and returns a map with the results of those applications as the new keys for the pairs.
```
   df.select(transform_keys(col("i"), (k, v) => k + v))
 
```
  Parameters:
  
  expr - the input map column
  
  f - (key, value) => new_key, the lambda function to transform the key of input map column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- transform_values
  Applies a function to every key-value pair in a map and returns a map with the results of those applications as the new values for the pairs.
```
   df.select(transform_values(col("i"), (k, v) => k + v))
 
```
  Parameters:
  
  expr - the input map column
  
  f - (key, value) => new_value, the lambda function to transform the value of input map column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- map_filter
  Returns a map whose key-value pairs satisfy a predicate.
```
   df.select(map_filter(col("m"), (k, v) => k * 10 === v))
 
```
  Parameters:
  
  expr - the input map column
  
  f - (key, value) => predicate, the Boolean predicate to filter the input map column
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- map_zip_with
  Merge two given maps, key-wise into a single map using a function.
```
   df.select(map_zip_with(df("m1"), df("m2"), (k, v1, v2) => k === v1 + v2))
 
```
  Parameters:
  
  left - the left input map column
  
  right - the right input map column
  
  f - (key, value1, value2) => new_value, the lambda function to merge the map values
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- explode
  Creates a new row for each element in the given array or map column. Uses the default column name
  col
  for elements in the array and
  key
  and
  value
  for elements in the map unless specified otherwise.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- explode_outer
  Creates a new row for each element in the given array or map column. Uses the default column name
  col
  for elements in the array and
  key
  and
  value
  for elements in the map unless specified otherwise. Unlike explode, if the array/map is null or empty then null is produced.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- posexplode
  Creates a new row for each element with position in the given array or map column. Uses the default column name
  pos
  for position, and
  col
  for elements in the array and
  key
  and
  value
  for elements in the map unless specified otherwise.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- posexplode_outer
  Creates a new row for each element with position in the given array or map column. Uses the default column name
  pos
  for position, and
  col
  for elements in the array and
  key
  and
  value
  for elements in the map unless specified otherwise. Unlike posexplode, if the array/map is null or empty then the row (null, null) is produced.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- inline
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- inline_outer
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0
- get_json_object
  
  Parameters:
  
  e - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- json_tuple public static Column json_tuple(Column json, scala.collection.immutable.Seq<String> fields)
  
  Parameters:
  
  json - (undocumented)
  
  fields - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.6.0
- from_json
  (Scala-specific) Parses a column containing a JSON string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  options - options to control how the json is parsed. Accepts the same options as the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- from_json
  (Scala-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- from_json
  (Java-specific) Parses a column containing a JSON string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- from_json
  (Java-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- from_json
  Parses a column containing a JSON string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- from_json
  Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.2.0
- from_json
  (Java-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema as a DDL-formatted string.
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- from_json
  (Scala-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema as a DDL-formatted string.
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- from_json
  (Scala-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  of
  StructType
  s with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- from_json
  (Java-specific) Parses a column containing a JSON string into a
  MapType
  with
  StringType
  as keys type,
  StructType
  or
  ArrayType
  of
  StructType
  s with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing JSON data.
  
  schema - the schema to use when parsing the json string
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- try_parse_json
  
  Parameters:
  
  json - a string column that contains JSON data.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- parse_json
  
  Parameters:
  
  json - a string column that contains JSON data.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- to_variant_object
  
  Parameters:
  
  col - a column with a nested schema or column name.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- is_variant_null
  
  Parameters:
  
  v - a variant column.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- variant_get
  Extracts a sub-variant from
  v
  according to
  path
  string, and then cast the sub-variant to
  targetType
  . Returns null if the path does not exist. Throws an exception if the cast fails.
  
  Parameters:
  
  v - a variant column.
  
  path - the extraction path. A valid path should start with $ and is followed by zero or more segments like [123], .name, ['name'], or ["name"].
  
  targetType - the target data type to cast into, in a DDL-formatted string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- variant_get
  Extracts a sub-variant from
  v
  according to
  path
  column, and then cast the sub-variant to
  targetType
  . Returns null if the path does not exist. Throws an exception if the cast fails.
  
  Parameters:
  
  v - a variant column.
  
  path - the column containing the extraction path strings. A valid path string should start with $ and is followed by zero or more segments like [123], .name, ['name'], or ["name"].
  
  targetType - the target data type to cast into, in a DDL-formatted string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_variant_get
  Extracts a sub-variant from
  v
  according to
  path
  string, and then cast the sub-variant to
  targetType
  . Returns null if the path does not exist or the cast fails..
  
  Parameters:
  
  v - a variant column.
  
  path - the extraction path. A valid path should start with $ and is followed by zero or more segments like [123], .name, ['name'], or ["name"].
  
  targetType - the target data type to cast into, in a DDL-formatted string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_variant_get
  Extracts a sub-variant from
  v
  according to
  path
  column, and then cast the sub-variant to
  targetType
  . Returns null if the path does not exist or the cast fails..
  
  Parameters:
  
  v - a variant column.
  
  path - the column containing the extraction path strings. A valid path string should start with $ and is followed by zero or more segments like [123], .name, ['name'], or ["name"].
  
  targetType - the target data type to cast into, in a DDL-formatted string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_variant
  
  Parameters:
  
  v - a variant column.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_variant_agg
  
  Parameters:
  
  v - a variant column.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_json
  
  Parameters:
  
  json - a JSON string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- schema_of_json
  
  Parameters:
  
  json - a foldable string column containing a JSON string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- schema_of_json
  
  Parameters:
  
  json - a foldable string column containing JSON data.
  
  options - options to control how the json is parsed. accepts the same options and the json data source. See Data Source Option in the version you use.
  
  Returns:
  
  a column with string literal containing schema in DDL format.
  
  Since:
  
  3.0.0
- json_array_length
  Returns the number of elements in the outermost JSON array.
  NULL
  is returned in case of any other valid JSON string,
  NULL
  or an invalid JSON.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- json_object_keys
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- to_json
  (Scala-specific) Converts a column containing a
  StructType
  ,
  ArrayType
  or a
  MapType
  into a JSON string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct, an array or a map.
  
  options - options to control how the struct column is converted into a json string. accepts the same options and the json data source. See Data Source Option in the version you use. Additionally the function supports the pretty option which enables pretty JSON generation.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- to_json
  (Java-specific) Converts a column containing a
  StructType
  ,
  ArrayType
  or a
  MapType
  into a JSON string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct, an array or a map.
  
  options - options to control how the struct column is converted into a json string. accepts the same options and the json data source. See Data Source Option in the version you use. Additionally the function supports the pretty option which enables pretty JSON generation.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- to_json
  Converts a column containing a
  StructType
  ,
  ArrayType
  or a
  MapType
  into a JSON string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct, an array or a map.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.1.0
- mask
  
  Parameters:
  
  input - string value to mask. Supported types: STRING, VARCHAR, CHAR
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- mask
  
  Parameters:
  
  input - string value to mask. Supported types: STRING, VARCHAR, CHAR
  
  upperChar - character to replace upper-case characters with. Specify NULL to retain original character.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- mask
  
  Parameters:
  
  input - string value to mask. Supported types: STRING, VARCHAR, CHAR
  
  upperChar - character to replace upper-case characters with. Specify NULL to retain original character.
  
  lowerChar - character to replace lower-case characters with. Specify NULL to retain original character.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- mask
  
  Parameters:
  
  input - string value to mask. Supported types: STRING, VARCHAR, CHAR
  
  upperChar - character to replace upper-case characters with. Specify NULL to retain original character.
  
  lowerChar - character to replace lower-case characters with. Specify NULL to retain original character.
  
  digitChar - character to replace digit characters with. Specify NULL to retain original character.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- mask
  
  Parameters:
  
  input - string value to mask. Supported types: STRING, VARCHAR, CHAR
  
  upperChar - character to replace upper-case characters with. Specify NULL to retain original character.
  
  lowerChar - character to replace lower-case characters with. Specify NULL to retain original character.
  
  digitChar - character to replace digit characters with. Specify NULL to retain original character.
  
  otherChar - character to replace all other characters with. Specify NULL to retain original character.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- size
  Returns length of array or map.
  
  This function returns -1 for null input only if spark.sql.ansi.enabled is false and spark.sql.legacy.sizeOfNull is true. Otherwise, it returns null for null input. With the default settings, the function returns null for null input.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- cardinality
  Returns length of array or map. This is an alias of
  size
  function.
  
  This function returns -1 for null input only if spark.sql.ansi.enabled is false and spark.sql.legacy.sizeOfNull is true. Otherwise, it returns null for null input. With the default settings, the function returns null for null input.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- sort_array
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- sort_array
  
  Parameters:
  
  e - (undocumented)
  
  asc - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- array_min
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_max
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_size
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- array_agg
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
  
  Note:
  
  The function is non-deterministic because the order of collected results depends on the order of the rows which may be non-deterministic after a shuffle.
- shuffle
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
  
  Note:
  
  The function is non-deterministic.
- shuffle
  
  Parameters:
  
  e - (undocumented)
  
  seed - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
  
  Note:
  
  The function is non-deterministic.
- reverse
  Returns a reversed string or an array with reverse order of elements.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- flatten
  Creates a single array from an array of arrays. If a structure of nested arrays is deeper than two levels, only one level of nesting is removed.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- sequence
  
  Parameters:
  
  start - (undocumented)
  
  stop - (undocumented)
  
  step - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- sequence
  
  Parameters:
  
  start - (undocumented)
  
  stop - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_repeat
  
  Parameters:
  
  left - (undocumented)
  
  right - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- array_repeat
  
  Parameters:
  
  e - (undocumented)
  
  count - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- map_contains_key
  Returns true if the map contains the key.
  
  Parameters:
  
  column - (undocumented)
  
  key - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.3.0
- map_keys
  Returns an unordered array containing the keys of the map.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- map_values
  Returns an unordered array containing the values of the map.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- map_entries
  Returns an unordered array of all entries in the given map.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- map_from_entries
  Returns a map created from the given array of entries.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- arrays_zip public static Column arrays_zip(scala.collection.immutable.Seq<Column> e)
  Returns a merged array of structs in which the N-th struct contains all N-th values of input arrays.
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- map_concat public static Column map_concat(scala.collection.immutable.Seq<Column> cols)
  Returns the union of all the given maps.
  
  Parameters:
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.4.0
- from_csv
  Parses a column containing a CSV string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing CSV data.
  
  schema - the schema to use when parsing the CSV string
  
  options - options to control how the CSV is parsed. accepts the same options and the CSV data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- from_csv
  (Java-specific) Parses a column containing a CSV string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing CSV data.
  
  schema - the schema to use when parsing the CSV string
  
  options - options to control how the CSV is parsed. accepts the same options and the CSV data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- schema_of_csv
  
  Parameters:
  
  csv - a CSV string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- schema_of_csv
  
  Parameters:
  
  csv - a foldable string column containing a CSV string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- schema_of_csv
  
  Parameters:
  
  csv - a foldable string column containing a CSV string.
  
  options - options to control how the CSV is parsed. accepts the same options and the CSV data source. See Data Source Option in the version you use.
  
  Returns:
  
  a column with string literal containing schema in DDL format.
  
  Since:
  
  3.0.0
- to_csv
  (Java-specific) Converts a column containing a
  StructType
  into a CSV string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct.
  
  options - options to control how the struct column is converted into a CSV string. It accepts the same options and the CSV data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- to_csv
  Converts a column containing a
  StructType
  into a CSV string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- from_xml
  Parses a column containing a XML string into the data type corresponding to the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing XML data.
  
  schema - the schema to use when parsing the XML string
  
  options - options to control how the XML is parsed. accepts the same options and the XML data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- from_xml
  (Java-specific) Parses a column containing a XML string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing XML data.
  
  schema - the schema as a DDL-formatted string.
  
  options - options to control how the XML is parsed. accepts the same options and the xml data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- from_xml
  (Java-specific) Parses a column containing a XML string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing XML data.
  
  schema - the schema to use when parsing the XML string
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- from_xml
  (Java-specific) Parses a column containing a XML string into a
  StructType
  with the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing XML data.
  
  schema - the schema to use when parsing the XML string
  
  options - options to control how the XML is parsed. accepts the same options and the XML data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- from_xml
  Parses a column containing a XML string into the data type corresponding to the specified schema. Returns
  null
  , in the case of an unparseable string.
  
  Parameters:
  
  e - a string column containing XML data.
  
  schema - the schema to use when parsing the XML string
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_xml
  
  Parameters:
  
  xml - a XML string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_xml
  
  Parameters:
  
  xml - a foldable string column containing a XML string.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- schema_of_xml
  
  Parameters:
  
  xml - a foldable string column containing XML data.
  
  options - options to control how the xml is parsed. accepts the same options and the XML data source. See Data Source Option in the version you use.
  
  Returns:
  
  a column with string literal containing schema in DDL format.
  
  Since:
  
  4.0.0
- to_xml
  (Java-specific) Converts a column containing a
  StructType
  into a XML string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct.
  
  options - options to control how the struct column is converted into a XML string. It accepts the same options as the XML data source. See Data Source Option in the version you use.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- to_xml
  Converts a column containing a
  StructType
  into a XML string with the specified schema. Throws an exception, in the case of an unsupported type.
  
  Parameters:
  
  e - a column containing a struct.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- years
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- months
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- days
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- xpath
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_boolean
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_double
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_number
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_float
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_int
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_long
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_short
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- xpath_string
  
  Parameters:
  
  xml - (undocumented)
  
  path - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- hours
  
  Parameters:
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- convert_timezone
  Converts the timestamp without time zone
  sourceTs
  from the
  sourceTz
  time zone to
  targetTz
  .
  
  Parameters:
  
  sourceTz - the time zone for the input timestamp. If it is missed, the current session time zone is used as the source time zone.
  
  targetTz - the time zone to which the input timestamp should be converted.
  
  sourceTs - a timestamp without time zone.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- convert_timezone
  Converts the timestamp without time zone
  sourceTs
  from the current time zone to
  targetTz
  .
  
  Parameters:
  
  targetTz - the time zone to which the input timestamp should be converted.
  
  sourceTs - a timestamp without time zone.
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_dt_interval
  
  Parameters:
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_dt_interval
  
  Parameters:
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_dt_interval
  
  Parameters:
  
  days - (undocumented)
  
  hours - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_dt_interval
  
  Parameters:
  
  days - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_dt_interval public static Column make_dt_interval
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  days - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  weeks - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_interval
  This is a special version of
  make_interval
  that performs the same operation, but returns a NULL value instead of raising an error if interval cannot be created.
  
  Parameters:
  
  years - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_interval
  
  Parameters:
  
  years - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_interval public static Column make_interval
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_timestamp
  Create timestamp from years, months, days, hours, mins, secs and timezone fields. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  . If the configuration
  spark.sql.ansi.enabled
  is false, the function returns NULL on invalid inputs. Otherwise, it will throw an error instead.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  timezone - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_timestamp
  Create timestamp from years, months, days, hours, mins and secs fields. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  . If the configuration
  spark.sql.ansi.enabled
  is false, the function returns NULL on invalid inputs. Otherwise, it will throw an error instead.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_timestamp
  Try to create a timestamp from years, months, days, hours, mins, secs and timezone fields. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  . The function returns NULL on invalid inputs.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  timezone - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_make_timestamp
  Try to create a timestamp from years, months, days, hours, mins, and secs fields. The result data type is consistent with the value of configuration
  spark.sql.timestampType
  . The function returns NULL on invalid inputs.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_timestamp_ltz
  Create the current timestamp with local time zone from years, months, days, hours, mins, secs and timezone fields. If the configuration
  spark.sql.ansi.enabled
  is false, the function returns NULL on invalid inputs. Otherwise, it will throw an error instead.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  timezone - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_timestamp_ltz
  Create the current timestamp with local time zone from years, months, days, hours, mins and secs fields. If the configuration
  spark.sql.ansi.enabled
  is false, the function returns NULL on invalid inputs. Otherwise, it will throw an error instead.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_timestamp_ltz
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  timezone - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- try_make_timestamp_ltz
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_timestamp_ntz
  Create local date-time from years, months, days, hours, mins, secs fields. If the configuration
  spark.sql.ansi.enabled
  is false, the function returns NULL on invalid inputs. Otherwise, it will throw an error instead.
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- try_make_timestamp_ntz
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  days - (undocumented)
  
  hours - (undocumented)
  
  mins - (undocumented)
  
  secs - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- make_ym_interval
  
  Parameters:
  
  years - (undocumented)
  
  months - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_ym_interval
  
  Parameters:
  
  years - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- make_ym_interval public static Column make_ym_interval
  ()
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- bucket
  
  Parameters:
  
  numBuckets - (undocumented)
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- bucket
  
  Parameters:
  
  numBuckets - (undocumented)
  
  e - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.0.0
- ifnull
  Returns
  col2
  if
  col1
  is null, or
  col1
  otherwise.
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- isnotnull
  Returns true if
  col
  is not null, or false otherwise.
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- equal_null
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- nullif
  Returns null if
  col1
  equals to
  col2
  , or
  col1
  otherwise.
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- nullifzero
  Returns null if
  col
  is equal to zero, or
  col
  otherwise.
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- nvl
  Returns
  col2
  if
  col1
  is null, or
  col1
  otherwise.
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- nvl2
  Returns
  col2
  if
  col1
  is not null, or
  col3
  otherwise.
  
  Parameters:
  
  col1 - (undocumented)
  
  col2 - (undocumented)
  
  col3 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- zeroifnull
  Returns zero if
  col
  is null, or
  col
  otherwise.
  
  Parameters:
  
  col - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  4.0.0
- udaf
  Obtains a
  UserDefinedFunction
  that wraps the given
  Aggregator
  so that it may be used with untyped Data Frames.
```
   val agg = // Aggregator[IN, BUF, OUT]

   // declare a UDF based on agg
   val aggUDF = udaf(agg)
   val aggData = df.agg(aggUDF($"colname"))

   // register agg as a named function
   spark.udf.register("myAggName", udaf(agg))
 
```
  Parameters:
  
  agg - the typed Aggregator
  
  evidence$3 - (undocumented)
  
  Returns:
  
  a UserDefinedFunction that can be used as an aggregating expression.
  
  Note:
  
  The input encoder is inferred from the input type IN.
- udaf
  Obtains a
  UserDefinedFunction
  that wraps the given
  Aggregator
  so that it may be used with untyped Data Frames.
```
   Aggregator<IN, BUF, OUT> agg = // custom Aggregator
   Encoder<IN> enc = // input encoder

   // declare a UDF based on agg
   UserDefinedFunction aggUDF = udaf(agg, enc)
   DataFrame aggData = df.agg(aggUDF($"colname"))

   // register agg as a named function
   spark.udf.register("myAggName", udaf(agg, enc))
 
```
  Parameters:
  
  agg - the typed Aggregator
  
  inputEncoder - a specific input encoder to use
  
  Returns:
  
  a UserDefinedFunction that can be used as an aggregating expression
  
  Note:
  
  This overloading takes an explicit input encoder, to support UDAF declarations in Java.
- udf public static <RT> UserDefinedFunction udf(scala.Function0<RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$4)
  Defines a Scala closure of 0 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$4 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1> UserDefinedFunction udf(scala.Function1<A1,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$5, scala.reflect.api.TypeTags.TypeTag<A1> evidence$6)
  Defines a Scala closure of 1 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$5 - (undocumented)
  
  evidence$6 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2> UserDefinedFunction udf(scala.Function2<A1,A2,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$7, scala.reflect.api.TypeTags.TypeTag<A1> evidence$8, scala.reflect.api.TypeTags.TypeTag<A2> evidence$9)
  Defines a Scala closure of 2 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$7 - (undocumented)
  
  evidence$8 - (undocumented)
  
  evidence$9 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3> UserDefinedFunction udf(scala.Function3<A1,A2,A3,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$10, scala.reflect.api.TypeTags.TypeTag<A1> evidence$11, scala.reflect.api.TypeTags.TypeTag<A2> evidence$12, scala.reflect.api.TypeTags.TypeTag<A3> evidence$13)
  Defines a Scala closure of 3 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$10 - (undocumented)
  
  evidence$11 - (undocumented)
  
  evidence$12 - (undocumented)
  
  evidence$13 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4> UserDefinedFunction udf(scala.Function4<A1,A2,A3,A4,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$14, scala.reflect.api.TypeTags.TypeTag<A1> evidence$15, scala.reflect.api.TypeTags.TypeTag<A2> evidence$16, scala.reflect.api.TypeTags.TypeTag<A3> evidence$17, scala.reflect.api.TypeTags.TypeTag<A4> evidence$18)
  Defines a Scala closure of 4 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$14 - (undocumented)
  
  evidence$15 - (undocumented)
  
  evidence$16 - (undocumented)
  
  evidence$17 - (undocumented)
  
  evidence$18 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5> UserDefinedFunction udf(scala.Function5<A1,A2,A3,A4,A5,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$19, scala.reflect.api.TypeTags.TypeTag<A1> evidence$20, scala.reflect.api.TypeTags.TypeTag<A2> evidence$21, scala.reflect.api.TypeTags.TypeTag<A3> evidence$22, scala.reflect.api.TypeTags.TypeTag<A4> evidence$23, scala.reflect.api.TypeTags.TypeTag<A5> evidence$24)
  Defines a Scala closure of 5 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$19 - (undocumented)
  
  evidence$20 - (undocumented)
  
  evidence$21 - (undocumented)
  
  evidence$22 - (undocumented)
  
  evidence$23 - (undocumented)
  
  evidence$24 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5, A6> UserDefinedFunction udf(scala.Function6<A1,A2,A3,A4,A5,A6,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$25, scala.reflect.api.TypeTags.TypeTag<A1> evidence$26, scala.reflect.api.TypeTags.TypeTag<A2> evidence$27, scala.reflect.api.TypeTags.TypeTag<A3> evidence$28, scala.reflect.api.TypeTags.TypeTag<A4> evidence$29, scala.reflect.api.TypeTags.TypeTag<A5> evidence$30, scala.reflect.api.TypeTags.TypeTag<A6> evidence$31)
  Defines a Scala closure of 6 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$25 - (undocumented)
  
  evidence$26 - (undocumented)
  
  evidence$27 - (undocumented)
  
  evidence$28 - (undocumented)
  
  evidence$29 - (undocumented)
  
  evidence$30 - (undocumented)
  
  evidence$31 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5, A6, A7> UserDefinedFunction udf(scala.Function7<A1,A2,A3,A4,A5,A6,A7,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$32, scala.reflect.api.TypeTags.TypeTag<A1> evidence$33, scala.reflect.api.TypeTags.TypeTag<A2> evidence$34, scala.reflect.api.TypeTags.TypeTag<A3> evidence$35, scala.reflect.api.TypeTags.TypeTag<A4> evidence$36, scala.reflect.api.TypeTags.TypeTag<A5> evidence$37, scala.reflect.api.TypeTags.TypeTag<A6> evidence$38, scala.reflect.api.TypeTags.TypeTag<A7> evidence$39)
  Defines a Scala closure of 7 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$32 - (undocumented)
  
  evidence$33 - (undocumented)
  
  evidence$34 - (undocumented)
  
  evidence$35 - (undocumented)
  
  evidence$36 - (undocumented)
  
  evidence$37 - (undocumented)
  
  evidence$38 - (undocumented)
  
  evidence$39 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5, A6, A7, A8> UserDefinedFunction udf(scala.Function8<A1,A2,A3,A4,A5,A6,A7,A8,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$40, scala.reflect.api.TypeTags.TypeTag<A1> evidence$41, scala.reflect.api.TypeTags.TypeTag<A2> evidence$42, scala.reflect.api.TypeTags.TypeTag<A3> evidence$43, scala.reflect.api.TypeTags.TypeTag<A4> evidence$44, scala.reflect.api.TypeTags.TypeTag<A5> evidence$45, scala.reflect.api.TypeTags.TypeTag<A6> evidence$46, scala.reflect.api.TypeTags.TypeTag<A7> evidence$47, scala.reflect.api.TypeTags.TypeTag<A8> evidence$48)
  Defines a Scala closure of 8 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$40 - (undocumented)
  
  evidence$41 - (undocumented)
  
  evidence$42 - (undocumented)
  
  evidence$43 - (undocumented)
  
  evidence$44 - (undocumented)
  
  evidence$45 - (undocumented)
  
  evidence$46 - (undocumented)
  
  evidence$47 - (undocumented)
  
  evidence$48 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5, A6, A7, A8, A9> UserDefinedFunction udf(scala.Function9<A1,A2,A3,A4,A5,A6,A7,A8,A9,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$49, scala.reflect.api.TypeTags.TypeTag<A1> evidence$50, scala.reflect.api.TypeTags.TypeTag<A2> evidence$51, scala.reflect.api.TypeTags.TypeTag<A3> evidence$52, scala.reflect.api.TypeTags.TypeTag<A4> evidence$53, scala.reflect.api.TypeTags.TypeTag<A5> evidence$54, scala.reflect.api.TypeTags.TypeTag<A6> evidence$55, scala.reflect.api.TypeTags.TypeTag<A7> evidence$56, scala.reflect.api.TypeTags.TypeTag<A8> evidence$57, scala.reflect.api.TypeTags.TypeTag<A9> evidence$58)
  Defines a Scala closure of 9 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$49 - (undocumented)
  
  evidence$50 - (undocumented)
  
  evidence$51 - (undocumented)
  
  evidence$52 - (undocumented)
  
  evidence$53 - (undocumented)
  
  evidence$54 - (undocumented)
  
  evidence$55 - (undocumented)
  
  evidence$56 - (undocumented)
  
  evidence$57 - (undocumented)
  
  evidence$58 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf public static <RT, A1, A2, A3, A4, A5, A6, A7, A8, A9, A10> UserDefinedFunction udf(scala.Function10<A1,A2,A3,A4,A5,A6,A7,A8,A9,A10,RT> f, scala.reflect.api.TypeTags.TypeTag<RT> evidence$59, scala.reflect.api.TypeTags.TypeTag<A1> evidence$60, scala.reflect.api.TypeTags.TypeTag<A2> evidence$61, scala.reflect.api.TypeTags.TypeTag<A3> evidence$62, scala.reflect.api.TypeTags.TypeTag<A4> evidence$63, scala.reflect.api.TypeTags.TypeTag<A5> evidence$64, scala.reflect.api.TypeTags.TypeTag<A6> evidence$65, scala.reflect.api.TypeTags.TypeTag<A7> evidence$66, scala.reflect.api.TypeTags.TypeTag<A8> evidence$67, scala.reflect.api.TypeTags.TypeTag<A9> evidence$68, scala.reflect.api.TypeTags.TypeTag<A10> evidence$69)
  Defines a Scala closure of 10 arguments as user-defined function (UDF). The data types are automatically inferred based on the Scala closure's signature. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  evidence$59 - (undocumented)
  
  evidence$60 - (undocumented)
  
  evidence$61 - (undocumented)
  
  evidence$62 - (undocumented)
  
  evidence$63 - (undocumented)
  
  evidence$64 - (undocumented)
  
  evidence$65 - (undocumented)
  
  evidence$66 - (undocumented)
  
  evidence$67 - (undocumented)
  
  evidence$68 - (undocumented)
  
  evidence$69 - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.3.0
- udf
  Defines a Java UDF0 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF1 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF2 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF3 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF4 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF5 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF6 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF7 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a Java UDF8 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf public static UserDefinedFunction udf(UDF9<?,?,?,?,?,?,?,?,?,?> f, DataType returnType)
  Defines a Java UDF9 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf public static UserDefinedFunction udf(UDF10<?,?,?,?,?,?,?,?,?,?,?> f, DataType returnType)
  Defines a Java UDF10 instance as user-defined function (UDF). The caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Parameters:
  
  f - (undocumented)
  
  returnType - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.3.0
- udf
  Defines a deterministic user-defined function (UDF) using a Scala closure. For this variant, the caller must specify the output data type, and there is no automatic input type coercion. By default the returned UDF is deterministic. To change it to nondeterministic, call the API
  UserDefinedFunction.asNondeterministic()
  .
  
  Note that, although the Scala closure can have primitive-type function argument, it doesn't work well with null values. Because the Scala closure is passed in as Any type, there is no type information for the function arguments. Without the type information, Spark may blindly pass null to the Scala closure with primitive-type argument, and the closure will see the default value of the Java type for the null argument, e.g. udf((x: Int) => x, IntegerType), the result is 0 for null input.
  
  Parameters:
  
  f - A closure in Scala
  
  dataType - The output data type of the UDF
  
  Returns:
  
  (undocumented)
  
  Since:
  
  2.0.0
- callUDF
  
  Parameters:
  
  udfName - (undocumented)
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  1.5.0
- call_udf public static Column call_udf(String udfName, scala.collection.immutable.Seq<Column> cols)
  Call an user-defined function. Example:
```
  import org.apache.spark.sql._

  val df = Seq(("id1", 1), ("id2", 4), ("id3", 5)).toDF("id", "value")
  val spark = df.sparkSession
  spark.udf.register("simpleUDF", (v: Int) => v * v)
  df.select($"id", call_udf("simpleUDF", $"value"))
 
```
  Parameters:
  
  udfName - (undocumented)
  
  cols - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.2.0
- call_function public static Column call_function(String funcName, scala.collection.immutable.Seq<Column> cols)
  
  Parameters:
  
  funcName - function name that follows the SQL identifier syntax (can be quoted, can be qualified)
  
  cols - the expression parameters of function
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.5.0
- unwrap_udt
  Unwrap UDT data type column into its underlying type.
  
  Parameters:
  
  column - (undocumented)
  
  Returns:
  
  (undocumented)
  
  Since:
  
  3.4.0

RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4