Function Description
aes_decrypt(expr, key[, mode[, padding[, aad]]]) Returns a decrypted value of `expr` using AES in `mode` with `padding`. Key lengths of 16, 24 and 32 bits are supported. Supported combinations of (`mode`, `padding`) are ('ECB', 'PKCS'), ('GCM', 'NONE') and ('CBC', 'PKCS'). Optional additional authenticated data (AAD) is only supported for GCM. If provided for encryption, the identical AAD value must be provided for decryption. The default mode is GCM.
aes_encrypt(expr, key[, mode[, padding[, iv[, aad]]]]) Returns an encrypted value of `expr` using AES in given `mode` with the specified `padding`. Key lengths of 16, 24 and 32 bits are supported. Supported combinations of (`mode`, `padding`) are ('ECB', 'PKCS'), ('GCM', 'NONE') and ('CBC', 'PKCS'). Optional initialization vectors (IVs) are only supported for CBC and GCM modes. These must be 16 bytes for CBC and 12 bytes for GCM. If not provided, a random vector will be generated and prepended to the output. Optional additional authenticated data (AAD) is only supported for GCM. If provided for encryption, the identical AAD value must be provided for decryption. The default mode is GCM.
assert_true(expr [, message]) Throws an exception if `expr` is not true.
bitmap_bit_position(child) Returns the bit position for the given input child expression.
bitmap_bucket_number(child) Returns the bucket number for the given input child expression.
bitmap_count(child) Returns the number of set bits in the child bitmap.
current_catalog() Returns the current catalog.
current_database() Returns the current database.
current_schema() Returns the current database.
current_user() user name of current execution context.
hll_sketch_estimate(expr) Returns the estimated number of unique values given the binary representation of a Datasketches HllSketch.
hll_union(first, second, allowDifferentLgConfigK) Merges two binary representations of Datasketches HllSketch objects, using a Datasketches Union object. Set allowDifferentLgConfigK to true to allow unions of sketches with different lgConfigK values (defaults to false).
input_file_block_length() Returns the length of the block being read, or -1 if not available.
input_file_block_start() Returns the start offset of the block being read, or -1 if not available.
input_file_name() Returns the name of the file being read, or empty string if not available.
java_method(class, method[, arg1[, arg2 ..]]) Calls a method with reflection.
monotonically_increasing_id() Returns monotonically increasing 64-bit integers. The generated ID is guaranteed to be monotonically increasing and unique, but not consecutive. The current implementation puts the partition ID in the upper 31 bits, and the lower 33 bits represent the record number within each partition. The assumption is that the data frame has less than 1 billion partitions, and each partition has less than 8 billion records. The function is non-deterministic because its result depends on partition IDs.
reflect(class, method[, arg1[, arg2 ..]]) Calls a method with reflection.
session_user() user name of current execution context.
spark_partition_id() Returns the current partition id.
try_aes_decrypt(expr, key[, mode[, padding[, aad]]]) This is a special version of `aes_decrypt` that performs the same operation, but returns a NULL value instead of raising an error if the decryption cannot be performed.
try_reflect(class, method[, arg1[, arg2 ..]]) This is a special version of `reflect` that performs the same operation, but returns a NULL value instead of raising an error if the invoke method thrown exception.
typeof(expr) Return DDL-formatted type string for the data type of the input.
user() user name of current execution context.
uuid() Returns an universally unique identifier (UUID) string. The value is returned as a canonical UUID 36-character string.
version() Returns the Spark version. The string contains 2 fields, the first being a release version and the second being a git revision.