TensorFlow

API

 tf.strings / unicode_decode_with_offsets


Determine the script codes of a given tensor of Unicode integer code points.

Used in the notebooks

Used in the tutorials

This operation converts Unicode code points to script codes corresponding to each code point. Script codes correspond to International Components for Unicode (ICU) UScriptCode values.

See ICU project docs for more details on script codes.

For an example, see the unicode strings guide on unicode scripts.

Returns -1 (USCRIPT_INVALID_CODE) for invalid codepoints. Output shape will match input shape.

Examples:

tf.strings.unicode_script([1, 31, 38])
<tf.Tensor: shape=(3,), dtype=int32, numpy=array([0, 0, 0], dtype=int32)>

input A Tensor of type int32. A Tensor of int32 Unicode code points.
name A name for the operation (optional).

A Tensor of type int32.

此页内容是否对您有帮助