tf.raw_ops.StringSplitV2

Split elements of source based on sep into a SparseTensor.

Let N be the size of source (typically N will be the batch size). Split each element of source based on sep and return a SparseTensor containing the split tokens. Empty tokens are ignored.

For example, N = 2, source[0] is 'hello world' and source[1] is 'a b c', then the output will be

st.indices = [0, 0;
              0, 1;
              1, 0;
              1, 1;
              1, 2]
st.shape = [2, 3]
st.values = ['hello', 'world', 'a', 'b', 'c']

If sep is given, consecutive delimiters are not grouped together and are deemed to delimit empty strings. For example, source of "1<>2<><>3" and sep of "<>" returns ["1", "2", "", "3"]. If sep is None or an empty string, consecutive whitespace are regarded as a single separator, and the result will contain no empty strings at the startor end if the string has leading or trailing whitespace.

Note that the above mentioned behavior matches python's str.split.

Args
input A Tensor of type string. 1-D string Tensor, the strings to split.
sep A Tensor of type string. 0-D string Tensor, the delimiter character.
maxsplit An optional int. Defaults to -1. An int. If maxsplit > 0, limit of the split of the result.
name A name for the operation (optional).
Returns
A tuple of Tensor objects (indices, values, shape).
indices A Tensor of type int64.
values A Tensor of type string.
shape A Tensor of type int64.

© 2020 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.4/api_docs/python/tf/raw_ops/StringSplitV2