tf.raw_ops.QuantizedMatMul

Perform a quantized matrix multiplication of a by the matrix b.

The inputs must be two-dimensional matrices and the inner dimension of a (after being transposed if transpose_a is non-zero) must match the outer dimension of b (after being transposed if transposed_b is non-zero).

Args
a A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16. Must be a two-dimensional tensor.
b A Tensor. Must be one of the following types: qint8, quint8, qint32, qint16, quint16. Must be a two-dimensional tensor.
min_a A Tensor of type float32. The float value that the lowest quantized a value represents.
max_a A Tensor of type float32. The float value that the highest quantized a value represents.
min_b A Tensor of type float32. The float value that the lowest quantized b value represents.
max_b A Tensor of type float32. The float value that the highest quantized b value represents.
Toutput An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16. Defaults to tf.qint32.
transpose_a An optional bool. Defaults to False. If true, a is transposed before multiplication.
transpose_b An optional bool. Defaults to False. If true, b is transposed before multiplication.
Tactivation An optional tf.DType from: tf.qint8, tf.quint8, tf.qint32, tf.qint16, tf.quint16. Defaults to tf.quint8. The type of output produced by activation function following this operation.
name A name for the operation (optional).
Returns
A tuple of Tensor objects (out, min_out, max_out).
out A Tensor of type Toutput.
min_out A Tensor of type float32.
max_out A Tensor of type float32.

© 2020 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.4/api_docs/python/tf/raw_ops/QuantizedMatMul