tfaddons (version 0.10.0)

layer_multi_head_attention: Keras-based multi head attention layer

Description

MultiHead Attention layer.

Usage

layer_multi_head_attention(
  object,
  head_size,
  num_heads,
  output_size = NULL,
  dropout = 0,
  use_projection_bias = TRUE,
  return_attn_coef = FALSE,
  kernel_initializer = "glorot_uniform",
  kernel_regularizer = NULL,
  kernel_constraint = NULL,
  bias_initializer = "zeros",
  bias_regularizer = NULL,
  bias_constraint = NULL,
  ...
)

Arguments

object

Model or layer object

head_size

int, dimensionality of the `query`, `key` and `value` tensors after the linear transformation.

num_heads

int, number of attention heads.

output_size

int, dimensionality of the output space; if `NULL`, the input dimension of `value` or `key` is used. Default: `NULL`.

dropout

float, `rate` parameter for the dropout layer that is applied to the attention coefficients after the softmax. Default: `0`.

use_projection_bias

bool, whether to use a bias term after the linear output projection.

return_attn_coef

bool, if `TRUE`, return the attention coefficients as an additional output (see the last example below).

kernel_initializer

initializer, initializer for the kernel weights.

kernel_regularizer

regularizer, regularizer for the kernel weights.

kernel_constraint

constraint, constraint for the kernel weights.

bias_initializer

initializer, initializer for the bias weights.

bias_regularizer

regularizer, regularizer for the bias weights.

bias_constraint

constraint, constraint for the bias weights.

...

additional parameters to pass to the layer.

Value

A tensor

Details

Defines the MultiHead Attention operation as described in [Attention Is All You Need](https://arxiv.org/abs/1706.03762), which takes `query`, `key` and `value` tensors and returns the dot-product attention between them.
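
In the notation of that paper, each head computes scaled dot-product attention over the linearly projected inputs, where `head_size` is the projection dimensionality:

Attention(Q, K, V) = softmax(Q K^T / sqrt(head_size)) V

The per-head results are concatenated and passed through a final linear projection of dimension `output_size` (or the input dimension of `value`/`key` when `output_size = NULL`).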

Examples

library(tensorflow)
library(tfaddons)

mha = layer_multi_head_attention(head_size=128, num_heads=128)
query = tf$random$uniform(list(32L, 20L, 200L)) # (batch_size, query_elements, query_depth)
key = tf$random$uniform(list(32L, 15L, 300L)) # (batch_size, key_elements, key_depth)
value = tf$random$uniform(list(32L, 15L, 400L)) # (batch_size, key_elements, value_depth)
attention = mha(list(query, key, value)) # (batch_size, query_elements, value_depth)

# If `value` is not given then internally `value = key` will be used:
mha = layer_multi_head_attention(head_size=128, num_heads=128)
query = tf$random$uniform(list(32L, 20L, 200L)) # (batch_size, query_elements, query_depth)
key = tf$random$uniform(list(32L, 15L, 300L)) # (batch_size, key_elements, key_depth)
attention = mha(list(query, key)) # (batch_size, query_elements, value_depth)
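
# The following sketch is not from the package documentation; it illustrates
# `return_attn_coef = TRUE`, assuming the layer then returns a list containing
# the attention output and the per-head attention coefficients:
mha = layer_multi_head_attention(head_size=128, num_heads=8, return_attn_coef=TRUE)
query = tf$random$uniform(list(32L, 20L, 200L)) # (batch_size, query_elements, query_depth)
key = tf$random$uniform(list(32L, 15L, 300L)) # (batch_size, key_elements, key_depth)
out = mha(list(query, key))
attention = out[[1]] # (batch_size, query_elements, value_depth)
attn_coef = out[[2]] # assumed shape: (batch_size, num_heads, query_elements, key_elements)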
