Quiz about DSBC - Attention Is All You Need

Question 1

For what RNN is used and achieve the best results?

Accepted Answer

Handwriting and speech recognition

Answer

Speech and image recognition

Answer

Handwriting and image recognition

Answer

Financial predictions

Question 2

What is the basic concept of Recurrent Neural Network?

Accepted Answer

Use previous inputs to find the next output according to the training set.

Answer

Use recurrent features from dataset to find the best answers.

Answer

Use a loop between inputs and outputs in order to achieve the better prediction.

Answer

Use loops between the most important features to predict next output.

Question 3

What architecture represents many-to-many RNNs ?

Accepted Answer

Option B is correct

Accepted Answer

Option C is correct

Answer

A.

Answer

D.

Question 4

The multi-head attention block is fed three matrices named the Values (V), the Keys (K) and the Query (Q). Which of the following statements is correct ?

Accepted Answer

V and K are outputted from the input embedding and Q from the output embedding.

Answer

Q and K are outputted from the input embedding and V from the output embedding.

Answer

V is outputted from the input embedding and K and Q from the output embedding.

Answer

Q is outputted from the input embedding and K and V from the output embedding.

Question 5

The formula summarizing the multi-head attention operations is:

Accepted Answer

$$Attention\left(Q,\ K,\ V\right)\ =\ soft\max\left(\frac{VK^T}{\sqrt[]{d_k}}\right)\ Q$$

Answer

$$Attention\left(Q,\ K,\ V\right)\ =\ soft\max\left(\frac{QK^T}{\sqrt[]{d_k}}\right)\ V$$

Answer

$$Attention\left(Q,\ K,\ V\right)\ =\ soft\max\left(\frac{QV^T}{\sqrt[]{d_k}}\right)\ K$$

Question 6

The self-attention mechanism is permutation invariant.

Accepted Answer

True

Answer

False

Question 7

Select the false statement

Accepted Answer

Using sinusoidal functions for positional embeddings allow large displacements of positional similarity/dissimilarity.

Answer

Every position should have the same identifier irrespectively of the sequence length.

Answer

Positional encodings have the same dimension as the embeddings.

Answer

There are many choices of positional encodings.

Question 8

We can give the self attention greater power of discrimination, by combining several self attention heads.

Accepted Answer

True

Answer

False

Question 9

The attention mechanism contains considerably more weights than a classic RNN.

Accepted Answer

True

Answer

False

Question 10

For a computer vision problem, select the correct statement(s):

Accepted Answer

Soft Attention is the global Attention where all image patches are given some weight.

Accepted Answer

In Hard Attention, only one image patch is considered at a time.

Answer

Hard Attention is the global Attention where all image patches are given some weight.

Answer

In Soft Attention, only one image patch is considered at a time.

DSBC - Attention Is All You Need

Create a free account and access millions of resources

Similar Resources on Wayground

Popular Resources on Wayground

Discover more resources for Mathematics