S04 - Speech Recognition (GSLC)

S04 - Speech Recognition (GSLC)

University

10 Qs

quiz-placeholder

Similar activities

Computer Languages

Computer Languages

KG - University

10 Qs

Machine Learning

Machine Learning

12th Grade - University

10 Qs

Machine Learning Quiz

Machine Learning Quiz

University

10 Qs

Cisco 3, Module 1

Cisco 3, Module 1

University

15 Qs

QIS CVML TEST1

QIS CVML TEST1

University

10 Qs

ROBLOX Stepford County Railway

ROBLOX Stepford County Railway

2nd Grade - Professional Development

9 Qs

AIML

AIML

University

15 Qs

Protocolo OSPF

Protocolo OSPF

University

12 Qs

S04 - Speech Recognition (GSLC)

S04 - Speech Recognition (GSLC)

Assessment

Quiz

Computers

University

Easy

Created by

Amalia Zahra

Used 6+ times

FREE Resource

10 questions

Show all answers

1.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the main purpose of wav2vec models?

Text-to-Speech (TTS) conversion

Self-supervised learning for speech recognition

Machine Translation

Image classification

2.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the primary difference between wav2vec and wav2vec 2.0?

wav2vec 2.0 uses CNNs, while wav2vec uses Transformers

wav2vec 2.0 removes dependency on phoneme-based features

wav2vec is multilingual, while wav2vec 2.0 is not

wav2vec 2.0 is only used for speaker recognition

3.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

Which component of wav2vec 2.0 is responsible for capturing long-range speech dependencies?

Feature Encoder

Transformer Context Network

Quantization Module

Spectrogram Processor

4.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the function of the Quantization Module in wav2vec 2.0?

Converts audio into text

Discretizes speech representations into codebook embeddings

Removes noise from speech signals

Translates speech into multiple languages

5.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What is the main advantage of XLSR over wav2vec 2.0?

It is trained on multiple languages

It does not use self-supervised learning

It does not require any training

It is only used for speaker verification

6.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

What does the “53” in XLSR-53 refer to?

The number of Transformer layers

The number of audio features extracted

The number of languages used for training

The number of training datasets

7.

MULTIPLE CHOICE QUESTION

20 sec • 10 pts

Which model is best suited for low-resource languages?

wav2vec

wav2vec 2.0

XLSR-53

X-LSR

Create a free account and access millions of resources

Create resources
Host any resource
Get auto-graded reports
or continue with
Microsoft
Apple
Others
By signing up, you agree to our Terms of Service & Privacy Policy
Already have an account?