iGET

Deep Learning - MCQ Practice Questions

Neural networks, CNN, RNN, activation functions & backpropagation.

10 questions | 100% Free

Q.1Medium

In a neural network, the vanishing gradient problem is MOST commonly associated with which activation function?

Q.2Medium

Which of the following best describes the role of the 'stride' parameter in a Convolutional Neural Network (CNN)?

Q.3Medium

In the context of Batch Normalization, what is normalized during the forward pass of training?

Q.4Medium

What is the primary purpose of the 'dropout' regularization technique in deep learning?

Q.5Medium

Which optimizer uses both the first moment (mean) and the second moment (uncentered variance) of gradients to adapt the learning rate for each parameter?

Q.6Medium

In an LSTM (Long Short-Term Memory) network, which gate is responsible for deciding what information to discard from the cell state?

Q.7Medium

The output size of a convolutional layer is given by , where is the input size, is padding, is filter size, and is stride. For an input of size , filter , padding , and stride , what is the output size?

Q.8Medium

Which of the following is the key architectural difference that distinguishes a Transformer model from a traditional RNN?

Q.9Medium

In transfer learning for deep neural networks, which approach is most commonly used when the new dataset is small but similar to the original dataset?

Q.10Medium

The cross-entropy loss for a multi-class classification problem with classes is defined as . If the true label is class 2 (one-hot: ) and the predicted probabilities are , what is the loss?