Minimum questions
This section presents the questions you need to be able to answer in order to follow the course.
1. Is it possible to multiply a vector by a vector?
Yes, there are several ways to do it: for example, the scalar (dot) product and the vector (cross) product.
No, you have to multiply a vector by a matrix or a number.
No, it works differently for vectors.
Yes, we multiply each component of one vector by every component of the other; this is called the scalar product.
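For self-checking, here is a minimal NumPy sketch (with arbitrarily chosen 3-D vectors) illustrating both kinds of vector-by-vector products:

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])

# Scalar (dot) product: a single number.
print(np.dot(a, b))    # 1*4 + 2*5 + 3*6 = 32.0

# Vector (cross) product: another 3-D vector.
print(np.cross(a, b))  # [-3.  6. -3.]
```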
2. Can the norm of a matrix be zero?
Yes
No
3. What is the derivative of the function ?
There is no way to take the derivative.
4. What is the antiderivative (indefinite integral) of the function ?
It is not possible to compute the antiderivative.
5. What is the scalar product of vectors and ?
It cannot be calculated.
6. How do you calculate the determinant of a diagonal matrix?
Add up all the diagonal elements
Multiply all the diagonal elements
It is zero
The determinant of such a matrix is equal to the matrix itself.
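As a quick sanity check, a small NumPy sketch (the diagonal entries are arbitrary) comparing the product of the diagonal with the library's determinant:

```python
import numpy as np

D = np.diag([2.0, 3.0, 4.0])  # an arbitrary diagonal matrix

# For a diagonal matrix, the determinant is the product of the diagonal.
print(np.prod(np.diag(D)))  # 24.0
print(np.linalg.det(D))     # 24.0, up to floating-point error
```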
Substantive course questions
If you confidently know the answers to most of the questions below, the course will probably be too easy for you.
7. Is the function convex?
Yes
No
8. Is the set of symmetric positive definite square matrices convex?
Yes
No
9. What is the subgradient of the function at the point ?
The function is not differentiable at this point, so there is no subgradient.
Any number in the interval
Any number in the interval
None of the answers are correct.
What is a subgradient? (don't know)
Any number in the interval
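The formula for this question was lost during extraction, so here is only a reminder of the concept, using the classic example f(x) = |x| (not necessarily the quiz's function):

```latex
f(x) = |x|, \qquad
\partial f(0) = \{\, g \in \mathbb{R} : |y| \ge g \cdot y \ \ \forall y \in \mathbb{R} \,\} = [-1, 1].
```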
10. You are training a neural network to classify images. The training set contains 10000 examples, and the batch size is 100. How many epochs will you complete if you perform 1000 iterations of stochastic gradient descent?
1
10
100
1000
10000
Epoch? (don't know)
There is no correct answer choice
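For reference, the arithmetic behind this question can be checked with a few lines of Python (one epoch = one full pass over the training set):

```python
dataset_size = 10_000
batch_size = 100
iterations = 1_000

# Each iteration processes one batch of examples.
samples_seen = iterations * batch_size  # 100,000 samples
epochs = samples_seen / dataset_size
print(epochs)                           # 10.0
```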
11. Logistic regression is a method for solving the problem of:
Classification
Regression
Clustering
12. Suppose a solution to the linear programming problem exists. In the worst case, the simplex method:
Does not converge
Converges in polynomial time
Converges in exponential time
13. Is the problem of optimizing ResNet neural network weights convex?
Yes
No
There is not enough data in the problem
14. When optimizing with a stochastic gradient method, it is a good idea to:
Decrease the learning rate over time
Increase the learning rate over time
Do not change the learning rate
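As a reminder of how step-size schedules are usually set up, here is a minimal sketch of a common 1/t-style decay (the constants are arbitrary):

```python
def learning_rate(lr0: float, step: int, decay: float = 0.01) -> float:
    """A common 1/t-style schedule; the constants here are illustrative."""
    return lr0 / (1.0 + decay * step)

# The step size shrinks over time, the classical condition under which
# stochastic gradient descent converges despite noisy gradient estimates.
for t in (0, 100, 1000):
    print(t, learning_rate(0.1, t))
```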
15. Is the statement "Adding Tikhonov regularization to a convex function makes the function strongly convex" true?
Yes
No
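For reference, the statement is usually written as follows (assuming a regularization weight λ > 0, a detail the question leaves implicit):

```latex
g(x) = f(x) + \frac{\lambda}{2}\,\lVert x \rVert_2^2, \qquad
g \ \text{is } \lambda\text{-strongly convex}
\;\Longleftrightarrow\;
g(x) - \frac{\lambda}{2}\,\lVert x \rVert_2^2 = f(x) \ \text{is convex.}
```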
16. Find the minimal Lipschitz constant of the function , where is a vector of dimension , is a real matrix of dimension , is a vector of dimension .
The function is not Lipschitz continuous.
17. Is it true that Newton's method will converge for a convex function if you run it from any point in space?
Yes
No
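A hedged sketch of a classic counterexample (not taken from the course materials): f(x) = √(1 + x²) is convex on all of ℝ, yet pure Newton's method diverges from any starting point with |x₀| > 1:

```python
import numpy as np

# For f(x) = sqrt(1 + x^2), the Newton update simplifies algebraically
# to x_{k+1} = -x_k**3, so the iterates blow up whenever |x0| > 1.
def newton_step(x):
    fp = x / np.sqrt(1.0 + x**2)   # f'(x)
    fpp = (1.0 + x**2) ** (-1.5)   # f''(x)
    return x - fp / fpp            # equals -x**3

x = 1.1
for _ in range(5):
    x = newton_step(x)
    print(x)                       # magnitude grows without bound
```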
18. Suppose computing the loss function value of your neural network (forward pass) takes time . Approximately how long does it take to compute the gradients with respect to the weights (backward pass)?
19. Is the following statement true: Nesterov momentum and Polyak momentum accelerate gradient descent equally for a convex function with a Lipschitz-continuous gradient, in terms of the convergence rate (up to a constant factor)?
Yes
No