We are given two data points with 2 different timestamps

Be Prepared For The Toughest Questions

Practice Problems

We are given two data points with 2 different timestamps.

At the timestamp t = 1, we have a data point (x1, x2, y) where (x1, x2) = (0.3, 0.6) and y = 0.2.

At the timestamp t = 2, we have a data point (x1, x2, y) where (x1, x2) = (0.1, 1.0) and y = 0.4.

Here, x1 and x2 are 2 input variables. y is the output variable.

(a) Consider the traditional LSTM model. Initially, we have the following internal weight vectors and bias variables as follows.

In the model, we have the following status variables. For each t = 1, 2, ….

1. forget gate variable ft

2. input gate variable it

3. input activation variable at

4. internal state variable st

5. output gate variable ot

6. final output variable yt

Suppose that y0 = 0 and s0 = 0.

Consider the input forward propagation step only.

(i) What are the values of the above status variables when t = 1 and when t = 2? Please show each answer up to 4 decimal places.

(ii) What are the errors of the final output variables when t = 1 and when t = 2? Please show each answer up to 4 decimal places.

(b) Consider the GRU model. Initially, we have the following internal weight vectors and bias variables as follows.

In the model, we have the following status variables. For each t = 1, 2, ….

1. reset gate variable rt

2. input activation variable at

3. update gate variable ut

4. final output variable yt

Suppose that y0 = 0.

Consider the input forward propagation step only.

(i) What are the values of the above status variables when t = 1 and when t = 2? Please show each answer up to 4 decimal places.

(ii) What are the errors of the final output variables when t = 1 and when t = 2? Please show each answer up to 4 decimal places.

(c) What is the major disadvantage of the traditional neural network model compared with the recurrent neural network model?

Hint

ManagementA GRU or a gated recurrent unit is part of a specific model of recurrent neural network which intends to use connections through a sequence of nodes to perform machine learning tasks associated with memory and clustering, like in speech recognition. These gated recurrent units helps to adjust neural network input weights to solve the vanishing gradient problem which is a common issue wit...

Select Deadline for Completion

4 Days

3 Days

2 Days

1 Day

1 to 15 Hours

Know the process

Students succeed in their courses by connecting and communicating with
an expert until they receive help on their questions

Unable to find what you’re looking for?

Consult our trusted tutors.

Ask a Question

Be Prepared For The Toughest Questions

Practice Problems

Related questions

Know the process

Submit Question

Tutor Is Assigned

Receive Help

Unable to find what you’re looking for?