ST311 - ST311 ghostwriting - Assignment 4 | 学霸联盟

Date: 2023-03-30

ST 311 Assignment 4 (theory part)
Due by 5pm, 4 April, 2023
Candidate number:
Instructions: Attempt all questions. The total is 50 marks.
1. Use pseudocode to summarize the minibatch stochastic gradient descent algorithms
for generative adversarial networks (GAN) and Wasserstein GAN. Discuss the main
differences between the two algorithms. [24 marks]
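As a point of orientation only, the per-minibatch quantities that the two algorithms evaluate for the discriminator (GAN) or critic (WGAN) can be sketched in plain Python. This is a minimal sketch, not a full training loop: the function names are hypothetical, the inputs are assumed to be already-computed network outputs on a minibatch, and the clipping threshold `c=0.01` is an assumed default.

```python
import math

def gan_discriminator_loss(d_real, d_fake):
    """Original GAN minibatch loss for the discriminator (to be minimised).

    d_real, d_fake: discriminator outputs in (0, 1), read as probabilities
    that a sample is real.
    """
    real_term = sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return -(real_term + fake_term)

def wgan_critic_loss(f_real, f_fake):
    """WGAN minibatch loss for the critic (to be minimised).

    f_real, f_fake: unbounded real-valued critic scores (no sigmoid, no log).
    """
    return -(sum(f_real) / len(f_real) - sum(f_fake) / len(f_fake))

def clip_weights(weights, c=0.01):
    """WGAN weight clipping: force every critic parameter into [-c, c]."""
    return [max(-c, min(c, w)) for w in weights]

# Hypothetical minibatch outputs, for illustration only.
print(gan_discriminator_loss([0.9, 0.8], [0.2, 0.1]))
print(wgan_critic_loss([1.0, 3.0], [0.0, 1.0]))
print(clip_weights([0.5, -0.2, 0.005]))
```

The sketch covers only the loss evaluation and the clipping step; the gradient updates and the number of critic steps per generator step are left out.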
2. Recall that the Bellman equation for a Markov reward process (MRP) states that
$$V(s) = \mathbb{E}[G_t \mid S_t = s] = \mathbb{E}[R_{t+1} + \gamma G_{t+1} \mid S_t = s] = \mathbb{E}[R_{t+1} + \gamma V(S_{t+1}) \mid S_t = s].$$
(a) Suppose X, Y, Z are random variables. Please show that
$$\mathbb{E}[\mathbb{E}[X \mid Y, Z] \mid Y = y] = \mathbb{E}[X \mid Y = y]. \tag{1}$$
[13 marks]
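A numerical sanity check of identity (1) on a small discrete example may help fix ideas before attempting the proof. The sketch below assumes a hypothetical joint distribution over binary X, Y, Z (the weights are arbitrary) and compares both sides of (1). It illustrates the identity and is not a substitute for the proof.

```python
from itertools import product

# Hypothetical joint pmf over (x, y, z) in {0, 1}^3; arbitrary positive weights.
weights = [1, 2, 3, 4, 5, 6, 7, 8]
norm = sum(weights)
pmf = {xyz: w / norm for xyz, w in zip(product([0, 1], repeat=3), weights)}

def prob(y, z=None):
    """P(Y = y) if z is None, otherwise P(Y = y, Z = z)."""
    return sum(p for (_, yy, zz), p in pmf.items()
               if yy == y and (z is None or zz == z))

def cond_exp_x_given_yz(y, z):
    """E[X | Y = y, Z = z]."""
    num = sum(x * p for (x, yy, zz), p in pmf.items() if yy == y and zz == z)
    return num / prob(y, z)

def cond_exp_x_given_y(y):
    """Right-hand side of (1): E[X | Y = y]."""
    num = sum(x * p for (x, yy, _), p in pmf.items() if yy == y)
    return num / prob(y)

def tower_lhs(y):
    """Left-hand side of (1): sum over z of E[X | Y=y, Z=z] * P(Z=z | Y=y)."""
    return sum(cond_exp_x_given_yz(y, z) * prob(y, z) / prob(y) for z in (0, 1))

for y in (0, 1):
    print(y, tower_lhs(y), cond_exp_x_given_y(y))
```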
(b) With the help of (1), please prove that the Bellman equation holds for MRPs.
Hint: you may also need to use the Markov property of MRPs, i.e. ‘the future is
independent of the past given the present.’ [13 marks]
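The Bellman equation in Question 2 can also be checked numerically on a toy MRP. The following sketch assumes a hypothetical two-state chain with transition matrix `P`, expected immediate rewards `R[s]` standing in for E[R_{t+1} | S_t = s], and discount `gamma = 0.9`. It iterates the Bellman backup until the values settle and then measures how well the fixed point satisfies V(s) = R(s) + γ Σ P(s, s') V(s').

```python
def bellman_backup(V, P, R, gamma):
    """One application of V(s) <- R(s) + gamma * sum_s' P[s][s'] * V(s')."""
    n = len(R)
    return [R[s] + gamma * sum(P[s][t] * V[t] for t in range(n))
            for s in range(n)]

def evaluate_mrp(P, R, gamma, sweeps=1000):
    """Approximate the value function by repeated Bellman backups."""
    V = [0.0] * len(R)
    for _ in range(sweeps):
        V = bellman_backup(V, P, R, gamma)
    return V

# Hypothetical two-state MRP, for illustration only.
P = [[0.9, 0.1],
     [0.5, 0.5]]
R = [1.0, -1.0]
gamma = 0.9

V = evaluate_mrp(P, R, gamma)
backed_up = bellman_backup(V, P, R, gamma)
residual = max(abs(a - b) for a, b in zip(V, backed_up))
print(V, residual)
```

For this particular chain the linear system (I - γP)V = R can be solved by hand and gives V = (7.1875, 4.0625), which the iteration should reproduce up to floating-point error.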
