TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?

TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?