Policy Gradient with Kernel Quadrature