Keywords: Q-learning, Mach-Zehnder interferometer, Fidelity.

Questions

  1. Section III: What differentiates the Artificial Neural Network used to calculate future return from the target network? It seems like the target network focuses on learning the immediate reward, whereas the original neural network calculates the expected return from the full trajectory.
  2. Section IV. A: How do we measure that the third excited Bloch state provides the best trade-off between large momentum splitting and high-frequency components?

Time evolution

  1. The Hamiltonian is described by .
  2. The solution to this is given by Schrodinger Equation .
  3. Letting be the Fourier Transform of , we convert the original Differential Equation into .
  4. Writing it as , we use Integrating Factors to get the solution.