Automatic Differentiation

Automatic differentiation (AD) has emerged as a key technique in computational science, enabling exact and efficient computation of derivatives for functions defined by code. Unlike symbolic differentiation, which may produce complex and inefficient expressions, or finite-difference methods, which suffer from numerical instability and poor scalability, AD leverages the chain rule at the level of elementary operations to provide machine-precision gradients with minimal overhead.
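To make the idea concrete, here is a minimal, purely illustrative sketch of forward-mode AD built on dual numbers; real engines such as ForwardDiff.jl implement the same principle far more generally and efficiently:

julia
# Illustrative dual numbers: carry a value and its derivative together,
# applying the chain rule at each elementary operation.
struct Dual
    val::Float64  # function value
    der::Float64  # derivative value
end

Base.:+(x::Dual, y::Dual) = Dual(x.val + y.val, x.der + y.der)
Base.:*(x::Dual, y::Dual) = Dual(x.val * y.val, x.der * y.val + x.val * y.der)
Base.sin(x::Dual) = Dual(sin(x.val), cos(x.val) * x.der)

# d/dx [sin(x) * x + x] at x = 2, seeding the input derivative with 1
x = Dual(2.0, 1.0)
y = sin(x) * x + x
y.der ≈ cos(2.0) * 2.0 + sin(2.0) + 1.0  # true, exact to machine precision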

In QuantumToolbox.jl, we have introduced preliminary support for automatic differentiation, allowing users to compute gradients of observables or cost functionals involving the time evolution of open quantum systems. Although QuantumToolbox.jl was not originally designed with AD in mind, its architecture, rooted in Julia's multiple dispatch and generic programming model, meant that many core functions were already compatible with AD engines such as Zygote.jl, Enzyme.jl, or ForwardDiff.jl out of the box.

Experimental Functionality

At present, this functionality is considered experimental and not all parts of the library are AD-compatible. Here we provide a brief overview of the current state of AD support in QuantumToolbox.jl and how to use it.

Forward versus Reverse Mode AD

Automatic differentiation can be broadly categorized into two modes: forward mode and reverse mode. The choice between these modes depends on the nature of the function being differentiated and the number of inputs and outputs:

  • Forward Mode AD: This mode is particularly efficient for functions with many outputs and few inputs. It works by propagating derivatives from the inputs through the computational graph to the outputs. Forward mode is often preferred when the number of input variables is small, since a single forward pass computes the derivative of every output with respect to one input, so the cost scales with the number of inputs.

  • Reverse Mode AD: In contrast, reverse mode is more efficient for functions with many inputs and few outputs. It operates by first computing the function's output and then propagating derivatives backward through the computational graph. A single backward pass yields the derivative of one output with respect to all inputs, which is why this mode is the standard choice in machine learning and optimization, where a scalar loss function (one output) depends on a large number of parameters (inputs).

Understanding the differences between these two modes can help users choose the most appropriate approach for their specific use case in QuantumToolbox.jl.
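As a quick illustration of this trade-off, the toy example below (arbitrary function and values, purely for demonstration) computes the same gradient with both modes: reverse mode obtains it in a single backward pass, while forward mode propagates derivative seeds for the inputs through the computation:

julia
using ForwardDiff
using Zygote

# A scalar function of several inputs (toy example)
f(x) = sum(abs2, x) + prod(x)

x0 = [1.0, 2.0, 3.0]

# Forward mode: propagates derivative seeds for the inputs forward
ForwardDiff.gradient(f, x0)  # [8.0, 7.0, 8.0]

# Reverse mode: one forward evaluation plus one backward pass
Zygote.gradient(f, x0)[1]    # [8.0, 7.0, 8.0]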

Differentiate the master equation

One of the primary use cases for automatic differentiation in QuantumToolbox.jl is the differentiation of the master equation. The master equation describes the time evolution of a quantum system's density matrix under the influence of non-unitary dynamics, such as dissipation and decoherence. Let's consider a set of parameters $\mathbf{p} = (p_1, p_2, \ldots, p_n)$ that influence the system's dynamics. The Hamiltonian and the dissipators will depend on these parameters:

$$\hat{H} = \hat{H}(\mathbf{p}), \qquad \hat{L}_j = \hat{L}_j(\mathbf{p}).$$

Hence, the density matrix will evolve according to the master equation

$$\frac{d \hat{\rho}(\mathbf{p}, t)}{dt} = -i \left[ \hat{H}(\mathbf{p}), \hat{\rho}(\mathbf{p}, t) \right] + \sum_j \hat{L}_j(\mathbf{p}) \hat{\rho}(\mathbf{p}, t) \hat{L}_j^\dagger(\mathbf{p}) - \frac{1}{2} \left\{ \hat{L}_j^\dagger(\mathbf{p}) \hat{L}_j(\mathbf{p}), \hat{\rho}(\mathbf{p}, t) \right\}, \tag{1}$$

which depends on the parameters $\mathbf{p}$ and time $t$.

We now want to compute the expectation value of an observable $\hat{O}$ at time $t$:

$$\langle \hat{O} \rangle (\mathbf{p}, t) = \mathrm{Tr} \left[ \hat{O} \hat{\rho}(\mathbf{p}, t) \right],$$

which will also depend on the parameters $\mathbf{p}$ and time $t$.

Our goal is to compute the derivative of the expectation value with respect to the parameters:

$$\frac{\partial \langle \hat{O} \rangle (\mathbf{p}, t)}{\partial p_j} = \frac{\partial}{\partial p_j} \mathrm{Tr} \left[ \hat{O} \hat{\rho}(\mathbf{p}, t) \right],$$

and to achieve this, we can use an AD engine like ForwardDiff.jl (forward mode) or Zygote.jl (reverse mode).

Let's apply this to a simple example of a driven-dissipative quantum harmonic oscillator. The Hamiltonian in the drive frame is given by

$$\hat{H} = \Delta \hat{a}^\dagger \hat{a} + F (\hat{a} + \hat{a}^\dagger),$$

where $\Delta = \omega_0 - \omega_d$ is the cavity-drive detuning, $F$ is the drive strength, and $\hat{a}$ and $\hat{a}^\dagger$ are the annihilation and creation operators, respectively. The system is subject to a single dissipative channel with the Lindblad operator $\hat{L} = \sqrt{\gamma} \hat{a}$, where $\gamma$ is the dissipation rate. Starting from the ground state $\hat{\rho}(0) = |0\rangle\langle 0|$, the system evolves according to the master equation in Eq. (1).

We now want to study the number of photons at the steady state and how it varies with $\mathbf{p} = (\Delta, F, \gamma)$, namely $\nabla_{\mathbf{p}} \langle \hat{a}^\dagger \hat{a} \rangle (\mathbf{p}, t)$ in the long-time limit. We can extract an analytical expression in order to verify the correctness of the AD implementation:

$$\langle \hat{a}^\dagger \hat{a} \rangle_{\mathrm{ss}} = \frac{F^2}{\Delta^2 + \frac{\gamma^2}{4}},$$

with the gradient given by

$$\nabla_{\mathbf{p}} \langle \hat{a}^\dagger \hat{a} \rangle_{\mathrm{ss}} = \begin{pmatrix} -\dfrac{2 F^2 \Delta}{\left( \Delta^2 + \frac{\gamma^2}{4} \right)^2} \\[2ex] \dfrac{2 F}{\Delta^2 + \frac{\gamma^2}{4}} \\[2ex] -\dfrac{F^2 \gamma}{2 \left( \Delta^2 + \frac{\gamma^2}{4} \right)^2} \end{pmatrix}.$$
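For reference, these expressions follow from the equation of motion of the coherent amplitude of this linear system,

$$\frac{d \langle \hat{a} \rangle}{dt} = -\left( i \Delta + \frac{\gamma}{2} \right) \langle \hat{a} \rangle - i F \quad \Longrightarrow \quad \langle \hat{a} \rangle_{\mathrm{ss}} = \frac{-F}{\Delta - i \gamma / 2},$$

together with the fact that, starting from the vacuum, the state of this linearly driven system remains coherent, so $\langle \hat{a}^\dagger \hat{a} \rangle_{\mathrm{ss}} = |\langle \hat{a} \rangle_{\mathrm{ss}}|^2$; differentiating with respect to each parameter yields the gradient above.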

Although QuantumToolbox.jl has the steadystate function to directly compute the steady state without explicitly solving the master equation, here we use the mesolve function to integrate up to a long time $t_{\mathrm{max}}$, and then compute the expectation value of the number operator. We will demonstrate how to compute the gradient using both ForwardDiff.jl and Zygote.jl.

Forward Mode AD with ForwardDiff.jl

We start by importing ForwardDiff.jl and defining the parameters and operators:

julia
using QuantumToolbox
using ForwardDiff

const N = 20
const a = destroy(N)
const ψ0 = fock(N, 0)
const t_max = 40
const tlist = range(0, t_max, 100)
0.0:0.40404040404040403:40.0

Then, we define a function that takes the parameters p as input and returns the expectation value of the number operator at t_max. We also define the analytical solution of the steady-state photon number and its gradient for comparison:

julia
function my_f_mesolve_direct(p)
    H = p[1] * a' * a + p[2] * (a + a')
    c_ops = [sqrt(p[3]) * a]
    sol = mesolve(H, ψ0, tlist, c_ops, progress_bar = Val(false))
    return real(expect(a' * a, sol.states[end]))
end

# Analytical solution
function my_f_analytical(p)
    Δ, F, γ = p
    return F^2 / (Δ^2 + γ^2 / 4)
end
function my_grad_analytical(p)
    Δ, F, γ = p
    return [
        -2 * F^2 * Δ / (Δ^2 + γ^2 / 4)^2,
         2 * F / (Δ^2 + γ^2 / 4),
        -F^2 * γ / (2 * (Δ^2 + γ^2 / 4)^2)
    ]
end
my_grad_analytical (generic function with 1 method)

The gradient can be computed using ForwardDiff.gradient:

julia
Δ = 1.5
F = 1.5
γ = 1.5
params = [Δ, F, γ]

grad_exact = my_grad_analytical(params)
grad_fd = ForwardDiff.gradient(my_f_mesolve_direct, params)
3-element Vector{Float64}:
 -0.853333333359848
  1.0666666665744058
 -0.2133333331248055

and test if the results match:

julia
isapprox(grad_exact, grad_fd; atol = 1e-5)
true
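As a further cross-check, the steady state can also be computed directly with the steadystate function instead of integrating to long times. A minimal sketch (assuming the steadystate(H, c_ops) method and reusing the variables defined above; H_check and ρss are just illustrative names):

julia
# Direct steady-state computation as a sanity check
H_check = params[1] * a' * a + params[2] * (a + a')
ρss = steadystate(H_check, [sqrt(params[3]) * a])
real(expect(a' * a, ρss))  # ≈ F^2 / (Δ^2 + γ^2 / 4) = 0.8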

Reverse Mode AD with Zygote.jl

Reverse-mode differentiation is significantly more challenging than forward mode when dealing with ODEs, as gradients must be propagated backward through the entire time evolution of the quantum state.

QuantumToolbox.jl leverages the advanced capabilities of SciMLSensitivity.jl to handle this complexity. SciMLSensitivity.jl implements sophisticated methods for computing gradients of ODE solutions, such as the adjoint method, which computes gradients by solving an additional "adjoint" ODE backward in time. For more details on the adjoint method and other sensitivity analysis techniques, please refer to the SciMLSensitivity.jl documentation.
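Schematically (a rough summary; see the SciMLSensitivity.jl documentation for the exact formulations and conventions), for an ODE $\dot{z} = f(z, p, t)$ on $t \in [0, T]$ with a scalar loss $G = g(z(T))$, the adjoint state $\lambda(t)$ satisfies

$$\lambda(T) = \frac{\partial g}{\partial z(T)}, \qquad \frac{d \lambda}{dt} = -\left( \frac{\partial f}{\partial z} \right)^{\top} \lambda, \qquad \frac{d G}{d p} = \int_0^T \lambda^{\top} \frac{\partial f}{\partial p} \, dt,$$

so the gradient with respect to all parameters is obtained from a single backward integration, at a cost essentially independent of the number of parameters.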

In order to reverse-differentiate the master equation, we need to define the operators as QuantumObjectEvolution (QobjEvo) objects, which use SciMLOperators.jl to represent parameter-dependent operators.

julia
using Zygote
using SciMLSensitivity

# For SciMLSensitivity.jl
coef_Δ(p, t) = p[1]
coef_F(p, t) = p[2]
coef_γ(p, t) = sqrt(p[3])
H = QobjEvo(a' * a, coef_Δ) + QobjEvo(a + a', coef_F)
c_ops = [QobjEvo(a, coef_γ)]
const L = liouvillian(H, c_ops)

function my_f_mesolve(p)
    sol = mesolve(
        L,
        ψ0,
        tlist,
        progress_bar = Val(false),
        params = p,
        # Adjoint sensitivity method; vector-Jacobian products computed with Enzyme.jl
        sensealg = BacksolveAdjoint(autojacvec = EnzymeVJP()),
    )

    return real(expect(a' * a, sol.states[end]))
end
my_f_mesolve (generic function with 1 method)

And the gradient can be computed using Zygote.gradient:

julia
grad_zygote = Zygote.gradient(my_f_mesolve, params)[1]
3-element Vector{Float64}:
 -0.8533334042012993
  1.0666635706683032
 -0.21333016647211997

Finally, we can compare the results from ForwardDiff.jl and Zygote.jl:

julia
isapprox(grad_fd, grad_zygote; atol = 1e-5)
true

Conclusion

In this section, we have explored the integration of automatic differentiation into QuantumToolbox.jl, enabling users to compute gradients of observables and cost functionals involving the time evolution of open quantum systems. We demonstrated how to differentiate the master equation using both forward mode with ForwardDiff.jl and reverse mode with Zygote.jl, showcasing the flexibility and power of automatic differentiation in quantum computing applications. AD can be applied to other functions in QuantumToolbox.jl, although the support is still experimental and not all functions are guaranteed to be compatible. We encourage users to experiment with AD in their quantum simulations and contribute to the ongoing development of this feature.