Variational Quantum Algorithms - arXiv

Variational quantum algorithms

M. Cerezo,1,2,3,∗ Andrew Arrasmith,1,3 Ryan Babbush,4 Simon C. Benjamin,5 Suguru Endo,6 Keisuke Fujii,7,8,9 Jarrod R. McClean,4 Kosuke Mitarai,7,10,11 Xiao Yuan,12,13 Lukasz Cincio,1,3 and Patrick J. Coles1,3,†

1 Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
2 Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
3 Quantum Science Center, Oak Ridge, TN 37931, USA
4 Google Quantum AI Team, Venice, CA 90291, United States of America
5 Department of Materials, University of Oxford, Parks Road, Oxford OX1 3PH, United Kingdom
6 NTT Secure Platform Laboratories, NTT Corporation, Musashino, Tokyo 180-8585, Japan
7 Graduate School of Engineering Science, Osaka University, Osaka 560-8531, Japan
8 Center for Quantum Information and Quantum Biology, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Osaka 560-8531, Japan
9 Center for Emergent Matter Science, RIKEN, Saitama 351-0198, Japan
10 Center for Quantum Information and Quantum Biology, Institute for Open and Transdisciplinary Research Initiatives, Osaka 560-8531, Japan
11 JST, PRESTO, Saitama 332-0012, Japan
12 Center on Frontiers of Computing Studies, Department of Computer Science, Peking University, Beijing 100871, China
13 Stanford Institute for Theoretical Physics, Stanford University, Stanford California 94305, USA

Applications such as simulating complicated quantum systems or solving large-scale linear algebra problems are very challenging for classical computers due to the extremely high computational cost. Quantum computers promise a solution, although fault-tolerant quantum computers will likely not be available in the near future. Current quantum devices have serious constraints, including limited numbers of qubits and noise processes that limit circuit depth. Variational Quantum Algorithms (VQAs), which use a classical optimizer to train a parametrized quantum circuit, have emerged as a leading strategy to address these constraints. VQAs have now been proposed for essentially all applications that researchers have envisioned for quantum computers, and they appear to be the best hope for obtaining quantum advantage. Nevertheless, challenges remain, including the trainability, accuracy, and efficiency of VQAs. Here we overview the field of VQAs, discuss strategies to overcome their challenges, and highlight the exciting prospects for using them to obtain quantum advantage.

I. INTRODUCTION

Quantum computing holds promise for a number of applications that have motivated the decades-long quest to build the necessary physical hardware. For example, with an exponential speedup over classical methods, quantum algorithms could factor numbers [1], simulate quantum systems [2], or solve linear systems of equations [3].

In 2016, access to the first cloud-based quantum computer [4] became available, but noise and qubit limitations prevented serious implementations of the aforementioned quantum algorithms [5]. However, excitement grew as to what could be done with these new devices, which have been called Noisy Intermediate-Scale Quantum (NISQ) computers [6]. Current state-of-the-art device size ranges from 50 to 100 qubits, which allows one to achieve ‘quantum supremacy’: outperforming the best classical supercomputer for certain contrived mathematical tasks [7, 8].

Nevertheless, the true promise of quantum computers, speedup for practical applications, which is often called quantum advantage, has yet to be realized. Moreover, the availability of fault-tolerant quantum computers appears to still be many years, or even decades, away. The key technological question is therefore how to make best use of today’s NISQ devices to achieve quantum advantage. Any such strategy must account for: limited numbers of qubits, limited connectivity of the qubits, and coherent and incoherent errors that limit quantum circuit depth.

∗ e-mail: [email protected]
† e-mail: [email protected]

Variational Quantum Algorithms (VQAs) have emerged as the leading strategy to obtain quantum advantage on NISQ devices. Accounting for all of the constraints imposed by NISQ computers with a single strategy requires an optimization-based or learning-based approach, precisely what VQAs use. VQAs are arguably the quantum analog of highly successful machine-learning methods, such as neural networks. Moreover, VQAs leverage the toolbox of classical optimization, since VQAs use parametrized quantum circuits to be run on the quantum computer, and then outsource the parameter optimization to a classical optimizer. This approach has the added advantage of keeping the quantum circuit depth shallow and hence mitigating noise, in contrast to quantum algorithms developed for the fault-tolerant era.

VQAs have already been considered for a plethora of applications (see Figure 3), covering essentially all of the applications that researchers had envisioned for quantum computers. Although they may be the key to obtaining near-term quantum advantage, VQAs still face important challenges, including their trainability, accuracy, and efficiency. In this Review, we discuss the exciting prospects for VQAs, and we highlight the challenges that must be overcome to obtain the ultimate goal of quantum advantage.

arXiv:2012.09265v2 [quant-ph] 4 Oct 2021

FIG. 1. Schematic diagram of a Variational Quantum Algorithm (VQA). The inputs to a VQA are: a cost function C(θ), with θ a set of parameters that encodes the solution to the problem, an ansatz whose parameters are trained to minimize the cost, and (possibly) a set of training data $\{\rho_k\}$ used during the optimization. Here, the cost can often be expressed in the form in Eq. (3), for some set of functions $\{f_k\}$. Also, the ansatz is shown as a parametrized quantum circuit (on the left), which is analogous to a neural network (also shown schematically on the right). At each iteration of the loop one uses a quantum computer to efficiently estimate the cost (or its gradients). This information is fed into a classical computer that leverages the power of optimizers to navigate the cost landscape C(θ) and solve the optimization problem in Eq. (1). Once a termination condition is met, the VQA outputs an estimate of the solution to the problem. The form of the output depends on the precise task at hand. The red box indicates some of the most common types of outputs.

II. BASIC CONCEPTS AND TOOLS

One of the main advantages of VQAs is that they provide a general framework that can be used to solve a variety of problems. Although this versatility translates into different algorithmic structures with different levels of complexity, there are basic elements that most (if not all) VQAs have in common. In this section we review the building blocks of VQAs.

Let us start by considering a task one wishes to solve. This implies having access to a description of the problem, and also possibly to a set of training data. As schematically shown in Fig. 1, the first step to developing a VQA is to define a cost (or loss) function C which encodes the solution to the problem. One then proposes an ansatz, that is, a quantum operation depending on a set of continuous or discrete parameters θ that can be optimized (see below for a more in-depth discussion of ansatzes). This ansatz is then trained in a hybrid quantum-classical loop to solve the optimization task

$$\theta^* = \arg\min_{\theta} C(\theta) . \tag{1}$$

The trademark of VQAs is that they use a quantum computer to estimate the cost function C(θ) (or its gradient) while leveraging the power of classical optimizers to train the parameters θ. In what follows, we provide additional details for each step of the VQA architecture shown in Fig. 1.
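The hybrid loop described above can be illustrated with a minimal Python sketch in which the quantum computer is replaced by an exact classical simulation. The single-qubit ansatz, the observable Z, the learning rate, the stopping tolerance, and all function names are assumptions made here for illustration and are not taken from the references.

```python
import numpy as np

Z = np.array([[1, 0], [0, -1]], dtype=complex)

def ansatz_state(theta):
    """Toy ansatz (an assumption): R_y(theta) applied to |0>."""
    return np.array([np.cos(theta / 2), np.sin(theta / 2)], dtype=complex)

def cost(theta):
    """Stand-in for the quantum device: exact expectation value <Z>."""
    psi = ansatz_state(theta)
    return float(np.real(psi.conj() @ Z @ psi))

def run_vqa(theta0=0.3, lr=0.4, max_iters=200, tol=1e-10):
    """Classical outer loop: plain gradient descent until the update stalls."""
    theta = theta0
    for _ in range(max_iters):
        # Gradient from two shifted cost evaluations (exact for this ansatz).
        grad = 0.5 * (cost(theta + np.pi / 2) - cost(theta - np.pi / 2))
        new_theta = theta - lr * grad
        if abs(new_theta - theta) < tol:   # termination condition
            break
        theta = new_theta
    return theta, cost(theta)

theta_opt, c_opt = run_vqa()
```

In this toy problem the cost is cos θ, so the loop should settle near θ = π, where the cost reaches its minimum value of −1. On hardware, `cost` would instead return a finite-shot estimate.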

A. Cost function

A crucial aspect of a VQA is encoding the problem into a cost function. Similar to classical machine learning, the cost function maps values of the trainable parameters θ to real numbers. More abstractly, the cost defines a hyper-surface usually called the cost landscape (see Fig. 1) such that the task of the optimizer is to navigate through the landscape and find the global minimum. Without loss of generality, the cost can be expressed as

$$C(\theta) = f\left(\{\rho_k\}, \{O_k\}, U(\theta)\right) , \tag{2}$$

where f is some function, U(θ) is a parametrized unitary, θ is composed of discrete and continuous parameters, $\{\rho_k\}$ are input states from a training set, and $\{O_k\}$ are a set of observables. Often it is useful, and possible, to express the cost in the form

$$C(\theta) = \sum_k f_k\left(\mathrm{Tr}\left[O_k U(\theta) \rho_k U^\dagger(\theta)\right]\right) , \tag{3}$$

for some set of functions $\{f_k\}$. Note that the task at hand will determine the choice of f in Eq. (2) or the choice of $\{f_k\}$ in Eq. (3). During the optimization, one uses a finite-statistics estimator of the cost or its gradients. (See below for an overview of optimizers used to train the cost function.)
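As a small worked instance of the cost in Eq. (3), the following Python sketch evaluates the sum over k exactly with NumPy. The single-qubit rotation, the two input states, the observables, and the functions $f_k$ are illustrative assumptions; on a device each trace would be replaced by a sampled estimate.

```python
import numpy as np

def cost_eq3(U, rhos, observables, fs):
    """C = sum_k f_k( Tr[O_k U rho_k U^dagger] ) for a fixed unitary U = U(theta)."""
    total = 0.0
    for rho, O, f in zip(rhos, observables, fs):
        expval = np.real(np.trace(O @ U @ rho @ U.conj().T))
        total += f(expval)
    return total

# Toy instance (every choice below is an illustrative assumption).
X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
rho0 = np.array([[1, 0], [0, 0]], dtype=complex)   # |0><0|
rho1 = np.array([[0, 0], [0, 1]], dtype=complex)   # |1><1|

def ry(theta):
    """Single-qubit rotation about the y axis."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]], dtype=complex)

theta = 0.7
value = cost_eq3(ry(theta), [rho0, rho1], [Z, X],
                 [lambda x: x, lambda x: x ** 2])
```

For this choice the two traces are cos θ and −sin θ, so the cost is cos θ + sin²θ.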

Let us now discuss desirable criteria that the cost function should meet. First, the cost must be ‘faithful’ in that the minimum of C(θ) corresponds to the solution of the problem. Second, one must be able to ‘efficiently estimate’ C(θ) by performing measurements on a quantum computer and possibly performing classical post-processing. An implicit assumption here is that the cost should not be efficiently computable with a classical computer, as this would imply that no quantum advantage can be achieved with the VQA. In addition, it is also useful for C(θ) to be ‘operationally meaningful’, so that smaller cost values indicate a better solution quality. Finally, the cost must be ‘trainable’, which means that it should be possible to efficiently optimize the parameters θ. We will later discuss in more detail the issue of trainability for VQAs.

For a given VQA to be implementable on NISQ hardware, the quantum circuits used to estimate C(θ) must keep the circuit depth and ancilla requirements small. This is because NISQ devices are prone to gate errors and have limited qubit counts, and their qubits have short decoherence times. Hence the construction of efficient cost evaluation circuits is an important aspect of VQA research.

B. Ansatzes

Another important aspect of a VQA is its ansatz. Generically speaking, the form of the ansatz dictates what the parameters θ are, and hence, how they can be trained to minimize the cost. The specific structure of an ansatz will generally depend on the task at hand, as in many cases one can use information about the problem to tailor an ansatz. These are the so-called ‘problem-inspired ansatzes’. However, some ansatz architectures are generic and ‘problem-agnostic’, meaning that they can be used even when no relevant information is readily available. For the cost function in Eq. (3), the parameters θ can be encoded in a unitary U(θ) that is applied to the input states to the quantum circuit. As shown in Fig. 2, U(θ) can be generically expressed as the product of L sequentially applied unitaries

$$U(\theta) = U_L(\theta_L) \cdots U_2(\theta_2) U_1(\theta_1) , \tag{4}$$

with

$$U_l(\theta_l) = \prod_m e^{-i\theta_m H_m} W_m . \tag{5}$$

Here $W_m$ is an unparametrized unitary and $H_m$ is a Hermitian operator; $\theta_l$ is the l-th element in θ. Below we describe some of the most widely used ansatzes in the literature, starting with those that can be expressed as Eq. (4), and then presenting more general architectures.
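The layered structure of Eqs. (4) and (5) can be sketched numerically as follows. The ordering convention inside each layer (the first factor in the lists acts first), the two-qubit generators, and the fixed gates are assumptions chosen for illustration; the helper names are hypothetical.

```python
import numpy as np

def exp_i_hermitian(H, theta):
    """Return exp(-i * theta * H) for Hermitian H via its eigendecomposition."""
    evals, evecs = np.linalg.eigh(H)
    return (evecs * np.exp(-1j * theta * evals)) @ evecs.conj().T

def layer_unitary(thetas, hams, ws):
    """U_l = prod_m exp(-i theta_m H_m) W_m, with the first list entry applied first."""
    dim = hams[0].shape[0]
    U = np.eye(dim, dtype=complex)
    for theta, H, W in zip(thetas, hams, ws):
        U = exp_i_hermitian(H, theta) @ W @ U
    return U

def ansatz_unitary(layer_params, hams, ws):
    """U(theta) = U_L ... U_2 U_1: later layers multiply from the left."""
    dim = hams[0].shape[0]
    U = np.eye(dim, dtype=complex)
    for thetas in layer_params:
        U = layer_unitary(thetas, hams, ws) @ U
    return U

# Two-qubit toy example: Y rotations on each qubit, one CNOT per layer.
I2 = np.eye(2, dtype=complex)
Y = np.array([[0, -1j], [1j, 0]])
Z = np.array([[1, 0], [0, -1]], dtype=complex)
CNOT = np.array([[1, 0, 0, 0], [0, 1, 0, 0],
                 [0, 0, 0, 1], [0, 0, 1, 0]], dtype=complex)
hams = [np.kron(Y, I2), np.kron(I2, Y)]
ws = [np.eye(4, dtype=complex), CNOT]
U = ansatz_unitary([[0.1, 0.2], [0.3, 0.4]], hams, ws)
```

The resulting matrix is unitary by construction, which is a quick sanity check when changing the generators or the layer count.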

1. Hardware efficient ansatz

The hardware efficient ansatz [9] is a generic name used for ansatzes that are aimed at reducing the circuit depth needed to implement U(θ) when using a given quantum hardware. Here one uses unitaries $W_m$ and $e^{-i\theta_m H_m}$ that are taken from a gate alphabet (set of quantum gates) determined from the connectivity and interactions specific to a quantum hardware. This avoids the circuit depth overhead arising from translating an arbitrary unitary into a sequence of gates easily implementable in a device. One of the main advantages of the hardware efficient ansatz is its versatility, as it can accommodate encoding symmetries [10, 11] and bringing correlated qubits closer for depth reduction [12], as well as being especially useful to study Hamiltonians that are similar to the device’s interactions [13]. Such is the case, for instance, of local spin Hamiltonians, although in this case it has been heuristically shown that near criticality the ansatz requires depths proportional to the system size [14]. Additionally, ‘layered’ hardware efficient ansatzes, where gates act on alternating pairs of qubits in a brick-like structure, have been prominently used as problem-agnostic architectures. However, this ansatz can lead to trainability problems when randomly initialized.

FIG. 2. Schematic diagram of an ansatz. The unitary U(θ), with θ a set of parameters, can be expressed as a product of L unitaries $U_l(\theta_l)$ sequentially acting on an input state. As indicated, each unitary $U_l(\theta_l)$ can in turn be decomposed into a sequence of parametrized and unparametrized gates.

2. Unitary coupled cluster ansatz

The Unitary Coupled Cluster (UCC) ansatz is a problem-inspired ansatz widely used in quantum chemistry problems where the goal is to obtain the ground state energy of a fermionic molecular Hamiltonian H. The UCC ansatz proposes a candidate for such ground state based on exciting some reference state $|\psi_0\rangle$ (usually the Hartree-Fock state of H) as $e^{T(\theta) - T(\theta)^\dagger} |\psi_0\rangle$. Here, $T = \sum_k T_k$ is the cluster operator [15, 16] and $T_k$ are excitation operators. In the so-called UCCSD ansatz (SD stands for single and double) the summation is truncated to contain single excitations $T_1 = \sum_{i,j} \theta_i^j a_i^\dagger a_j$, and double excitations $T_2 = \sum_{i,j,k,l} \theta_{i,j}^{k,l} a_i^\dagger a_j^\dagger a_k a_l$, where $\{a_i^\dagger\}$ ($\{a_i\}$) are fermionic creation (annihilation) operators. To implement this ansatz on a quantum computer one uses the Jordan-Wigner or the Bravyi-Kitaev transformations [17] to map the fermionic operators to spin operators, resulting in an ansatz of the form Eq. (4). There are many variants of the UCC ansatz [18], with some of them reducing the circuit depth by considering more efficient methods for compiling the fermionic operators [19–22].

3. Quantum alternating operator ansatz

The Quantum Approximate Optimization Algorithm (QAOA) was originally introduced to obtain approximate solutions for combinatorial optimization problems [23]. The ansatz used in QAOA involves an alternating structure and is often called the quantum alternating operator ansatz [24], sharing the same acronym as the algorithm (although we will use QAOA to refer to the algorithm in this Review). This ansatz was first shown to be computationally universal for certain Hamiltonians in Ref. [25], with the proof of its universality being generalized in Ref. [26] for families of ansatzes defined by sets of graphs and hyper-graphs. The ansatz in QAOA is inspired by a Trotterized adiabatic transformation where the order p of the Trotterization determines the precision of the solution. The goal of this ansatz is to map an input state $|\psi_0\rangle$ to the ground state of a given problem Hamiltonian $H_P$ by sequentially applying a problem unitary $e^{-i\gamma_l H_P}$ and a mixer unitary $e^{-i\beta_l H_M}$, where $H_M$ is a Hermitian operator known as the mixing Hamiltonian [27]. Specifically, the ansatz takes the form $U(\gamma, \beta) = \prod_{l=1}^{p} e^{-i\beta_l H_M} e^{-i\gamma_l H_P}$, where θ = (γ, β). This ansatz is naturally of the form in Eq. (4), although decomposing these unitaries into native gates may result in a lengthy circuit due to many-body terms in $H_P$ and limited device connectivity. One of the strengths of this ansatz is the fact that the feasible subspace for certain problems is smaller than the full Hilbert space, and this restriction may result in a better-performing algorithm.
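To make the alternating structure concrete, the following Python sketch simulates the QAOA ansatz on a three-vertex graph. The MaxCut-style problem Hamiltonian $H_P = \sum_{(i,j)} Z_i Z_j$, the mixer $H_M = \sum_i X_i$, the uniform-superposition input state, and the function names are all assumptions made for this illustration rather than choices prescribed by the text.

```python
import numpy as np
from functools import reduce

def maxcut_diagonal(n, edges):
    """Diagonal of H_P = sum_{(i,j)} Z_i Z_j (low energy <-> many cut edges)."""
    diag = np.zeros(2 ** n)
    for idx in range(2 ** n):
        bits = [(idx >> (n - 1 - q)) & 1 for q in range(n)]
        diag[idx] = sum(1 - 2 * (bits[i] ^ bits[j]) for i, j in edges)
    return diag

def mixer_unitary(n, beta):
    """exp(-i beta H_M) with H_M = sum_i X_i, as a tensor product of 1-qubit rotations."""
    X = np.array([[0, 1], [1, 0]], dtype=complex)
    rx = np.cos(beta) * np.eye(2) - 1j * np.sin(beta) * X
    return reduce(np.kron, [rx] * n)

def qaoa_state(gammas, betas, diag, n):
    """Apply prod_l exp(-i beta_l H_M) exp(-i gamma_l H_P) to |+>^n."""
    psi = np.full(2 ** n, 1 / np.sqrt(2 ** n), dtype=complex)
    for gamma, beta in zip(gammas, betas):
        psi = np.exp(-1j * gamma * diag) * psi    # problem unitary (diagonal)
        psi = mixer_unitary(n, beta) @ psi        # mixer unitary
    return psi

def qaoa_energy(gammas, betas, diag, n):
    """Cost <psi| H_P |psi> for the given angles."""
    psi = qaoa_state(gammas, betas, diag, n)
    return float(np.real(np.vdot(psi, diag * psi)))

n = 3
diag = maxcut_diagonal(n, [(0, 1), (1, 2), (0, 2)])   # triangle graph
energy = qaoa_energy([0.4, 0.7], [0.3, 0.2], diag, n)  # p = 2 layers
```

Here the number of (γ, β) pairs sets the number of layers p; a classical optimizer would then adjust these angles to lower `energy`.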

4. Variational Hamiltonian ansatz

Inspired by the QAOA ansatz, the variational Hamiltonian ansatz also aims to prepare a trial ground state for a given Hamiltonian $H = \sum_k H_k$ (where $H_k$ are Hermitian operators, usually Pauli strings) by Trotterizing an adiabatic state preparation process [28]. Here, each Trotter step corresponds to a variational ansatz so that the unitary is given by $U(\theta) = \prod_l \left( \prod_k e^{-i\theta_{l,k} H_k} \right)$, and again is of the form Eq. (4). Due to its versatility, the variational Hamiltonian ansatz has been implemented for quantum chemistry [21, 29], optimization [24], and quantum simulation problems [30].

5. Variable structure ansatz

In many ansatzes, one optimizes over continuous parameters (such as rotation angles), while the structure of the circuit is kept fixed. Although this enables the control of the overall circuit complexity, it may miss refinements attained by optimizing the circuit structure itself, including adding circuit elements or removing unnecessary ones. Optimizing the circuit structure was initially explored in a framework called ADAPT-VQE [31], which seeks to adaptively add specific elements to the ansatz to maximize the benefit while minimizing the number of circuit elements in quantum chemistry applications. (Improvements to ADAPT-VQE and variable ansatzes for quantum chemistry have been introduced in Refs. [32, 33], and a variable structure version of the QAOA ansatz was introduced in Ref. [34].) One can then view this problem as a sparse model problem, and whereas such an optimization is known to be hard, heuristic or greedy approximations that seek to add one term at a time have been shown to be helpful [31, 32].

Machine learning-aided evolutionary algorithms for circuit design have also been explored in Refs. [35, 36], where individuals (quantum circuits) from a population are upgraded to grow the circuit and explore the Hilbert space. In addition, Refs. [37–41] use tools from machine learning to develop variational ansatzes for various VQA applications. Complementary approaches based on exploring different ansatz variants simultaneously as an evolving cohort have also shown promising performance [42].

6. Sub-logical ansatz and quantum optimal control

The parameters θ are often specified at the logical circuit level (such as rotation angles); however, sometimes they have a direct translation to device-level parameters below the logical level. Hence, one can include these device-level parameters in the definition of the ansatz, as this can offer additional flexibility [43]. This approach also establishes a connection to the idea of quantum optimal control, which is often used to determine the translation from logical to physical device parameters, and which is especially applicable for quantum simulations [44, 45]. Refs. [46, 47] have explored using VQAs for the construction of optimal control sequences. Although this can increase the number of parameters, the additional flexibility may allow for on-the-fly calibration effects that have been seen to reduce the effects of coherent noise [46–48].


7. Hybrid ansatzes

In some cases, it is possible to combine quantum ansatzes with classical strategies to push some of the complexity onto the classical device. For instance, in quantum chemistry one can exploit the classical simulability of free fermion dynamics to apply quantum operations via classical post-processing [49–54]. A different approach is to use as ansatz a trainable linear combination of parametrized states $|\psi(\{c_\mu\}, \theta)\rangle = \sum_\mu c_\mu |\psi_\mu(\theta_\mu)\rangle$ with $\{c_\mu\}$ classically optimizable coefficients [55–61]. Moreover, given that quantum circuits can be viewed as tensor networks [62], it is natural to combine the existing tensor network techniques with a quantum ansatz [63–67]. For instance, it has been shown that it is possible to unitarily contract tensor networks on a quantum computer [63, 64]. An alternative hybrid approach was proposed via the deep variational quantum eigensolver, where the algorithm divides the whole system into small subsystems and sequentially solves each subsystem and the interaction between the subsystems [68]. Finally, there is also a hybrid method that combines variational Monte Carlo techniques with a quantum ansatz to classically apply the so-called Jastrow operator $e^{\sum_{i,j} J_{ij} \sigma_i \sigma_j}$ (for J a symmetric matrix, and $\sigma_i$ and $\sigma_j$ Pauli operators) to a parametrized quantum state $|\psi(\theta)\rangle$ with the goal of obtaining a more accurate result by optimizing J and θ together [69].

8. Ansatz for mixed states

Since mixed states play an important role in many applications, such as systems at finite temperature, several ansatzes have been developed to construct a mixed state $\rho = \sum_i p_i |\psi_i\rangle\langle\psi_i|$ of n qubits (here $p_i$ are the eigenvalues of ρ such that $\sum_i p_i = 1$). A first approach (which comes at the cost of requiring up to 2n qubits) is based on preparing a pure state that has ρ as a reduced state in some subsystem of qubits. Refs. [65, 70] have proposed a method to variationally obtain a purification $|\psi\rangle = \sum_i \sqrt{p_i} |\psi_i\rangle|\phi_i\rangle$ of ρ, whereas Ref. [71] introduced a method to construct a state $|\rho\rangle = \frac{1}{c} \sum_i p_i |\psi_i\rangle|\psi_i\rangle$ with normalization $c = \sqrt{\sum_i p_i^2}$. Alternatively, one can also train a probability distribution $\{p_i(\phi)\}$ and a set of states $\{|\psi_i(\theta_i)\rangle\}$ to construct ρ as the statistical ensemble $\rho(\phi, \{\theta_i\}) = \sum_i p_i(\phi) |\psi_i(\theta_i)\rangle\langle\psi_i(\theta_i)|$. Ref. [70] proposed to use a simple product distribution based on physical insights, whereas a more general proposal for energy based models was introduced in Ref. [72]. More recently, there has been a proposal to generate mixed states which uses the autoregressive model [73].
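The purification approach can be checked on a small example: building $|\psi\rangle = \sum_i \sqrt{p_i} |\psi_i\rangle|\phi_i\rangle$ and tracing out the second register should return ρ. In the Python sketch below, the choice of computational basis states for $|\phi_i\rangle$, the example density matrix, and the function names are assumptions made for illustration.

```python
import numpy as np

def purification(probs, states):
    """|psi> = sum_i sqrt(p_i) |psi_i>|phi_i>, with |phi_i> taken as basis states.

    `states` holds the vectors |psi_i> as rows.
    """
    dim = states.shape[1]
    psi = np.zeros(dim * len(probs), dtype=complex)
    for i, (p, vec) in enumerate(zip(probs, states)):
        anc = np.zeros(len(probs), dtype=complex)
        anc[i] = 1.0
        psi += np.sqrt(p) * np.kron(vec, anc)
    return psi

def reduced_state(psi, dim_sys, dim_anc):
    """Trace out the ancilla register of a pure state on system (x) ancilla."""
    M = psi.reshape(dim_sys, dim_anc)
    return M @ M.conj().T

# Example single-qubit mixed state (an arbitrary illustrative choice).
rho = np.array([[0.7, 0.2], [0.2, 0.3]], dtype=complex)
evals, evecs = np.linalg.eigh(rho)        # p_i and |psi_i> (as columns)
psi = purification(evals, evecs.T)
rho_back = reduced_state(psi, 2, 2)
```

The purified state lives on twice as many qubits as ρ, which is the overhead of up to 2n qubits mentioned above.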

9. Ansatz expressibility

Given the wide range of ansatzes one can use, a relevant question is whether a given architecture can prepare a target state by optimizing its parameters. In this sense, there are different ways to judge the quality of an ansatz [74] by considering two different notions: the expressibility and the entangling capability of an ansatz. An ansatz is expressible if the circuit can be used to uniformly explore the entire space of quantum states. Thus one way to quantify the expressibility of an ansatz U(θ) is to compare the distribution of states obtained from U(θ) to the maximally expressive uniform (Haar) distribution of states $U_{\mathrm{Haar}}$. Motivated by this line of thought, the expressibility of a circuit is measured in Ref. [74] by $\|A^{(t)}\|$, where

$$A^{(t)}(U) := \int dU_{\mathrm{Haar}} \, U_{\mathrm{Haar}}^{\otimes t} |0\rangle\langle 0| (U_{\mathrm{Haar}}^\dagger)^{\otimes t} - \int dU \, U^{\otimes t} |0\rangle\langle 0| (U^\dagger)^{\otimes t} . \tag{6}$$

Other expressibility measures can be considered as well [74], and the expressibility of different ansatzes was investigated further in Ref. [75]. Ref. [74] also introduced a measure of entangling capability for ansatzes, which quantifies the average entanglement of states produced from randomly sampling the circuit parameters θ.

Quantifying expressibility for particular ansatzes is an active area of research [74–78], with certain quantum architectures exhibiting higher expressibility (according to certain measures) relative to classical architectures [77].
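As a rough numerical illustration of Eq. (6), the sketch below evaluates $\|A^{(t)}\|$ for t = 2 on a single qubit. It relies on the standard closed form for the Haar second moment of a pure state, (I + SWAP)/(d(d+1)); the use of the Hilbert-Schmidt (Frobenius) norm, the uniform parameter grid on [0, 2π) standing in for the integral over U, and the two toy circuits are assumptions made here.

```python
import numpy as np
from itertools import product

def ry(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]], dtype=complex)

def rz(phi):
    return np.diag([np.exp(-1j * phi / 2), np.exp(1j * phi / 2)])

def haar_second_moment(d):
    """Haar average of (|psi><psi|)^{(x)2}: (I + SWAP) / (d (d + 1))."""
    swap = np.zeros((d * d, d * d))
    for a, b in product(range(d), repeat=2):
        swap[b * d + a, a * d + b] = 1.0
    return (np.eye(d * d) + swap) / (d * (d + 1))

def ansatz_second_moment(state_fn, n_params, grid=16):
    """Average of (|psi><psi|)^{(x)2} over a uniform parameter grid on [0, 2 pi)."""
    angles = np.linspace(0.0, 2 * np.pi, grid, endpoint=False)
    total = 0.0
    for params in product(angles, repeat=n_params):
        psi = state_fn(*params)
        rho = np.outer(psi, psi.conj())
        total = total + np.kron(rho, rho)
    return total / grid ** n_params

def expressibility(state_fn, n_params, d=2):
    """Frobenius norm of A^(2); smaller values mean closer to Haar."""
    diff = haar_second_moment(d) - ansatz_second_moment(state_fn, n_params)
    return np.linalg.norm(diff)

ket0 = np.array([1, 0], dtype=complex)
only_ry = expressibility(lambda t: ry(t) @ ket0, 1)
ry_then_rz = expressibility(lambda t, p: rz(p) @ ry(t) @ ket0, 2)
```

For these two toy circuits the sketch gives about 0.204 and 0.102 respectively, so adding the second rotation brings the state distribution closer to the Haar distribution under this particular measure.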

C. Gradients

Once the cost function and ansatz have been defined, the next step is to train the parameters θ and solve the optimization problem of Eq. (1). It is known that for many optimization tasks, using information in the cost function gradient (or in higher-order derivatives) can help in speeding up and guaranteeing the convergence of the optimizer. One of the main advantages of many VQAs is that, as discussed below, one can analytically evaluate the cost function gradient.

1. Parameter-shift rule

Let us consider for simplicity a cost function of the form in Eq. (3) with $f_k(x) = x$, and let $\theta_l$ be the l-th element in θ, which parametrizes a unitary $e^{i\theta_l \sigma_l / 2}$ in the ansatz. Here, $\sigma_l$ is a Pauli operator. Surprisingly, there is a hardware-friendly protocol to evaluate the partial derivative of C(θ) with respect to $\theta_l$, often referred to as the parameter-shift rule [79–82]. Explicitly, the parameter-shift rule states that the equality

$$\frac{\partial C}{\partial \theta_l} = \sum_k \frac{1}{2\sin\alpha} \left( \mathrm{Tr}\left[O_k U(\theta_+) \rho_k U^\dagger(\theta_+)\right] - \mathrm{Tr}\left[O_k U(\theta_-) \rho_k U^\dagger(\theta_-)\right] \right) , \tag{7}$$

with $\theta_\pm = \theta \pm \alpha e_l$, holds for any real number α with sin α ≠ 0. Here $e_l$ is a vector having 1 as its l-th element and 0 otherwise. Equation (7) shows that one can evaluate the gradient by shifting the l-th parameter by some amount α. Note that the accuracy of the evaluation depends on the coefficient 1/(2 sin α), since each of the ±α terms is evaluated by sampling $O_k$. This accuracy is maximized at α = π/2, since 1/sin α is minimized at this point. Although the parameter-shift rule might resemble a naive finite difference, it evaluates the analytic gradient of the parameter by virtue of the coefficient 1/(2 sin α). A detailed comparison between the parameter-shift rule and the finite difference can be found in Ref. [83]. Finally, the gradient for more general $f_k(x)$ can be obtained from Eq. (7) by using the chain rule.
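The parameter-shift rule can be checked numerically against a small finite difference. The sketch below assumes rotations of the form $e^{i\theta\sigma/2}$, for which a shifted difference divided by 2 sin α reproduces the derivative for any admissible α; the two-gate single-qubit circuit, the observable, and the function names are illustrative assumptions.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]])
Z = np.array([[1, 0], [0, -1]], dtype=complex)

def pauli_rotation(theta, sigma):
    """exp(i * theta * sigma / 2) for a Pauli operator sigma (sigma^2 = I)."""
    return np.cos(theta / 2) * np.eye(2) + 1j * np.sin(theta / 2) * sigma

def cost(thetas):
    """C = Tr[O U rho U^dagger] with U = R_X(theta_1) R_Y(theta_0), O = Z, rho = |0><0|."""
    U = pauli_rotation(thetas[1], X) @ pauli_rotation(thetas[0], Y)
    rho = np.array([[1, 0], [0, 0]], dtype=complex)
    return float(np.real(np.trace(Z @ U @ rho @ U.conj().T)))

def parameter_shift(cost_fn, thetas, l, alpha=np.pi / 2):
    """Analytic partial derivative from two cost evaluations shifted by +/- alpha."""
    shift = np.zeros_like(thetas)
    shift[l] = alpha
    return (cost_fn(thetas + shift) - cost_fn(thetas - shift)) / (2 * np.sin(alpha))

def finite_difference(cost_fn, thetas, l, eps=1e-6):
    """Central finite difference, used here only as a reference value."""
    shift = np.zeros_like(thetas)
    shift[l] = eps
    return (cost_fn(thetas + shift) - cost_fn(thetas - shift)) / (2 * eps)

thetas = np.array([0.4, 1.1])
grad_shift = [parameter_shift(cost, thetas, l) for l in range(2)]
grad_fd = [finite_difference(cost, thetas, l) for l in range(2)]
```

With exact expectation values the two estimates agree for any shift; with sampled expectation values, a large shift keeps the prefactor small and so limits the amplification of shot noise.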

2. Other derivatives

Higher-order derivatives of the cost function can be evaluated by straightforward extensions of the parameter-shift rule. For example, the second derivative for the previous example can be written as

$$\frac{\partial^2 C}{\partial \theta_l^2} = \sum_k \frac{1}{4\sin^2\alpha} \Big( \mathrm{Tr}\left[O_k U(\theta + 2\alpha e_l) \rho_k U^\dagger(\theta + 2\alpha e_l)\right] + \mathrm{Tr}\left[O_k U(\theta - 2\alpha e_l) \rho_k U^\dagger(\theta - 2\alpha e_l)\right] - 2\,\mathrm{Tr}\left[O_k U(\theta) \rho_k U^\dagger(\theta)\right] \Big) ,$$

by applying the parameter-shift rule twice. Other higher-order derivatives such as $\partial^2 C / \partial\theta_l \partial\theta_{l'}$ or $\partial^3 C / \partial\theta_l^3$ can be obtained in a similar fashion. Explicit formulas can be found in Refs. [83, 84]. These observations relate to the fact that the cost function can be expanded into a trigonometric series that admits a classically efficient, analytical approximation around any reference point. One can thus infer a classical model of the cost function, and minimize it, to offload more work from the quantum processor to the classical supervising system [85, 86].
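The second-derivative formula above can be verified on the single-frequency form that the cost takes as a function of one rotation angle. The sketch assumes that the dependence on $\theta_l$ is a + b cos θ + c sin θ (as for a gate $e^{i\theta\sigma/2}$); the coefficients and the function names are arbitrary illustrative choices.

```python
import numpy as np

def cost(theta):
    """Toy cost a + b cos(theta) + c sin(theta), the form produced by one Pauli rotation."""
    return 0.2 + 0.5 * np.cos(theta) - 0.3 * np.sin(theta)

def exact_second_derivative(theta):
    """Analytic second derivative of the toy cost."""
    return -0.5 * np.cos(theta) + 0.3 * np.sin(theta)

def second_derivative_shift(cost_fn, theta, alpha):
    """Second derivative from evaluations at theta and theta +/- 2 alpha."""
    num = cost_fn(theta + 2 * alpha) + cost_fn(theta - 2 * alpha) - 2 * cost_fn(theta)
    return num / (4 * np.sin(alpha) ** 2)

theta = 0.9
estimate = second_derivative_shift(cost, theta, alpha=np.pi / 4)
```

Only three cost evaluations are needed, and one of them (at the unshifted point) is usually already available from the optimization loop.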

Other types of derivatives of the parametrized quantum state not directly related to the cost function, such as a metric tensor of a state $\frac{\partial \langle\psi(\theta)|}{\partial\theta_l} \frac{\partial |\psi(\theta)\rangle}{\partial\theta_{l'}}$ (with $|\psi(\theta)\rangle = U(\theta)|\psi_0\rangle$ for some initial state $|\psi_0\rangle$), are sometimes used in sophisticated optimization algorithms [87–89] and variational quantum simulation [90–92] (see the section on dynamical quantum simulation). As quantities such as $\frac{\partial \langle\psi(\theta)|}{\partial\theta_l} \frac{\partial |\psi(\theta)\rangle}{\partial\theta_{l'}}$ are essentially overlaps of different states, they can be evaluated via Hadamard-test-like protocols [91]. However, as shown in Ref. [93], those can also be reduced to the parameter-shift technique.

D. Optimizers

As for any variational approach, the success of a VQA depends on the efficiency and reliability of the optimization method used. The classical optimization problems associated with VQAs are expected to be NP-hard in general as they involve cost functions that can have many local minima [94]. In addition to the typical difficulties encountered in complex classical optimizations, it has been shown that when training a VQA one can encounter new challenges. These include issues such as the inherently stochastic environment due to the finite budget for measurements, hardware noise, and the presence of barren plateaus (see main text). This has led to the development of many quantum-aware optimizers, with the optimal choice still being an active topic of debate. Here we discuss a selection of optimizers that have been designed or promoted for use with VQAs. For convenience, these will be grouped into two categories based on whether or not they implement some version of gradient descent.

1. Gradient descent methods

One of the most common approaches to optimization is to make iterative steps in directions indicated by the gradient. Given that only statistical estimates are available for these gradients, these strategies fall under the umbrella of Stochastic Gradient Descent (SGD). One SGD method that has been imported from the machine learning community is Adam, which adapts the size of the steps taken during the optimization to allow for more efficient and precise solutions than those obtained through basic SGD [95]. An alternative method inspired by the machine learning literature adapts the precision (the number of shots taken for each estimate), rather than the step size, at each iteration in an attempt to be frugal with the quantum resources used [96]. It is possible to attain an unbiased estimator for a partial derivative with even just a single shot [97], so adapting the number of shots when low precision is acceptable can lead to significant reductions in the overall shot cost of an algorithm.
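The stochastic character of these gradients can be illustrated with a toy simulation in which every expectation value is estimated from a finite number of ±1 measurement outcomes. The single-qubit model (cost ⟨Z⟩ = cos θ for the state R_y(θ)|0⟩), the fixed shot count, the learning rate, and the function names are assumptions made for this sketch; it implements basic SGD only, not Adam or an adaptive-shot scheme.

```python
import numpy as np

def sampled_cost(theta, shots, rng):
    """Finite-shot estimate of <Z> for R_y(theta)|0>: +1 occurs with probability cos^2(theta/2)."""
    p_plus = np.cos(theta / 2) ** 2
    outcomes = np.where(rng.random(shots) < p_plus, 1.0, -1.0)
    return outcomes.mean()

def sgd(theta0, lr=0.2, shots=50, iters=300, seed=0):
    """Gradient descent driven by noisy parameter-shift gradient estimates."""
    rng = np.random.default_rng(seed)
    theta = theta0
    for _ in range(iters):
        grad = 0.5 * (sampled_cost(theta + np.pi / 2, shots, rng)
                      - sampled_cost(theta - np.pi / 2, shots, rng))
        theta -= lr * grad
    return theta

theta_final = sgd(0.5)
exact_cost = np.cos(theta_final)   # noiseless cost at the returned parameter
```

Lowering `shots` makes each iteration cheaper but increases the scatter of the final parameter around the minimum, which is the trade-off that shot-adaptive methods aim to manage.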

A different gradient-based approach is based on simulating an imaginary time evolution [87], or equivalently on using the quantum natural gradient descent method, which is based on notions of information geometry [88, 89]. Whereas standard gradient descent takes steps in the steepest descent direction in the $l_2$ (Euclidean) geometry of the parameter space, natural gradient descent works instead on a space with a metric tensor that encodes the sensitivity of the quantum state to variations in the parameters. Using this metric tensor typically accelerates the convergence of the gradient update steps, allowing a given level of precision to be attained with fewer iterations. This method has also been extended to incorporate the effects of noise [89].
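For concreteness, one common way of writing the natural-gradient update described above is sketched below for a pure state $|\psi(\theta)\rangle$. The step size η, the use of a (pseudo-)inverse $g^{+}$, and the Fubini-Study form of the metric are notational assumptions made here; Refs. [88, 89] should be consulted for the precise definitions they use.

```latex
\theta^{(t+1)} = \theta^{(t)} - \eta \, g^{+}\!\left(\theta^{(t)}\right) \nabla C\!\left(\theta^{(t)}\right),
\qquad
g_{l l'}(\theta) = \mathrm{Re}\!\left[
  \frac{\partial \langle \psi(\theta)|}{\partial \theta_l}
  \frac{\partial |\psi(\theta)\rangle}{\partial \theta_{l'}}
  - \frac{\partial \langle \psi(\theta)|}{\partial \theta_l} |\psi(\theta)\rangle
    \langle \psi(\theta)| \frac{\partial |\psi(\theta)\rangle}{\partial \theta_{l'}}
\right].
```

Replacing g by the identity matrix recovers standard gradient descent in the Euclidean geometry of the parameter space.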


2. Other methods

A different method which uses gradients, but has a more complicated update step than SGD, is meta-learning [98]. In this context, the optimizer ‘learns to learn’ by training a neural network on similar optimization problems to make a good update step based on the optimization history and current gradient. Because the update steps taken are based on rules learned from similar cost functions, this meta-learning approach has significant potential to be highly efficient when used on a new instance of a common class of optimizations.

Of the optimization methods proposed for use with VQAs which do not directly utilize gradients, the one that is perhaps the most closely related to SGD is the simultaneous perturbation stochastic approximation (SPSA) method [99]. SPSA can be considered as an approximation to gradient descent where the gradient is approximated by a single partial derivative computed by a finite difference along a randomly chosen direction. SPSA has thus been put forward as an efficient method for VQAs as it avoids the expense of computing many gradient components at each iteration. Moreover, it has been shown that for a restricted set of problems, SPSA has a faster theoretical convergence rate (in terms of the number of function calls) than SGD performed with finite differences [99].
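A minimal SPSA sketch is given below. Each iteration uses only two cost evaluations, regardless of the number of parameters. The gain sequences (including the decay exponents 0.602 and 0.101, which are commonly quoted defaults), the quadratic stand-in for C(θ), and the function names are assumptions made for illustration.

```python
import numpy as np

def spsa_minimize(cost_fn, theta0, iters=500, a=0.2, c=0.1, seed=0):
    """Simultaneous perturbation stochastic approximation with decaying gains."""
    rng = np.random.default_rng(seed)
    theta = np.array(theta0, dtype=float)
    for k in range(1, iters + 1):
        a_k = a / k ** 0.602     # step-size sequence (assumed exponents)
        c_k = c / k ** 0.101     # perturbation-size sequence
        delta = rng.choice([-1.0, 1.0], size=theta.shape)   # random +/-1 direction
        diff = cost_fn(theta + c_k * delta) - cost_fn(theta - c_k * delta)
        grad_est = diff / (2 * c_k) * delta   # 1/delta_i equals delta_i for +/-1 entries
        theta -= a_k * grad_est
    return theta

# Quadratic stand-in for a cost landscape (hypothetical target values).
target = np.array([1.0, -2.0])
theta_spsa = spsa_minimize(lambda th: float(np.sum((th - target) ** 2)), [0.0, 0.0])
```

In a VQA setting `cost_fn` would be a sampled estimate of C(θ), and the gains `a` and `c` typically need tuning to the noise level.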

Finally, another noteworthy gradient-free approach has been developed specifically for the context of VQAs for problems where the objective function is a linear function of an operator expectation value, so that C(θ) can be expressed as a sum of trigonometric functions. Using this insight, one can fit the functional dependence on a few parameters (with the rest held fixed), allowing one to make local parameter updates [85, 100]. Performing such local updates sequentially over all parameters, or subsets of parameters, and repeating these sweeps, one obtains an optimization method that is gradient-free and does not depend on hyper-parameters. Additionally, a variation of this method using Anderson acceleration (a method that adds a linear combination of prior steps to each new update step) to speed up convergence has been proposed [100].
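The sequential local updates can be sketched as follows for the single-parameter case. The sketch assumes that, with all other parameters fixed, the cost depends on one angle as a + b cos θ + c sin θ, so that three evaluations determine the sinusoid and its minimizer exactly; the toy cost, the number of sweeps, and the function names are illustrative assumptions.

```python
import numpy as np

def sinusoidal_sweep(cost_fn, theta0, sweeps=3):
    """Coordinate-wise exact minimization of a cost that is sinusoidal in each angle."""
    theta = np.array(theta0, dtype=float)
    for _ in range(sweeps):
        for j in range(theta.size):
            def along_j(value):
                trial = theta.copy()
                trial[j] = value
                return cost_fn(trial)
            c0, cp, cm = along_j(0.0), along_j(np.pi / 2), along_j(-np.pi / 2)
            a = 0.5 * (cp + cm)    # offset
            b = c0 - a             # cosine coefficient
            c = 0.5 * (cp - cm)    # sine coefficient
            # b cos(t) + c sin(t) = R cos(t - psi) is minimal at t = psi + pi.
            theta[j] = np.arctan2(c, b) + np.pi
    return theta

def toy_cost(thetas):
    """Toy two-parameter cost with the required sinusoidal structure."""
    return float(np.cos(thetas[0]) * np.cos(thetas[1]))

theta_fit = sinusoidal_sweep(toy_cost, [0.4, 1.1])
final_cost = toy_cost(theta_fit)
```

No learning rate appears anywhere in the update, which reflects the absence of hyper-parameters noted above; with sampled cost values the three evaluations per parameter would carry shot noise into the fitted coefficients.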

3. Convergence analysis

The cost landscapes of VQAs are generally non-convex and can be complicated [101], making it difficult to obtain general guarantees about the computational expense of the optimizations. However, for simplified landscapes, SGD convergence guarantees have been derived which are similar to those provided in the machine learning literature [97, 102]. Furthermore, within a convex region about a minimum, SGD methods using gradients calculated via the parameter-shift rule have been shown to have smaller upper bounds on the optimization complexity than methods using only objective values (including finite difference methods) [102].

FIG. 3. Applications of Variational Quantum Algorithms (VQAs). Many applications have been envisioned for VQAs. Here we show some of the key applications that are discussed in this Review.

III. APPLICATIONS

One of the main advantages of the VQA paradigm is that it allows for task-oriented programming. That is, VQAs provide a framework that can be used to tackle a wide array of tasks. This has led to VQAs being proposed for essentially all applications envisioned for quantum computers, and in fact, it has been shown that VQAs allow for universal quantum computing [103]. In this section we provide an overview of some of the main applications of VQAs and their state-of-the-art implementations. These applications are also summarized in Figure 3. We also refer the reader to Section V for an overview of applications where VQAs can be potentially used to obtain a quantum advantage.

A. Finding ground and excited states

The best-known application of VQAs is estimating low-lying eigenstates and corresponding eigenvalues of a given Hamiltonian. Previous quantum algorithms to find the ground state of a given Hamiltonian $H$ were based on adiabatic state preparation and quantum phase estimation subroutines [104, 105], both of which have circuit depth requirements beyond those available in the NISQ era. Hence, the first proposed VQA, the Variational Quantum Eigensolver (VQE), was developed to provide a near-term solution to this task. Here we review both the original VQE architecture and some more advanced methods for finding ground and excited states.

FIG. 4. Variational Quantum Eigensolver (VQE) implementation. The VQE algorithm can be used to estimate the ground state energy $E_G$ of a molecule. The interactions of the system are encoded in a Hamiltonian $H$, usually expressed as a linear combination of simple operators $h_k$ with coefficients $c_k$. Taking $H$ as input, VQE outputs an estimate $E_G$ of the ground-state energy. The lower part of the figure shows the results of a VQE implementation for the electronic structure problem of an $\mathrm{H}_2$ molecule, whose exact energy is shown as a dashed line. The experimental results were obtained using two of the five qubits in one of IBM's superconducting quantum processors (the inset illustrates qubit connectivity, with Q0 ... Q4 denoting the qubits). Due to the presence of hardware noise, the estimated energy $E_G$ has a gap with the true energy. In fact, amplifying the noise strength (that is, increasing the quantity $s$) deteriorates the solution quality. However, as discussed below, one can use error mitigation techniques to improve the solution quality. Figure adapted from Ref. [106], Springer Nature Limited.

1. Variational quantum eigensolver

As shown in Fig. 4, VQE is aimed at finding the ground state energy $E_G$ of a Hamiltonian $H$ [16]. Here the cost function is defined as $C(\theta) = \langle\psi(\theta)|H|\psi(\theta)\rangle$. That is, one seeks to minimize the expectation value of $H$ over a trial state $|\psi(\theta)\rangle = U(\theta)|\psi_0\rangle$ for some ansatz $U(\theta)$ and initial state $|\psi_0\rangle$. According to the Rayleigh-Ritz variational principle, the cost is meaningful and faithful as $C(\theta) \geq E_G$, with the equality holding if $|\psi(\theta)\rangle$ is the ground state $|\psi_G\rangle$ of $H$. In practice, the Hamiltonian $H$ is usually represented as a linear combination of products of Pauli operators $\sigma_k$ as $H = \sum_k c_k \sigma_k$ ($c_k \in \mathbb{R}$), so that the cost function $C(\theta)$ is obtained from a linear combination of expectation values of $\sigma_k$. Since practical physical systems are generally described by sparse Hamiltonians, the cost function can be efficiently estimated on quantum computers with a computational cost that usually grows at most polynomially with the system size.
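The structure of the VQE cost can be illustrated with a small classical simulation. The sketch below is only a toy example under stated assumptions: a single-qubit Hamiltonian $H = 0.5\,Z + 0.3\,X$ with made-up coefficients, a one-parameter ansatz $R_y(\theta)|0\rangle$, and plain gradient descent with gradients from the parameter-shift rule. Expectation values are computed exactly rather than estimated from measurement shots.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
paulis = {"X": X, "Z": Z}

# H = sum_k c_k sigma_k with hypothetical real coefficients c_k.
hamiltonian_terms = [(0.5, "Z"), (0.3, "X")]


def ansatz_state(theta):
    """Trial state |psi(theta)> = R_y(theta)|0>."""
    return np.array([np.cos(theta / 2), np.sin(theta / 2)], dtype=complex)


def expectation(state, operator):
    return float(np.real(np.vdot(state, operator @ state)))


def vqe_cost(theta):
    """C(theta) as a linear combination of Pauli expectation values."""
    state = ansatz_state(theta)
    return sum(c * expectation(state, paulis[label]) for c, label in hamiltonian_terms)


def parameter_shift_gradient(theta):
    """Exact derivative of C for a rotation generated by a Pauli operator."""
    return 0.5 * (vqe_cost(theta + np.pi / 2) - vqe_cost(theta - np.pi / 2))


theta = 0.1
for _ in range(200):
    theta -= 0.4 * parameter_shift_gradient(theta)  # learning rate 0.4 is an arbitrary choice

H = sum(c * paulis[label] for c, label in hamiltonian_terms)
exact_ground_energy = np.linalg.eigvalsh(H)[0]
print(vqe_cost(theta), exact_ground_energy)
```

Because this ansatz can reach the exact ground state of the toy Hamiltonian, the optimized cost coincides with $E_G$; for a less expressive ansatz the Rayleigh-Ritz bound only guarantees $C(\theta) \geq E_G$.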

2. Orthogonality constrained VQE

Once an approximated ground state $|\psi_G\rangle = U(\theta^*)|\psi_0\rangle$ has been obtained, one can use it to find excited states of $H$. Let $a$ be a positive constant that is much larger than the energy gap between the ground state and the first excited state. Then, $H' = H + a|\psi_G\rangle\langle\psi_G|$ is a Hamiltonian whose ground state is the first excited state of $H$ (Ref. [107]). Thus, by using the VQE for $H'$ with an updated cost $C(\theta) = \langle\psi(\theta)|H|\psi(\theta)\rangle + a\langle\psi(\theta)|\psi_G\rangle\langle\psi_G|\psi(\theta)\rangle$, one may find the first excited state of $H$. The first term here is evaluated as in VQE, and the second term can be obtained by computing the state overlap between $|\psi_G\rangle$ and $|\psi(\theta)\rangle$ (Refs. [38, 108]). This procedure can be further generalized to approximate higher excited states. Moreover, it has been shown that incorporating an imaginary time evolution can help to improve the calculation robustness [109].
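The effect of the penalty term can be checked numerically. The following sketch assumes a hypothetical four-level Hamiltonian with a prescribed spectrum and uses the exact ground state as a stand-in for the VQE output $|\psi_G\rangle$. It verifies that the lowest eigenstate of $H' = H + a|\psi_G\rangle\langle\psi_G|$ is the first excited state of $H$, and that the penalized cost equals $\langle\psi|H'|\psi\rangle$.

```python
import numpy as np

rng = np.random.default_rng(seed=7)

# Hypothetical Hamiltonian with spectrum (-1.0, -0.2, 0.5, 1.3) in a random eigenbasis.
spectrum = np.array([-1.0, -0.2, 0.5, 1.3])
Q, _ = np.linalg.qr(rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4)))
H = Q @ np.diag(spectrum) @ Q.conj().T

energies, states = np.linalg.eigh(H)
ground_state = states[:, 0]  # stand-in for the VQE estimate |psi_G>

gap = energies[1] - energies[0]
a = 10 * gap  # penalty strength, chosen much larger than the gap
H_prime = H + a * np.outer(ground_state, ground_state.conj())


def penalized_cost(psi):
    """C = <psi|H|psi> + a |<psi_G|psi>|^2 for a normalized state psi."""
    energy = np.real(np.vdot(psi, H @ psi))
    overlap = abs(np.vdot(ground_state, psi)) ** 2
    return float(energy + a * overlap)


shifted_energies, shifted_states = np.linalg.eigh(H_prime)
overlap_with_first_excited = abs(np.vdot(states[:, 1], shifted_states[:, 0])) ** 2

trial = rng.normal(size=4) + 1j * rng.normal(size=4)
trial /= np.linalg.norm(trial)
print(shifted_energies[0], energies[1], overlap_with_first_excited)
```

In an actual VQE run the overlap term would be estimated on the device, and errors in $|\psi_G\rangle$ propagate into the excited-state estimate.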

3. Subspace expansion method

Another way to discover low energy excited states using information of the estimated ground state $|\psi_G\rangle$ is via the subspace expansion method [55]. Here one runs an additional optimization in a subspace of states $\{|\psi_k\rangle\}$ generated from $|\psi_G\rangle$. For instance, one creates states $|\psi_k\rangle = \sigma_k|\psi_G\rangle$ for low-weight Pauli operators $\sigma_k$, and expands the candidates for the eigenstates as $|E\rangle = \sum_k \alpha_k|\psi_k\rangle$. Then, one obtains approximations to the lowest eigenstates by training the coefficients $\alpha = (\alpha_0, \alpha_1, \alpha_2, \ldots)$ while solving the generalised eigenvalue problem $H\alpha = ES\alpha$, with $H_{k,j} = \langle\psi_k|H|\psi_j\rangle$ and $S_{k,j} = \langle\psi_k|\psi_j\rangle$.
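The classical post-processing step of the subspace expansion can be sketched as follows. This is an illustrative example under assumptions: a random two-qubit Hamiltonian, an artificially perturbed ground state standing in for the VQE output, and the expansion operators $\{\mathbb{1}, X_1, X_2\}$. The matrix elements $H_{k,j}$ and $S_{k,j}$ are computed exactly here, whereas on hardware they would be measured. The generalised eigenvalue problem is solved by orthogonalizing with $S^{-1/2}$.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
X = np.array([[0, 1], [1, 0]], dtype=complex)

rng = np.random.default_rng(seed=3)
G = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
H = (G + G.conj().T) / 2  # hypothetical two-qubit Hamiltonian
exact_energies, exact_states = np.linalg.eigh(H)

# Imperfect ground-state estimate |psi_G> (exact ground state plus noise).
psi_G = exact_states[:, 0] + 0.2 * (rng.normal(size=4) + 1j * rng.normal(size=4))
psi_G /= np.linalg.norm(psi_G)

# Expansion states |psi_k> = sigma_k |psi_G> for low-weight Pauli operators.
expansion_ops = [np.kron(I2, I2), np.kron(X, I2), np.kron(I2, X)]
basis = np.column_stack([op @ psi_G for op in expansion_ops])

H_sub = basis.conj().T @ H @ basis  # H_{k,j} = <psi_k|H|psi_j>
S_sub = basis.conj().T @ basis      # S_{k,j} = <psi_k|psi_j>


def solve_generalized_eigenproblem(H_sub, S_sub, tol=1e-10):
    """Solve H alpha = E S alpha, discarding near-null directions of S."""
    s_vals, s_vecs = np.linalg.eigh(S_sub)
    keep = s_vals > tol
    T = s_vecs[:, keep] / np.sqrt(s_vals[keep])
    evals, evecs = np.linalg.eigh(T.conj().T @ H_sub @ T)
    return evals, T @ evecs


ritz_energies, alphas = solve_generalized_eigenproblem(H_sub, S_sub)
reference_energy = float(np.real(np.vdot(psi_G, H @ psi_G)))
print(ritz_energies[0], reference_energy, exact_energies[0])
```

Since $|\psi_G\rangle$ itself belongs to the subspace, the lowest eigenvalue obtained this way can never be worse than the input VQE energy, and it remains bounded below by the exact ground state energy.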

4. Subspace VQE

The main idea behind subspace VQE [110] is to train a unitary for preparing states in the lowest energy subspace of $H$. There are two variants of subspace VQE, called weighted and non-weighted subspace VQE. For the weighted subspace VQE, one considers a cost function $C(\theta) = \sum_{i=0}^{m} w_i \langle\varphi_i|U^\dagger(\theta)HU(\theta)|\varphi_i\rangle$ with ordered weights $w_0 > w_1 > \cdots > w_m$ and easily prepared mutually-orthogonal states $\{|\varphi_i\rangle\}$. By minimizing the cost function, one approximates the subspace of the lowest eigenstates as $\{U(\theta^*)|\varphi_i\rangle\}_{i=0}^{m}$. Since the weights are in decreasing order, each state $U(\theta^*)|\varphi_i\rangle$ corresponds to an eigenstate of the (non-degenerate) Hamiltonian with increasing energies.


The non-weighted subspace VQE makes use of the cost function $C_1(\theta) = \sum_{i=0}^{m} \langle\varphi_i|U^\dagger(\theta)HU(\theta)|\varphi_i\rangle$. Minimizing $C_1$ again gives the subspace of lowest eigenstates. As each state $U(\theta^*)|\varphi_i\rangle$ is in a superposition of the eigenstates, one needs to further optimize a second cost $C_2(\theta^*, \phi) = \langle\varphi_i|V^\dagger(\phi)U^\dagger(\theta^*)HU(\theta^*)V(\phi)|\varphi_i\rangle$ over parameters $\phi$ to rotate each state $U(\theta^*)V(\phi)|\varphi_i\rangle$ to an eigenstate.

5. Multistate contracted VQE

The multistate contracted VQE [56] can be regarded as a midway point between subspace expansion and subspace VQE. It first obtains the lowest energy subspace $\{U(\theta^*)|\varphi_i\rangle\}_{i=0}^{m}$ by optimizing $C_1(\theta)$ as in the non-weighted subspace VQE. Instead of optimizing an additional unitary, the multistate contracted VQE approximates each eigenstate as $|E\rangle = \sum_i \alpha_i U(\theta^*)|\varphi_i\rangle$ with coefficients $\alpha_i$ which are obtained by solving a generalised eigenvalue problem similar to subspace expansion with $S = \mathbb{1}$.

6. Adiabatically assisted VQE

Quantum adiabatic optimization seeks to find a solution to an optimization problem by slowly transforming the ground state of a simple problem to that of a complex problem. These methods have a close connection with classical homotopy schemes that are used to find the solutions of classical problems in optimization [111]. In light of this connection, the adiabatically assisted VQE [112] uses a cost function $C(\theta) = \langle\psi(\theta)|H(s)|\psi(\theta)\rangle$, where $H(s) = (1-s)H_0 + sH_P$ and $|\psi(\theta)\rangle = U(\theta)|\psi_0\rangle$. Here $H_P$ is the problem Hamiltonian of interest and $H_0$ is a simple Hamiltonian whose known ground state is taken as the initial state $|\psi_0\rangle$. During the parameter optimization, one slowly changes $s$ from 0 to 1. The idea of Hamiltonian transformation has been used as a type of ansatz to obtain solutions near the more challenging endpoint [113].

7. Accelerated VQE

As previously mentioned, whereas Quantum Phase Estimation (QPE) provides a means to estimate eigenenergies in the fault-tolerant era, it is not implementable in the near-term. However, one of the positive features of this algorithm is that a precision $\epsilon$ can be obtained with a number of measurements which scales as $O(\log(1/\epsilon))$. This is in contrast with VQE, which requires $O(1/\epsilon^2)$ measurements for the same precision. This scaling motivated the Accelerated VQE algorithm, which interpolates between the VQE and QPE algorithms [114–116]. The interpolation involves taking the VQE algorithm and replacing the measurement process with a tunable version of QPE called $\alpha$-QPE. This allows the measurement cost to interpolate between that of VQE and QPE.

Dynamical quantum simulation

Apart from static eigenstate problems, VQAs can also be applied to simulate the dynamical evolution of a quantum system. Conventional quantum Hamiltonian simulation algorithms, such as the Trotter-Suzuki product formula [117], generally discretize time into small time steps and simulate each time evolution with a quantum circuit. Therefore, the circuit depth generally increases polynomially with the system size and simulated time. Given the noise inherent in NISQ devices, the accumulated hardware errors for such deep quantum circuits can prove prohibitive. To address this, VQAs for dynamical quantum simulation only use a shallow depth circuit, significantly reducing the impact of hardware noise.

8. Iterative approach

Instead of directly implementing the unitary evolution described by the Schrödinger equation $\frac{d|\psi(t)\rangle}{dt} = -iH|\psi(t)\rangle$, iterative variational algorithms [90, 91] consider trial states $|\psi(\theta)\rangle$ and map the evolution of the state to the evolution of the parameters $\theta$. By iteratively updating the parameters, the quantum state is effectively updated and hence evolved. Specifically, by using variational principles, such as McLachlan's principle [118], to solve the minimization $\min_{\dot{\theta}} \|(\frac{d}{dt} + iH)|\psi(\theta)\rangle\|$, one obtains a linear equation for the parameters as $M \cdot \dot{\theta} = V$. Here $\||\psi\rangle\| = \sqrt{\langle\psi|\psi\rangle}$, $\dot{\theta} = \frac{d\theta}{dt}$, $M_{i,j} = \mathrm{Re}\left(\partial_i\langle\psi(\theta)|\,\partial_j|\psi(\theta)\rangle\right)$, $V_i = \mathrm{Im}\left(\partial_i\langle\psi(\theta)|\,H|\psi(\theta)\rangle\right)$, and $\partial_i|\psi(\theta)\rangle = \frac{\partial|\psi(\theta)\rangle}{\partial\theta_i}$. Each element of $M$ and $V$ can be efficiently measured with a modified Hadamard test circuit. By solving the linear equation, one can iteratively update the parameters from $\theta$ to $\theta + \dot{\theta}\Delta t$ with a small time step $\Delta t$. Similar variational algorithms could be applied for simulating the Wick-rotated Schrödinger equation of imaginary time evolution [87] and general first-order derivative equations with non-Hermitian Hamiltonians [92]. A systematic comparison between different variational principles for different problems can be found in Ref. [91]. Recent works also extend the algorithms to use adaptive ansatz to reduce the circuit depth [119, 120].
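The parameter update can be illustrated with a classical toy simulation. The sketch below is not the algorithm of Refs. [90, 91] as run on hardware: state derivatives are obtained by finite differences instead of Hadamard-test circuits, and the single-qubit Hamiltonian $H = (\omega/2)Z$ and ansatz $R_z(\theta_1)R_y(\theta_0)|0\rangle$ are assumptions chosen so that the ansatz can follow the exact evolution. Minimizing $\|\sum_j \dot{\theta}_j\partial_j|\psi\rangle + iH|\psi\rangle\|$ over $\dot{\theta}$ gives $\sum_j \mathrm{Re}\langle\partial_i\psi|\partial_j\psi\rangle\,\dot{\theta}_j = \mathrm{Im}\langle\partial_i\psi|H|\psi\rangle$, which is the linear system solved at each step.

```python
import numpy as np

omega = 1.3  # hypothetical frequency
H = 0.5 * omega * np.array([[1, 0], [0, -1]], dtype=complex)


def ansatz_state(theta):
    """|psi(theta)> = R_z(theta[1]) R_y(theta[0]) |0>."""
    half_polar, half_phase = theta[0] / 2, theta[1] / 2
    return np.array([np.cos(half_polar) * np.exp(-1j * half_phase),
                     np.sin(half_polar) * np.exp(1j * half_phase)])


def state_gradients(theta, step=1e-6):
    """Central finite differences for d|psi>/d theta_i (classical stand-in for circuits)."""
    grads = []
    for i in range(len(theta)):
        shift = np.zeros(len(theta))
        shift[i] = step
        grads.append((ansatz_state(theta + shift) - ansatz_state(theta - shift)) / (2 * step))
    return grads


def mclachlan_velocity(theta):
    """Solve M theta_dot = V with M_ij = Re<d_i psi|d_j psi>, V_i = Im<d_i psi|H|psi>."""
    psi = ansatz_state(theta)
    grads = state_gradients(theta)
    n = len(theta)
    M = np.array([[np.real(np.vdot(grads[i], grads[j])) for j in range(n)] for i in range(n)])
    V = np.array([np.imag(np.vdot(grads[i], H @ psi)) for i in range(n)])
    return np.linalg.lstsq(M, V, rcond=None)[0]


def evolve(theta0, total_time, steps):
    """Euler updates theta -> theta + theta_dot * dt."""
    theta = np.array(theta0, dtype=float)
    dt = total_time / steps
    for _ in range(steps):
        theta = theta + mclachlan_velocity(theta) * dt
    return theta


theta_initial = np.array([0.7, 0.2])
theta_final = evolve(theta_initial, total_time=1.0, steps=50)

exact_propagator = np.diag(np.exp(-1j * np.diag(H).real * 1.0))
exact_state = exact_propagator @ ansatz_state(theta_initial)
fidelity = abs(np.vdot(exact_state, ansatz_state(theta_final))) ** 2
print(theta_final, fidelity)
```

In this example the azimuthal parameter simply advances at rate $\omega$; for a generic ansatz the linear system couples all parameters and the result is only an approximation to the true dynamics.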

9. Subspace approach

The weighted subspace VQE [110] provides an alternative way to simulate dynamics in the subspace of the low energy eigenstates [121]. Here one uses the weighted subspace VQE unitary operator $U(\theta^*)$ that maps computational basis states $\{|\varphi_j\rangle\}$ to the low energy eigenstates $\{|E_j\rangle\}$ as $U(\theta^*)|\varphi_j\rangle \approx e^{i\delta_j}|E_j\rangle$, with $\delta_j$ an unknown phase. Considering the low energy subspace, the time evolution operator can be approximated as $\exp(-iHt) \approx U(\theta^*)T(t)U^\dagger(\theta^*)$ with $T(t) = \sum_j \exp(-iE_jt)|\varphi_j\rangle\langle\varphi_j|$. The procedure can intuitively be understood as, first, rotating the state to the computational basis with $U^\dagger(\theta^*)$; second, evolving the state with $T(t)$; and third, rotating the basis back with $U(\theta^*)$. Therefore, for any state $|\psi(0)\rangle = \sum_j \alpha_j|E_j\rangle$ that is a superposition of the low energy eigenstates, its time evolution can be simulated as $|\psi(t)\rangle = U(\theta^*)T(t)U^\dagger(\theta^*)|\psi(0)\rangle$. Since the time evolution is directly implemented via $T(t)$, it does not involve iterative parameter updates and the circuit depth is independent of the simulation time.
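The diagonalized form of the propagator is easy to verify classically. The sketch below assumes a random two-qubit Hamiltonian and uses exact diagonalization as a stand-in for the trained unitary $U(\theta^*)$ and the energies $E_j$. It compares $U T(t) U^\dagger$ against a truncated Taylor series for $\exp(-iHt)$ at short time, and checks that repeating a short step $M$ times only changes the diagonal phases.

```python
import numpy as np

rng = np.random.default_rng(seed=11)
G = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
H = (G + G.conj().T) / 4  # hypothetical two-qubit Hamiltonian

# Exact diagonalization stands in for the variationally trained U(theta*) and E_j.
energies, U = np.linalg.eigh(H)  # column j of U is |E_j>, i.e. U|j> = |E_j>


def diagonal_propagator(t):
    """T(t) = sum_j exp(-i E_j t) |j><j| in the computational basis."""
    return np.diag(np.exp(-1j * energies * t))


def fast_forwarded_evolution(t):
    """exp(-iHt) written as U T(t) U^dagger; the structure is the same for any t."""
    return U @ diagonal_propagator(t) @ U.conj().T


def taylor_propagator(t, terms=80):
    """Reference exp(-iHt) from a truncated Taylor series (only sensible for small ||H t||)."""
    result = np.eye(4, dtype=complex)
    term = np.eye(4, dtype=complex)
    for k in range(1, terms):
        term = term @ (-1j * H * t) / k
        result = result + term
    return result


short_time_ok = np.allclose(fast_forwarded_evolution(1.5), taylor_propagator(1.5), atol=1e-9)

dt, M = 0.1, 500
repeated_steps = np.linalg.matrix_power(fast_forwarded_evolution(dt), M)
long_time_ok = np.allclose(repeated_steps, fast_forwarded_evolution(M * dt), atol=1e-9)
print(short_time_ok, long_time_ok)
```

The second check is the classical counterpart of fast forwarding: evolving to time $M\,\delta t$ replaces $T(\delta t)$ by $T(\delta t)^M$ without repeating $U$ and $U^\dagger$.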

10. Variational fast forwarding

Similar to the subspace approach, variational fast forwarding [122, 123] simulates the time evolution operation $\exp(-iHt)$ as $U(\theta^*)T(E,t)U^\dagger(\theta^*)$, with $T(E,t) = \sum_j \exp(-iE_jt)|\psi_j\rangle\langle\psi_j|$ a trainable diagonal matrix and $U(\theta^*)$ a trainable unitary that maps between the eigenstates of $H$ and the computational basis. Whereas the subspace approach obtains $T(E,t)$ and $U(\theta^*)$ via weighted subspace VQE, variational fast forwarding optimises a cost given by the fidelity between $e^{-iH\delta t}$ and $U(\theta^*)T(E,\delta t)U^\dagger(\theta^*)$ for a small time step $\delta t$ via the so-called local Hilbert-Schmidt test [124]. Then, according to the Trotter-Suzuki product formula, one has $e^{-iHT} = (e^{-iH\delta t})^M \approx U(\theta^*)\,T(E,\delta t)^M\,U^\dagger(\theta^*)$ for a total time $T = M\delta t$. Again, since the time evolution is implemented in $T(E,t)$, one can simulate the evolution for arbitrary time $t$ with the same circuit structure. As shown in Ref. [125], the ensuing Trotter error of this approach can be removed by diagonalizing instead the Hamiltonian $H$ that generates the evolution.

11. Simulating open systems

The VQA framework can also be extended to simulate the dynamical evolution of open quantum systems. Suppose that the dynamics of the system is described by $\frac{d\rho}{dt} = \mathcal{L}(\rho)$, where $\mathcal{L}$ denotes a super-operator for a dissipative process. Similarly to the iterative approach for pure states [90], one maps the evolution of the mixed state to one of the variational parameters via McLachlan's principle, which solves the minimization $\min_{\dot{\theta}} \|(\frac{d}{dt} - \mathcal{L})\rho(\theta)\|$. The solution determines the evolution of the parameters $M \cdot \dot{\theta} = V$ with $M_{i,j} = \mathrm{Tr}\left[\partial_i\rho(\theta)^\dagger\,\partial_j\rho(\theta)\right]$, $V_i = \mathrm{Tr}\left[\partial_i\rho(\theta)^\dagger\,\mathcal{L}(\rho)\right]$, and $\partial_i\rho(\theta) = \frac{\partial\rho(\theta)}{\partial\theta_i}$. Each term of $M$ and $V$ can be computed by applying the SWAP test circuit on two copies of the purified states [91]. Here, to simulate an open system of $n$ qubits, one needs to apply operations on $4n+1$ qubits. An alternative approach [92] which reduces this overhead is to simulate the stochastic Schrödinger equation, which unravels the evolution of the density matrix into trajectories of pure states. Each pure state trajectory experiences a continuous damping effect and jump processes due to the noise operators, both of which can be efficiently simulated. Since in this method one only controls a single copy of the pure state, it only requires $n+1$ qubits.

FIG. 5. Quantum Approximate Optimization Algorithm (QAOA). a. Schematic representation of the Trotterized adiabatic transformation in the ansatz. The algorithm only loosely follows the evolution of the ground state of $H(t) = (1-t)H_M + tH_P$ for every $t \in [0, 1]$, as one is interested in making the final state close to the ground state of the problem Hamiltonian $H_P$, with $H_M$ being a mixer Hamiltonian. The free parameters $\{\beta_l\}_{l=1}^{p}$ and $\{\gamma_l\}_{l=1}^{p}$ are trained, with $p$ being the number of QAOA rounds. b. Problem Hamiltonian $H_P$ and graph $\langle jk\rangle$ for a Max-Cut task. Each node in the graph (circle) represents a spin. Edges connecting two nodes indicate an interaction $\sigma_j^z\sigma_k^z$ in $H_P$, with $\sigma_k^z$ the Pauli $z$ operator on spin $k$. The solution is encoded in the ground state of $H_P$, where some spins are pointing up (green) whereas others point down (blue).

B. Optimization

Thus far we have discussed using VQAs for tasks which are inherently quantum in nature, that is, finding ground states and simulating the evolution of quantum states. In this subsection we discuss a different possibility where one uses a VQA to solve a classical optimization problem [126].

The most famous VQA for quantum-enhanced optimization is the QAOA [23], originally introduced to approximately solve combinatorial problems such as Constraint-Satisfaction (SAT) [127] and Max-Cut problems [128].

Combinatorial optimization problems are defined on binary strings $s = (s_1, \cdots, s_N)$ with the task of maximizing a given classical objective function $L(s)$. QAOA encodes $L(s)$ in a quantum Hamiltonian $H_P$ by promoting each classical variable $s_j$ to a Pauli spin-1/2 operator $\sigma_j^z$, so that the goal is to prepare the ground state of $H_P$. Motivated by the quantum adiabatic algorithm, QAOA replaces adiabatic evolution with $p$ rounds of alternating time propagation between the problem Hamiltonian $H_P$ and an appropriately chosen mixer Hamiltonian $H_M$, see Fig. 5. As discussed in the subsection on quantum alternating operator ansatz, the evolution time intervals are treated as variational parameters and are optimized classically. Hence, defining $\theta = \{\gamma, \beta\}$, the cost function is $C(\gamma, \beta) = \langle\psi_p(\gamma,\beta)|H_P|\psi_p(\gamma,\beta)\rangle$ with

$$|\psi_p(\gamma,\beta)\rangle = e^{-i\beta_p H_M} e^{-i\gamma_p H_P} \cdots e^{-i\beta_1 H_M} e^{-i\gamma_1 H_P}|\psi_0\rangle, \qquad (8)$$

and where $|\psi_0\rangle$ is the ground state of $H_M$.

Finding optimal values $\gamma$ and $\beta$ is a hard problem since the optimization landscape in QAOA is non-convex with many local optima [129]. Hence, great efforts have been devoted to finding a good classical optimizer that would require as few calls to the quantum computer as possible. Gradient-based [130, 131], derivative-free [43, 132], and reinforcement learning [133] methods were investigated, and this still remains an active field to guarantee a good performance for the QAOA.
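The QAOA state of Eq. (8) can be simulated directly for a small instance. The following sketch is illustrative only and rests on several assumptions: Max-Cut on a four-node ring, $H_P = \sum_{\langle jk\rangle}\sigma_j^z\sigma_k^z$ (so that lower energy corresponds to more cut edges), a mixer $H_M = -\sum_j \sigma_j^x$ whose ground state is the uniform superposition, a single round $p = 1$, and a coarse grid search in place of a proper classical optimizer.

```python
import numpy as np

n = 4
edges = [(0, 1), (1, 2), (2, 3), (3, 0)]  # hypothetical Max-Cut instance: a 4-node ring

# H_P is diagonal in the computational basis; store its diagonal.
bits = (np.arange(2 ** n)[:, None] >> np.arange(n)) & 1
spins = 1 - 2 * bits  # eigenvalues of sigma^z for each basis state
hp_diag = sum(spins[:, j] * spins[:, k] for j, k in edges)


def mixer_unitary(beta):
    """exp(-i beta H_M) for H_M = -sum_j X_j, a tensor product of exp(+i beta X)."""
    single = np.array([[np.cos(beta), 1j * np.sin(beta)],
                       [1j * np.sin(beta), np.cos(beta)]])
    full = np.array([[1.0 + 0j]])
    for _ in range(n):
        full = np.kron(full, single)
    return full


def qaoa_state(gammas, betas):
    """Apply Eq. (8): each round is exp(-i gamma H_P) followed by exp(-i beta H_M)."""
    state = np.full(2 ** n, 2 ** (-n / 2), dtype=complex)  # ground state of H_M
    for gamma, beta in zip(gammas, betas):
        state = mixer_unitary(beta) @ (np.exp(-1j * gamma * hp_diag) * state)
    return state


def qaoa_cost(gammas, betas):
    state = qaoa_state(gammas, betas)
    return float(np.real(np.vdot(state, hp_diag * state)))


grid = np.linspace(0, np.pi, 41)
best_cost, best_gamma, best_beta = min(
    (qaoa_cost([g], [b]), g, b) for g in grid for b in grid
)
print(best_cost, best_gamma, best_beta, hp_diag.min())
```

With a single round the optimized cost lies well below the value of the initial state (zero for this instance) but does not reach the true minimum of $H_P$; increasing $p$ adds parameters and, in general, improves the attainable approximation.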

C. Mathematical applications

Several VQAs have been proposed to tackle relevant mathematical problems such as solving linear systems of equations or integer factorization. Since in many cases there exist quantum algorithms for the fault-tolerant era aimed at these tasks, the goal of VQAs is to have heuristic scalings comparable to the provable scaling of these non-near-term algorithms while keeping the algorithm requirements compatible with the NISQ era.

1. Linear systems

Solving systems of linear equations has wide-ranging applications in science and engineering. Quantum computers offer the possibility of exponential speedup for this task. Specifically, for an $N \times N$ linear system $Ax = b$ (with $A$ an $N \times N$ matrix, and $b$ an $N \times 1$ column vector defined from the linear systems problem), one considers the Quantum Linear Systems Problem (QLSP), where the task is to prepare a normalized state $|x\rangle$ such that $A|x\rangle \propto |b\rangle$, where $|b\rangle = b/\|b\|$ is also a normalized state. The classical algorithmic complexity for this task scales polynomially in the dimension $N$, whereas the now-famous Harrow–Hassidim–Lloyd (HHL) quantum algorithm [3] has a complexity that scales logarithmically in $N$, with some scaling improvements having been proposed [134–137]. These pioneering quantum algorithms, however, will be difficult to implement in the near-term due to the enormous circuit depth requirements [138].

This situation has motivated VQAs for the QLSP [139–141]. A common feature in these algorithms is the assumption that $A = \sum_k c_k A_k$ is given as a linear combination of unitaries $A_k$ that can be efficiently implemented, weighted by real coefficients $c_k$. One can then construct a Hamiltonian whose ground state is the solution to the QLSP and apply a variational approach to minimize the cost $C(\theta) = \langle\psi(\theta)|H_G|\psi(\theta)\rangle$. Refs. [139–141] considered the Hamiltonian $H_G = A^\dagger(\mathbb{1} - |b\rangle\langle b|)A$ (which was also considered outside of the variational setting [142]). The aforementioned cost can have gradients that vanish exponentially in the number of qubits $n$ (that is, a so-called barren plateau in the cost landscape). This problem can be mitigated by considering a local Hamiltonian with the same ground state [139] or by using a hybrid ansatz strategy [141] where $|\psi(\theta)\rangle = \sum_i \alpha_i|\psi_i(\theta_1)\rangle$ with $\alpha_i$ being variational parameters. A study [139] was conducted with $n = 10, \ldots, 30$ qubits for Ising-inspired linear systems and with $n = 2, \ldots, 7$ qubit random (sparse) linear systems. The study showed that the time to solution scales logarithmically in $N$ (and also efficiently in condition number and solution precision) for these problems. Provided that larger systems display similar behavior, the observed heuristic scaling suggests that VQAs could potentially give an exponential speedup, analogous to HHL, for the QLSP.
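The relation between the QLSP and a ground state problem can be checked on a small instance. The sketch below assumes a hypothetical two-qubit matrix $A$ built as a real linear combination of three unitaries and $|b\rangle = |{+}{+}\rangle$. It uses the ordering $H_G = A^\dagger(\mathbb{1} - |b\rangle\langle b|)A$, for which $\langle x|H_G|x\rangle = 0$ whenever $A|x\rangle \propto |b\rangle$, and replaces the variational minimization by exact diagonalization.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
iY = np.array([[0, 1], [-1, 0]], dtype=complex)  # the unitary i*Y

# A = sum_k c_k A_k with real c_k and unitary A_k (hypothetical, non-Hermitian example).
A = 1.0 * np.kron(I2, I2) + 0.4 * np.kron(X, Z) + 0.3 * np.kron(iY, I2)

b = np.full(4, 0.5, dtype=complex)  # normalized |b> = |++>
projector_b = np.outer(b, b.conj())

# Hamiltonian whose zero-energy ground state solves A|x> proportional to |b>.
H_G = A.conj().T @ (np.eye(4) - projector_b) @ A


def cost(psi):
    """C = <psi|H_G|psi> for a normalized trial state."""
    return float(np.real(np.vdot(psi, H_G @ psi)))


# Classical reference solution, normalized.
x = np.linalg.solve(A, b)
x /= np.linalg.norm(x)

energies, states = np.linalg.eigh(H_G)
fidelity = abs(np.vdot(x, states[:, 0])) ** 2
print(cost(x), energies[0], fidelity)
```

The cost vanishes exactly at the normalized solution, which is also the unique ground state of $H_G$ when $A$ is invertible; a VQA would search for this state with a parametrized circuit instead of diagonalizing $H_G$.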

2. Matrix-vector multiplication

Another related problem is matrix-vector multiplication, that is, to prepare a normalized state $|x\rangle$ such that $|x\rangle \propto A|b\rangle$ with normalized vector $|b\rangle$. When $A = \mathbb{1} - iH\delta t$, the problem becomes the task of Hamiltonian simulation. Similar to solving the QLSP, one constructs the Hamiltonian $H_M = \mathbb{1} - A|b\rangle\langle b|A^\dagger/\|A|b\rangle\|^2$, whose ground state is $|x\rangle$ with zero energy [140]. Here $\|A|b\rangle\| = \sqrt{\langle b|A^\dagger A|b\rangle}$ is the Euclidean norm. Given an approximate solution $|\psi(\theta^*)\rangle$, one can lower bound the fidelity to the exact solution as $|\langle\psi(\theta^*)|x\rangle|^2 \geq 1 - \langle\psi(\theta^*)|H_M|\psi(\theta^*)\rangle$, thus verifying the solution's correctness whenever the cost function is small.

3. Non-linear equations

Non-linear equations are important to various fields, especially in the form of non-linear partial differential equations. However, mapping such equations onto quantum computers requires careful thought since the underlying mathematics of quantum mechanics is linear. To address this, a VQA for such non-linear problems was proposed in Ref. [143]. The approach was illustrated for the time-independent non-linear Schrödinger equation, where the cost function is the total energy (sum of potential, kinetic, and interaction energies), and where the space was discretized into a finite grid. By using multiple copies of variational quantum states in the cost-evaluation circuit, this VQA can compute non-linear functions.

An alternative approach has been proposed for non-linear differential equations that is based on using a set of basis functions rather than a finite grid [144]. First, the basis functions are encoded as non-linear feature maps (state preparation unitaries that are a function of the variables from the system). Next, a parameterized ansatz prepares a state that represents a linear combination of these basis functions. The corresponding function value is then output as an expectation value of an operator. Additionally, derivatives of this function are computed with the parameter shift rule. This method then optimizes a cost function that is minimized when the non-linear differential equation of interest is satisfied at a chosen set of points.

4. Factoring

Large-scale implementations of Shor's algorithm are not possible in the near term. Hence, a VQA for factoring as a potential near-term alternative was introduced in Ref. [145]. This proposal relies on the fact that factoring can be formulated as an optimization problem, and in particular, as a ground state problem for a classical Ising model. The authors used the QAOA to variationally search for the ground state. Their numerical heuristics suggest that a linear number of layers in the ansatz ($p \in O(n)$) leads to a large overlap with the ground state.

5. Principal Component Analysis

An important primitive in data science is reducing the dimensionality of data with Principal Component Analysis (PCA). This involves diagonalizing the covariance matrix for a data set and selecting the eigenvectors with the largest eigenvalues as the key features of the data. Because the covariance matrix is positive semi-definite, one can store it in a density matrix, that is, in a quantum state, and then any diagonalization method for quantum states can be used for PCA. This idea was exploited in Ref. [146] to propose a quantum algorithm for PCA. However, quantum phase estimation and density matrix exponentiation were subroutines in this algorithm, making it non-implementable in the NISQ era. To potentially make this application more near-term, Ref. [147] proposed a variational quantum state diagonalization algorithm, where the cost function $C(\theta)$ quantifies the Hilbert-Schmidt distance between the state $\rho(\theta) = U(\theta)\rho U(\theta)^\dagger$ and $\mathcal{Z}(\rho(\theta))$, and where $\mathcal{Z}$ is the dephasing channel. This VQA outputs estimates of all the eigenvalues and eigenvectors of $\rho$, but it comes at the cost of requiring $2n$ qubits for an $n$ qubit state. This qubit requirement can be reduced with the VQA of Ref. [113], which requires only $n$ qubits. Here one exploits the connection between diagonalization and majorization to define a cost function of the form $C(\theta) = \mathrm{Tr}[\rho(\theta)H]$, where $H$ is a non-degenerate Hamiltonian. Due to Schur concavity, this cost function is minimized when $\rho(\theta)$ is diagonalized.
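The diagonalization cost can be made concrete with a short classical sketch. It assumes synthetic four-feature data with a hand-picked mixing matrix, normalizes the covariance matrix to unit trace so that it is a valid density matrix, and evaluates the Hilbert-Schmidt distance $\mathrm{Tr}[(\rho(\theta) - \mathcal{Z}(\rho(\theta)))^2]$ for two candidate unitaries. Exact diagonalization supplies the optimal unitary here in place of variational training.

```python
import numpy as np

rng = np.random.default_rng(seed=5)

# Synthetic data with correlated features (the mixing matrix is a made-up example).
mixing = np.array([[1.0, 0.5, 0.0, 0.0],
                   [0.0, 1.0, 0.5, 0.0],
                   [0.0, 0.0, 1.0, 0.5],
                   [0.0, 0.0, 0.0, 1.0]])
data = rng.normal(size=(200, 4)) @ mixing
data = data - data.mean(axis=0)

covariance = data.T @ data / len(data)
rho = covariance / np.trace(covariance)  # positive semi-definite with unit trace


def dephase(state):
    """Dephasing channel Z: keep only the diagonal in the computational basis."""
    return np.diag(np.diag(state))


def diagonalization_cost(U):
    """Hilbert-Schmidt distance between rho(theta) = U rho U^dagger and its dephased version."""
    rho_theta = U @ rho @ U.conj().T
    difference = rho_theta - dephase(rho_theta)
    return float(np.real(np.trace(difference @ difference)))


eigenvalues, eigenvectors = np.linalg.eigh(rho)
U_star = eigenvectors.conj().T  # unitary that rotates the eigenbasis onto the computational basis

cost_identity = diagonalization_cost(np.eye(4))
cost_optimal = diagonalization_cost(U_star)
recovered_spectrum = np.sort(np.real(np.diag(U_star @ rho @ U_star.conj().T)))
print(cost_identity, cost_optimal, recovered_spectrum)
```

At the optimum the cost vanishes and the diagonal of $\rho(\theta)$ gives the eigenvalues, whose largest entries identify the principal components.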

D. Compilation and unsampling

A natural task that NISQ devices can potentially accelerate is the compiling of quantum programs. In quantum compiling, the goal is to transform a given unitary $V$ into a native gate sequence $U(\theta)$ with an optimally short circuit depth. Quantum compiling plays a major role in error mitigation, as errors increase with circuit depth. Quantum compiling is a challenging problem for classical computers to perform optimally, due to the exponential complexity of classically simulating quantum dynamics. Hence, several VQAs have been introduced that can potentially be used to accelerate this task [124, 148–151]. These algorithms can be categorized as either Full Unitary Matrix Compiling (FUMC) or Fixed Input State Compiling (FISC), which respectively aim to compile the target unitary $V$ over all input states or for a particular input state. In Ref. [124] a VQA for FUMC was presented, which uses cost functions closely related to entanglement fidelities to quantify the distance between $V$ and $U(\theta)$. The proposal in Ref. [148] also treats the FUMC case, but with an alternative approach to quantifying the cost using the average gate fidelity, averaged over many input and output states. The FISC case was treated in Ref. [149], where the problem was reformulated as a ground state energy task, hence making the connection with VQE. The connection with VQE was also generalized to FUMC [150], showing that variational quantum compiling, in general, is a special kind of VQE problem. Ref. [151] introduced and experimentally implemented a compiling scheme which can be thought of as FISC, although the architecture here is focused on the application of unpreparing a quantum state. Finally, it is worth noting that both FUMC and FISC exhibit resilience to hardware noise, in that the global minimum of the cost landscape is unaffected by various types of noise [150]. This noise resilience feature is crucial for the utility of variational quantum compiling for error mitigation, and we discuss this in more detail later.

E. Error correction

Quantum Error Correction (QEC) protects qubits from hardware noise. Due to the large qubit requirements of QEC schemes, their implementation is beyond NISQ device capabilities. Nevertheless, QEC could still benefit NISQ hardware by suppressing the error to a certain extent and by combining it with other error mitigation methods. Specifically, conventional universal approaches for implementing QEC codes generally involve an unnecessarily long circuit that does not take into account the hardware structure or the type of noise. Hence, two VQAs have been introduced to solve these problems by automatically discovering or compiling a small quantum error-correcting code for any quantum hardware and any noise.

The Variational Quantum Error Corrector (QVECTOR) was first proposed to discover a device-tailored quantum error-correcting code for a quantum memory [152]. For any $k$-qubit input state $|\psi\rangle = U_S|0\rangle$, prepared by a unitary $U_S$ acting on a reference state $|0\rangle$, QVECTOR considers two parametrized circuits $V(\theta_1)$ (on $n \geq k$ qubits) and $W(\theta_2)$ (on $n + r$ qubits), which respectively encode the input logical state into $n$ qubits with $n - k$ ancillary qubits and realize recovery operations with $r$ ancillary qubits. By sequentially applying encoding, recovery, and decoding on the input state, one obtains an output $\rho_{\mathrm{out}} = W(\theta_2)V(\theta_1)(\psi \otimes |0\rangle\langle 0|^{\otimes n-k+r})V(\theta_1)^\dagger W(\theta_2)^\dagger$. Projecting the $n - k$ ancillary qubits back to $|0\rangle\langle 0|$ and discarding the last $r$ ancillary qubits, one finds a quantum channel $\rho = \mathcal{E}(\theta_1, \theta_2)(\psi)$ on the input state $\psi$. The target of QVECTOR is to maximize the fidelity $\int_\psi d\psi\, F(\psi, \mathcal{E}(\theta_1, \theta_2)(\psi))$ between the output $\rho$ and the input $\psi$, averaged over all $\psi$ or any $U_S$ that forms a unitary 2-design. The solution will give the quantum circuit that maximally protects the input state. Numerical simulations showed that QVECTOR can find quantum codes that outperform existing ones [152].

Instead of discovering new device-tailored QEC codes, Ref. [153] considered how to compile conventional QEC codes into a given quantum hardware with specific noise. Suppose one aims to implement the logical state $|\psi\rangle_L = \alpha|0\rangle_L + \beta|1\rangle_L$ with logical state basis $\{|0\rangle_L, |1\rangle_L\}$. Note that $|\psi\rangle_L$ is a $+1$ eigenstate of the stabilizers $G_k$ as well as of the logical operator $P = |\psi\rangle_L\langle\psi|_L - |\psi^\perp\rangle_L\langle\psi^\perp|_L$ with orthogonal state $|\psi^\perp\rangle_L$. Then one can construct a frustration-free Hamiltonian $H = -a_0 P - \sum_{k\geq 1} a_k G_k$ with positive coefficients $a_0, a_k$, and with $|\psi\rangle_L$ the ground state with energy $E_G = -(a_0 + \sum_{k\geq 1} a_k)$. One then uses a VQA to discover the circuit that implements $|\psi\rangle_L$ with a given hardware structure. Since the eigenstate energies are known, the fidelity, $F$, of the discovered state can be bounded by $F \geq 1 - (E - E_G)/a$ with the discovered energy $E$ and $a = \min\{a_0, a_k\}$. Numerical studies demonstrated encoding circuits for the five- and seven-qubit codes on different noisy hardware [153].

F. Machine learning and data science

Quantum machine learning (QML) generally refers to the tasks of using a quantum computer to learn patterns in quantum data with the goal of making accurate predictions on unknown and unseen data [154]. Although an in-depth overview of QML is beyond the scope of this Review, we present several QML applications for which the VQA framework can be readily implemented. Specifically, here one learns a parametrized quantum circuit to solve a given task [80, 155]. This connection between VQAs and (typical) QML applications shows that the lessons learned in one field can be of great use in the other, hence providing a close connection between these two fields.

1. Classifiers

The classification of data is a ubiquitous task in machine learning. Given training data of the form $\{x^{(i)}, y^{(i)}\}$, where $x^{(i)}$ are inputs and $y^{(i)}$ labels, the goal is to train a classifier to accurately predict the label of each input. Since a key aspect for the success of classical neural networks is their non-linearity, one can expect this property to also arise in a quantum classifier. As shown in Ref. [156], parametrized quantum circuits can support linear transformations, and non-linearity can be exploited from the tensor product structure of a quantum system. More precisely, defining an input data dependent unitary $V(x)$, the tensor product $V(x) \otimes V'(x)$ or the multiplication $V(x)V'(x)$ results in a non-linear function of the input data $x$. In this sense, the unitary $V(x)$ can be used as a quantum non-linear feature map, where the Hilbert space can be exploited for a feature space [157, 158]. Interestingly, the tensor network structure of quantum mechanics has even inspired classical machine learning methods [159].

Here, after embedding the input data $x$ into the quantum state, a linear transformation is performed using a parametrized quantum circuit, $U(\theta)V(x)|\psi_0\rangle$. The cost function is then defined as the error between the true label and the expectation value of an easily measurable observable $A$, that is, $C(\theta) = \sum_i \left[y^{(i)} - \langle\psi_0|V^\dagger(x^{(i)})U^\dagger(\theta)AU(\theta)V(x^{(i)})|\psi_0\rangle\right]^2$. This approach has been used in generalization and in classification tasks [80, 156], with Refs. [76, 80, 160, 161] discussing different ways of embedding classical data into quantum states (such as data re-uploading), and with Ref. [158] showing an experimental demonstration of variational classification.

Moreover, as shown in Refs. [157, 158], instead of using a parameterized unitary $U(\theta)$ one can use products of quantum feature vectors $\langle\psi_0|V^\dagger(x')V(x)|\psi_0\rangle$ to perform a kernel method. Finally, the quantum kernel trick, which means that the dimensions of the quantum-enhanced feature space are larger than the number of data sets, has been demonstrated experimentally by using an ensemble of nuclear spins [162].

2. Autoencoders

The autoencoder for data compression is an important primitive in machine learning. The idea is to force information through a bottleneck while still maintaining the recoverability of the data. As a quantum analog, Ref. [163] introduced a VQA for quantum autoencoding, with the goal of compressing quantum data (see Refs. [164, 165] for alternative approaches to quantum autoencoders). The input to the algorithm is an ensemble of pure quantum states $\{p_\mu, |\psi_\mu\rangle\}$ on a bipartite system $AB$ (here $p_\mu$ are real and positive coefficients such that $\sum_\mu p_\mu = 1$). The goal is then to train an ansatz $U(\theta)$ to compress this ensemble into the $A$ subsystem, such that one can recover each state $|\psi_\mu\rangle$ with high fidelity from subsystem $A$. The $B$ subsystem is discarded and hence can be thought of as the 'trash'. Given the close connection between data compression and decoupling [163], the cost function is based on the overlap between the output state on $B$ and a fixed pure state. Recently, a local version of this cost function was also proposed and was shown to train well for large-scale problems [166]. Moreover, in Ref. [167], the autoencoder scheme was generalized to mixed states and a noise-assisted algorithm was provided to improve the recovering fidelity for mixed/pure states. Quantum autoencoders have seen experimental implementation on quantum hardware [168], and will likely be an important primitive in QML.

3. Generative models

The idea of training a parameterized quantum circuit for a QML implementation can also be applied to a generative model [169–171], which is an unsupervised statistical learning task with the goal of learning a probability distribution that generates a given data set. Let $\{x^{(i)}\}_{i=1}^{D}$ be a data set of size D sampled from a probability distribution q(x). Here one learns q(x) as the parameterized probability distribution $p_\theta(x) = |\langle x|U(\theta)|\psi_0\rangle|^2$ obtained by applying U(θ) to an input state $|\psi_0\rangle$ and measuring in the computational basis; that is, the model corresponds to a quantum circuit Born machine [170]. In principle one wishes to minimize the difference between the two distributions. However, since q(x) is not available, the cost function is defined by the negative log-likelihood $C(\theta) = -\frac{1}{D}\sum_{i=1}^{D} \log\big(p_\theta(x^{(i)})\big)$. In Ref. [170] a variational framework for training quantum circuit Born machines was introduced and demonstrated for both classical data, such as the bars-and-stripes data set, and for synthetic data sets related to the preparation of cat states and coherent thermal states. In Ref. [172], an implicit generative model has been constructed by comparing the distance in the Gaussian kernel feature space. The representation power of the generative model has been investigated in Ref. [171]. Finally, it has been shown that quantum circuit Born machines can simulate the restricted Boltzmann machine and perform a sampling task that is hard for a classical computer [173].
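As a minimal sketch of the negative log-likelihood cost, consider a toy one-qubit Born machine in which U(θ) is a single RY rotation acting on |0〉. The circuit, the data set, and the grid search that stands in for the classical optimizer are all assumptions made for illustration; they are not taken from Ref. [170].

```python
import math

def born_probabilities(theta):
    """Born-rule distribution p_theta(x) for the toy circuit RY(theta)|0>."""
    amplitudes = [math.cos(theta / 2.0), math.sin(theta / 2.0)]
    return [abs(a) ** 2 for a in amplitudes]

def negative_log_likelihood(theta, data):
    """C(theta) = -(1/D) * sum_i log p_theta(x_i) over the data set."""
    probs = born_probabilities(theta)
    return -sum(math.log(probs[x]) for x in data) / len(data)

# Hypothetical data set of size D = 100: 75% zeros and 25% ones.
data = [0, 0, 0, 1] * 25

# A coarse grid over the open interval (0, pi) stands in for the optimizer;
# the end points are excluded so that log(0) never occurs.
grid = [k * math.pi / 1000 for k in range(1, 1000)]
best_theta = min(grid, key=lambda t: negative_log_likelihood(t, data))
learned = born_probabilities(best_theta)
```

In this toy setting the minimizer sits near θ = π/3, where the model assigns probability close to 0.25 to the outcome 1, matching the empirical frequencies. On hardware, p_θ(x) would itself have to be estimated from samples, which this sketch ignores.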

4. Variational Quantum Generators

Generative Adversarial Networks (GANs) use two neural networks, a generator and a discriminator, in competition. The generator aims to convince the discriminator that its output is coming from the true distribution associated with the training data. GANs play an important role in classical machine learning for applications such as image synthesis and molecular discovery. Ref. [174] proposed a VQA for learning continuous distributions which is meant to be a quantum version of GANs. Here one still considers classical data, but encoded into a quantum circuit. This encoding is followed by a variational quantum circuit that generates quantum states, which are then measured to produce a fake sample. This fake sample then enters either a classical discriminator or a quantum discriminator, and the cost function is optimized to minimize the discrimination probability with respect to real samples. The target application is to accelerate classical GANs using quantum computers.

5. Quantum Neural Network architectures

Several Quantum Neural Network (QNN) architectures have been proposed; for instance, Refs. [155, 175, 176] proposed perceptron-based QNNs. In these architectures each node in the neural network represents a qubit, and their connections are given by parameterized unitaries of the form in Eq. (4) acting on the input states. In addition, Ref. [177] introduced Quantum Convolutional Neural Networks (QCNNs). QCNNs have been used for error correction [177], image recognition [178], and to discriminate quantum states belonging to different topological phases [177]. Moreover, it has been shown that QCNNs and QNNs with tree tensor network architectures do not exhibit barren plateaus [179, 180] (which will be discussed later), potentially making them generically trainable architectures for large-scale implementations.

G. New frontiers

In this section we discuss some exciting, recently proposed applications of VQAs. These applications highlight the fact that VQAs could be used to understand and exploit the mathematical and physical structure of quantum states, and quantum theory in general.

1. Quantum foundations

NISQ computers will likely play an important role in understanding the foundations of quantum mechanics. In a sense, these devices offer experimental platforms to test foundational ideas ranging from quantum gravity to quantum Darwinism [181]. For example, the emergence of classicality in quantum systems will soon be a computationally tractable field of study due to the increasing size of NISQ computers. Along these lines, Ref. [182] proposed the Variational Consistent Histories (VCH) algorithm. Consistent Histories is a formal approach to quantum mechanics that has proven to be useful in studying the quantum-to-classical transition and quantum cosmology. In this formalism, interference between different paths (histories) is quantified in the decoherence functional [183]. The exponential number of terms in this decoherence functional makes the formalism computationally expensive on classical devices. VCH provides a way to prepare a density matrix representation of the entire functional, allowing one to efficiently examine the consistency of a set of histories. The application of standard VQAs to foundational situations can also provide a framework for new insights. For example, Ref. [184] showed that a Full Unitary Matrix Compiling (FUMC) strategy (discussed above) cannot efficiently learn a scrambling unitary. This result provides insight into the black hole information paradox as one would need to have a representation of a black hole’s scrambling unitary in order to unscramble information from emitted Hawking radiation [185].

2. Quantum information theory

Another field that will likely see renewed interest due to NISQ computers is quantum information theory [186]. For example, in Ref. [163] it was remarked that the quantum autoencoder algorithm could potentially be used to learn encodings and achievable rates for quantum channel transmission. Another area of research is using NISQ computers to compute key quantities in quantum information theory, such as the von Neumann entropy or distinguishability measures such as the trace distance. Although it is known that these problems are hard for general quantum states [187], Ref. [188] introduced a VQA to estimate the quantum fidelity between an arbitrary state σ and a low-rank state ρ. Moreover, in Ref. [72] a VQA was introduced to learn modular Hamiltonians, which provides an upper bound on the von Neumann entropy of a quantum state. Here one attempts to variationally decorrelate a quantum state by minimizing the relative entropy to a product distribution, and hence this method is suited for states that can be easily decorrelated.

3. Entanglement Spectroscopy

Characterizing entanglement is crucial for understanding condensed matter systems, and the entanglement spectrum has proven to be useful in studying topological order. Several VQAs have been introduced to extract the entanglement spectrum of a quantum state [113, 147, 189]. Since the entanglement spectrum can be viewed as the principal components of a reduced density matrix, algorithms for PCA can be used for this purpose, including the VQAs discussed before. In addition, one can also use the variational algorithm for quantum singular value decomposition introduced in Ref. [189]. These algorithms could potentially characterize the entanglement (and, for example, topological order) in a ground state that was prepared by VQE, and hence different VQAs can be used together in a complementary manner.

4. Quantum metrology

Quantum metrology is a field where one seeks the optimal setup for probing a parameter of interest (for example a magnetic field) with minimal shot noise. In the absence of noise during the probing process, the analytical solution for the optimal probe state can be derived. However, when general physical noise is present, an analytical solution is hard to find. Variational-state quantum metrology variationally searches for the optimal probe state [190–193]. For state preparation, variational quantum circuits are used in Refs. [190, 192, 193] whereas optical tweezer arrays are considered in Ref. [191]. More concretely, one prepares a probe state with variational parameters, probes the magnetic field in the presence of physical noise, measures the quantum Fisher information (QFI) as a cost function, and updates the parameters to maximize it. Note that since the QFI cannot be efficiently computed, an approximation of the QFI can be heuristically found by optimizing the measurement basis, or by computing upper and lower bounds on the QFI [193].

IV. CHALLENGES AND POTENTIAL SOLUTIONS

Despite the tremendous developments in the field of VQAs, there are still many challenges that need to be addressed to maintain the hope of achieving quantum speedups when scaling up these near-term architectures. Understanding the limitations of VQAs is crucial to developing strategies that can be used to construct better algorithms, prove certain guarantees on their performance, and even to build better quantum hardware.

A. Trainability

1. Barren plateaus

The so-called barren plateau (BP) phenomenon in the cost function landscape has received considerable attention as one of the main bottlenecks for VQAs. When a given cost function C(θ) exhibits a BP, the magnitude of its partial derivatives will be, on average, exponentially vanishing with the system size [194]. As shown in Fig. 6, this has the effect of the landscape being essentially flat. Hence, in a BP one needs an exponentially large precision to resolve against finite sampling noise and determine a cost-minimizing direction, with this being valid independently of using a gradient-based [84] or gradient-free optimization method [195]. The exponential scaling in the precision due to BPs could erase a potential quantum advantage with a VQA, as its complexity would be comparable to the exponential scaling typically associated with classical algorithms. Hence, analyzing the existence of BPs in a given VQA is fundamental to preserve the hope of using it to achieve a quantum advantage.
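In formal terms, the statement that partial derivatives vanish exponentially on average is commonly phrased as a bound on their variance over the parameter distribution. The following is a sketch in the notation of this section; the bounding function F(n), the base b, the parameter index k, and the threshold c are symbols introduced here for illustration.

```latex
\mathrm{E}_{\theta}\!\left[\partial_k C(\theta)\right] = 0,
\qquad
\mathrm{Var}_{\theta}\!\left[\partial_k C(\theta)\right] \le F(n),
\qquad
F(n) \in \mathcal{O}\!\left(b^{-n}\right) \ \text{for some } b > 1 .
```

Chebyshev's inequality then bounds the probability of finding a partial derivative of appreciable size:

```latex
\Pr\!\left( \left| \partial_k C(\theta) \right| \ge c \right)
\le \frac{\mathrm{Var}_{\theta}\!\left[\partial_k C(\theta)\right]}{c^{2}}
\le \frac{F(n)}{c^{2}}
\qquad \text{for any } c > 0 .
```

Resolving a gradient component of typical magnitude against shot noise therefore requires a number of measurements that grows exponentially with n.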

FIG. 6. Barren plateau (BP) phenomenon. a. Variance of the cost function partial derivative, $\mathrm{Var}(\partial_{\theta_{1,1}} E)$, for a particular parameter $\theta_{1,1}$ in the ansatz versus number of qubits (n). Results were obtained from a Variational Quantum Eigensolver implementation with a deep unstructured ansatz. The y-axis is on a log scale. As the number of qubits increases, the variance vanishes exponentially with the system size. b. Visualization of the landscape of a global cost function which exhibits a BP for the quantum compilation implementation. The orange (blue) landscape was obtained for n = 24 (n = 4) qubits. As the number of qubits increases, the landscape becomes flatter. Moreover, this cost also exhibits the narrow gorge phenomenon [166], where the volume of parameters leading to small cost values shrinks exponentially with n. Panel a is adapted from Ref. [194], CC BY 4.0; Panel b is adapted from Ref. [166], CC BY 4.0.

The phenomenon of BPs was originally discovered in Ref. [194] where it was shown that deep unstructured parametrized quantum circuits exhibit BPs when randomly initialized. The proof of this result relies on the fact that these unstructured ansatzes become 2-designs when their depth grows polynomially with the number of qubits n [196, 197]. One can view this phenomenon as stemming from the fact that the ansatz is problem-agnostic and hence needs to explore an exponentially large space to find the solution. Therefore, the probability of finding the solution when randomly initializing the ansatz is exponentially small.

The analysis of BPs was extended to shallow random layered ansatzes [166] where it was shown that the BP phenomenon is cost-function dependent: Global cost functions (when one compares operators or states living in exponentially large Hilbert spaces) exhibit BPs, whereas local cost functions (when one compares objects at the single-qubit level) exhibit gradients which vanish polynomially in n so long as the circuit depth is at most logarithmic in n. This implies a connection between locality and trainability and informs our intuition as to what types of cost functions might have to be avoided. These results have been numerically verified in Ref. [198] and further extended in Ref. [199]. Here one can understand the presence of BPs for global costs as stemming from the fact that the Hilbert space grows exponentially with n and hence the probability of two objects being close is exponentially small.

In addition, it has been shown that BPs can arise in more general problems such as learning a scrambler [184], where any choice of variational ansatz will lead, with high probability, to a BP. This again is due to the intrinsic randomness in a scrambler. In addition, the BP phenomenon has been studied when training randomly initialized perceptron-based quantum neural networks [200, 201]. Here, BPs arise from the significant amount of entanglement created by the perceptrons connecting a large number of qubits in visible and hidden layers. Specifically, when tracing out the qubits in the hidden layers, the state of the visible qubits becomes exponentially close to being maximally mixed (due to concentration of measure), which makes it difficult to extract information from such a state.

Although previous results rely on the randomness of the ansatz, there is a conceptually different phenomenon that can lead to BPs. Recently, it was shown in Ref. [202] that noise can induce barren plateaus, regardless of the ansatz employed. Here, the presence of noise acting throughout the circuit progressively corrupts the state towards the fixed point of the noise model, usually the maximally mixed state [203]. Such a phenomenon was shown to arise when the circuit depth needs to be linear (or larger) in the system size, meaning that it will affect many widely-used ansatzes.

2. Ansatz and initialization strategies

Hand-in-hand with the theoretical progress on the analysis of the BP phenomenon, great effort has been dedicated to avoiding or mitigating the effect of BPs. The main strategy here has been to break the assumptions leading to BPs. In what follows we present two main approaches: parameter initialization and choice of ansatz.

• Parameter initialization. Randomly initializing an ansatz can lead to the algorithm starting far from the solution, near a local minimum, or even in a region with barren plateaus. Hence, optimally choosing the seed for θ at the beginning of the optimization is an important task. The importance of parameter initialization was made clear in Ref. [204] where it was noted that the optimal parameters in QAOA exhibit persistent patterns. Based on these observations, initialization strategies were proposed which were heuristically shown to outperform randomly initialized optimizations. Additionally, in Refs. [29, 205] an initialization strategy was developed specifically to address BPs in deep circuits. Here, one selects at random a subset of parameters in θ, and chooses the value of the remaining ones so that the circuit is a sequence of shallow unitaries that evaluates to the identity. The main idea behind this method is to reduce the randomness and depth of the circuit to break the assumption that the circuit approximates a 2-design, a condition necessary for BPs to arise in deep ansatzes. Similar to the previous method, other schemes have been introduced to prevent BPs by restricting the randomization of the ansatz. For instance, the proposal in Ref. [206] showed that correlating the parameters in the ansatz effectively reduces the dimension of the hyperparameter space and can lead to large cost function gradients. In addition, Ref. [207] introduced a method where one uses layer-by-layer training: one initially trains shallow circuits and progressively adds components to the circuit. Whereas the latter guarantees that the number of parameters and randomness remains small for the first steps of the training, it has been shown [208] that this method can lead to an abrupt transition in the ability of quantum circuits to be trained. Finally, a method was introduced in Ref. [209] where one pre-trains the parameters in the quantum circuits by using classical neural networks.

• Ansatz strategies. Another strategy for preventing BPs is using structured ansatzes which are problem-inspired. The goal here is to restrict the space explored by the ansatz during the optimization. As discussed in the section on ansatzes, the UCC ansatz for VQE or the quantum alternating operator ansatz [23, 24] for optimization are problem-inspired ansatzes which are usually trainable even when randomly initialized. Other ansatz strategies include the proposals in Ref. [209] for learning a mixed state, where one leverages knowledge of the target Hamiltonian to create a Hamiltonian variational ansatz. In addition, Refs. [60, 61] presented an approach where the ansatz for the solution is $|\psi(\{c_\mu\})\rangle = \sum_\mu c_\mu |\psi_\mu\rangle$, for a fixed set of states $\{|\psi_\mu\rangle\}$ determined by the problem at hand. Here the optimization over the coefficients $\{c_\mu\}$ can be solved using a quadratically constrained quadratic program.

Finally, we remark that along with ansatz strategies there are other ways of potentially addressing BPs. These include optimizers tailored to mitigate the effect of BPs [210], local cost functions [113], or architectures such as the QCNN, which has been shown to avoid BPs [179].

B. Efficiency

Another requirement that must be met for VQAs to provide a quantum advantage is having an efficient way to estimate expectation values (and more general cost functions). The existence of BPs can exponentially increase the precision requirements needed for the optimization portion of VQAs, as discussed in the section on BPs, but even in the absence of such BPs these expectation value estimations are not guaranteed to be efficient. Indeed, early estimations of resource requirements suggested that the number of measurements that would be required for interesting quantum chemistry VQE problems would be astronomical; hence, addressing this issue is essential for realizing quantum advantage [28]. More reasonable resource estimates can be reached for restricted problems such as the Hubbard model [211, 212]. Although in principle one could always take projective measurements onto the eigenbasis of the operator in question, in general both the computational complexity of finding the required unitary, and the depth required to implement that transformation, may be intractable. However, given that arbitrary Pauli operators are diagonalizable with one layer of single-qubit rotations, it is common for the operators of interest (such as quantum chemistry Hamiltonians) to be expressed by their decomposition into such Pauli operators. That is, $H = \sum_i c_i \sigma_i$, where $\{c_i\}$ are real coefficients and $\{\sigma_i\}$ are Pauli operators. The drawback of this approach is that, for many interesting Hamiltonians, this decomposition contains many terms. For example, for chemical Hamiltonians the number of distinct Pauli strings scales as $n^4$ for large molecules, where n is the number of orbitals (and thus qubits). In what follows we discuss several methods whose goal is to obtain measurement frugality in estimating the cost function.

1. Commuting sets of operators

In the interest of reducing the number of measurements required to estimate an operator expectation value, a number of methods have been proposed for partitioning sets of Pauli strings into commuting (simultaneously measurable) subsets. The choice of the subsets is of course non-unique and has been mapped onto the combinatorial problems of graph coloring [213, 214], finding the minimum clique cover [215–218], or finding the maximal flow in network flow graphs [219], which makes it possible to import the heuristics and formal results from those problems.

Perhaps the simplest approach to such a partitioning is to look for subsets that are qubit-wise commuting (QWC), which is to say that the Pauli operators on each qubit commute. Indeed, this was the first method introduced [220]. However, whereas QWC methods help reduce the number of operators, they do not change the asymptotic scaling for quantum chemistry applications, motivating more general commutative groupings to be considered. To this end, it has been shown that by considering general commutations (and increasing the number of gates of the circuit quadratically with n) the scaling of the number of measurements can be reduced to $n^3$ [213, 214, 216–219].
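The QWC criterion can be illustrated with a short sketch. The greedy first-fit heuristic and the list of Pauli strings below are assumptions chosen for illustration; they are not the specific algorithms of Refs. [213–220], which rely on graph coloring or clique-cover heuristics.

```python
def qubit_wise_commute(p, q):
    """True if Pauli strings p and q commute on every qubit individually.

    Two single-qubit Paulis commute when they are equal or one is the identity.
    """
    return all(a == b or a == "I" or b == "I" for a, b in zip(p, q))

def group_qwc(pauli_strings):
    """Greedy first-fit partition into qubit-wise commuting groups."""
    groups = []
    for p in pauli_strings:
        for group in groups:
            if all(qubit_wise_commute(p, q) for q in group):
                group.append(p)
                break
        else:
            # No existing group can accept p, so open a new one.
            groups.append([p])
    return groups

# Hypothetical two-qubit Hamiltonian terms, written as strings over I, X, Y, Z.
terms = ["ZI", "IZ", "ZZ", "XX", "YY", "XI", "IX"]
groups = group_qwc(terms)
```

For this list the seven terms collapse into three measurement settings: {ZI, IZ, ZZ}, {XX, XI, IX}, and {YY}. Note that ZZ, XX, and YY commute with one another as full operators but not qubit-wise, which is the kind of structure the general commutation groupings exploit at the cost of extra gates.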

When using VQE on fermionic systems, this scaling can actually be brought down to either quadratic or, for simpler cases, even linear [221] in n. This significant improvement is found by considering factorizations of the two-electron integral tensors, rather than working at the operator level. The success of this approach suggests that using background information on the problem may significantly improve the measurement efficiency of estimating an expectation value.

2. Optimized sampling

In addition to reducing the number of individual operators that need to be measured, measurement efficiency can also be improved by carefully allocating the number of shots among the Pauli operators. Since operators with smaller coefficients will tend to contribute less to the overall variance, assigning the same number of shots to each operator is usually inefficient. Instead, the optimal approach [222] is to give each Pauli operator a number of shots proportional to $|c_i|\sqrt{\mathrm{Var}(\sigma_i)}$, where $c_i$ is the coefficient of the ith Pauli operator $\sigma_i$ and $\mathrm{Var}(\sigma_i)$ is the variance of $\langle\sigma_i\rangle$. During an optimization where low precision steps may be allowed early on, this allocation can instead be performed randomly with probabilities proportional to $|c_i|\sqrt{\mathrm{Var}(\sigma_i)}$. Making the allocation randomly in this way allows for unbiased estimates with as little as one shot, potentially significantly increasing the efficiency of the optimization [223]. Optimizing the sampling of the metric tensor has also been explored, with the conclusion that these costs need not be dominant in metric-aware VQAs [224].
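The weighted allocation rule can be compared with a uniform split in a few lines. This is a sketch under stated assumptions: the coefficients and variances are hypothetical, the variances are treated as known single-shot variances (in practice they must be estimated), each term is assumed to be measured independently, and fractional shot counts are left unrounded.

```python
import math

def allocate_shots(coeffs, variances, total_shots):
    """Split total_shots proportionally to |c_i| * sqrt(Var(sigma_i))."""
    weights = [abs(c) * math.sqrt(v) for c, v in zip(coeffs, variances)]
    norm = sum(weights)
    return [total_shots * w / norm for w in weights]

def estimator_variance(coeffs, variances, shots):
    """Variance of sum_i c_i <sigma_i> when term i receives shots[i] samples."""
    return sum(c * c * v / s for c, v, s in zip(coeffs, variances, shots))

coeffs = [1.0, 0.5, 0.1]      # hypothetical coefficients c_i
variances = [1.0, 1.0, 1.0]   # hypothetical single-shot variances Var(sigma_i)
total = 1600

weighted = allocate_shots(coeffs, variances, total)
uniform = [total / len(coeffs)] * len(coeffs)

var_weighted = estimator_variance(coeffs, variances, weighted)
var_uniform = estimator_variance(coeffs, variances, uniform)
```

With these numbers the weighted rule assigns roughly 1000, 500, and 100 shots and yields an estimator variance of about 0.0016, compared with about 0.0024 for the uniform split at the same total budget.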

3. Classical shadows

Another promising approach to efficient measurements is the construction of classical shadows [225], also known as shadow tomography. In this approach, an approximate classical representation of the state (the classical shadow) is constructed by summing over the collection of states that a sequence of different measurements projects onto. These measurements are taken in the basis of randomly chosen strings so that a partial tomography of the state is completed. Combining the measurements in this way, each shot contributes to the estimation of each Pauli operator expectation value, resulting in a number of measurements that scales logarithmically with n. As with the direct measurement approaches discussed above, this approach can also be further optimized by tuning the probability distribution for the Pauli operators that define the measurements to match the properties of the operator and state [226].

4. Neural network tomography

A different approach using partial tomography is to train an approximate restricted Boltzmann machine (RBM) representation of the desired quantum state [227]. This RBM is fitted using measurements of the Pauli operators that are needed to directly estimate a given operator’s expectation value, and so does not inherently reduce the number of operators to measure. However, by computing the expectation value on an approximate RBM instead of directly from measurements, the sampling variance for a given number of shots is substantially reduced at the cost of introducing a small, positive bias [227].

C. Accuracy

One of the main goals for VQAs is to enable a practical use for NISQ devices. For this goal, VQAs provide a strategy to deal with hardware noise as they can potentially minimize quantum circuit depth. Moreover, as discussed below, error mitigation methods can be combined with VQAs to further improve accuracy. However, one can still ask what the impact of hardware noise will be on the accuracy of a VQA.

1. Impact of hardware noise

There are multiple aspects of the impact of hardware noise: it could potentially slow down the training process, it could bias the landscape so that the noisy global optimum no longer corresponds to the noise-free global optimum, and it could affect the final value of the optimal cost.

• Effect of noise on training. The question of whether noise can help with the training process was posed in Ref. [228]. In practice, it is typical to observe that noise slows down the training. For example, it was heuristically observed that the noise-free cost achieves lower values with noise-free training than with noisy training [96, 223, 229]. As discussed in the section on BPs, the intuition behind this slowing down is that the cost landscape is flattened, and hence gradient magnitudes are reduced, by the presence of incoherent noise [202, 230, 231]. Moreover, gradients decay exponentially with the algorithm’s depth, meaning that the deeper the circuit, the more it will be affected. This can be further understood from the fact that cost functions are typically extremized by pure states, and since incoherent noise reduces state purity, one expects this noise to erode the extremal points of the landscape [203]. The presence of noise-induced BPs and their effect on trainability is one of the leading challenges for VQAs, with potential solutions being the development of better quantum hardware or shorter-depth algorithms. It is worth remarking that the results discussed here do not account for the use of error mitigation techniques, and the extent to which these could help is still an open question.

• Effect of noise on cost evaluation. In Refs. [202, 203] it was also shown that in the presence of local Pauli noise, the cost landscape concentrates exponentially with the depth of the ansatz around the value of the cost associated with the maximally mixed state. Whereas the proof of this exponential concentration of the cost was for general VQAs, some previous works had also observed this effect for the special case of the QAOA [230, 231]. The exponential concentration of the cost is of course important beyond the issue of trainability. Even if one is able to train, the final cost value will be corrupted by noise. There are certain VQAs where this is not an important issue (for example, in QAOA where one can classically compute the cost after sampling). However, for VQE problems, this is important, since one is ultimately interested in an accurate estimation of the energy. This emphasizes the importance of understanding to what degree error mitigation methods can correct for this issue.

2. Noise resilience

One reason for the interest in VQAs is their ability to naturally overcome certain types of noise in hardware, especially in near-term implementations. This noise resilience is a crucial, non-trivial feature of VQAs.

• Inherent resilience to coherent noise. By construction, VQAs are insensitive to the specific parameter values, ultimately only sampling physical observables from the resulting state. More specifically, if the physical implementation of a unitary results in a coherent error within the parameter space, that is, if U(θ) actually results in U(θ + δ), then under mild assumptions the optimizer can calibrate this block unitary on the fly to improve the physical state produced. This effect was first conjectured theoretically [220] and later seen experimentally in superconducting qubits [48], where errors after the variational procedure were reduced in some cases by over an order of magnitude. Success in this endeavor depends upon the ability to optimize faster than the drift of calibration in the device, and on sufficient variational flexibility in the ansatz, but may continue to be effective even into the early fault-tolerant regime where coherent errors can be especially insidious.

• Inherent resilience to incoherent noise. It is an interesting question as to what degree incoherent noise, such as decoherence, random gate errors, and measurement errors, will impact VQAs. For example, it was shown that optimization in the presence of some noise channels can automatically move the state into subspaces that are resilient to those channels as an energetic trade-off [55]. However, one could operate under the assumption of perfect training (which may still be possible for either weak noise or shallow ansatzes), and ask whether the global optimum in the cost landscape is robust to such noise. This was the approach of Ref. [150], where it was shown that VQAs for quantum compiling (see section on compilation) exhibit a special type of noise resilience known as Optimal Parameter Resilience (OPR). OPR is the notion that global minima of the noisy cost function correspond to global minima of the noise-free cost function. In this sense, if an algorithm exhibits OPR, then minimizing the cost in the presence of noise will still obtain the correct optimal parameters, and hence the optimal parameters are resilient. Since quantum compilation is a special case of VQE, the question still remains open as to whether other VQAs exhibit this type of noise resilience for certain noise models. A different type of noise resilience was analyzed in Ref. [232] in the context of the holographic quantum simulation of many-body systems. Specifically, it was shown that under certain conditions the expectation values of local observables measured on the prepared ground state are perturbed by, at most, a function that does not depend on the system size, but rather only on the noise parameter.

3. Error mitigation

Quantum Error Mitigation (QEM) generally suppresses the effect of physical errors on expectation values of observables via classical post-processing of measurement outcomes [233]. An intuitive, but powerful, example is the extrapolation method [90, 234]. Even if the error rate cannot be reduced, in many cases it can be deliberately boosted: for example, as shown in Fig. 7, by inserting additional noisy pulses or making gate operations longer, one makes the quantum device undergo more physical errors. Then, by obtaining measurement outcomes at several noise levels and extrapolating them, one can estimate the error-free result using the so-called zero-noise extrapolation method. Due to the propagation of uncertainty, the variance of the error-mitigated result is amplified and hence one needs to have a larger sampling cost, which is the overhead of QEM. First, Richardson extrapolation was proposed [90, 106, 234], and it was shown that single- and multi-exponential extrapolation work well for Markovian gate errors, with the latter subsequently shown to have very broad efficacy [235, 236]. In addition, extrapolation using least-squares fitting for several noise parameters has been proposed [237]. Furthermore, it has been observed that the extrapolation method can mitigate algorithmic errors that arise due to insufficiency in the number of time steps [238].
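The extrapolation step can be sketched as polynomial (Richardson-type) extrapolation to zero noise. The noise model below, in which the measured value depends quadratically on a noise scale factor λ, is a hypothetical stand-in for hardware data, and the function names are introduced here for illustration only.

```python
def richardson_extrapolate(scale_factors, values):
    """Evaluate at zero noise the polynomial interpolating (scale, value) pairs.

    Uses the Lagrange basis evaluated at 0:
    weight_j = prod_{k != j} lam_k / (lam_k - lam_j).
    """
    estimate = 0.0
    for j, (lam_j, y_j) in enumerate(zip(scale_factors, values)):
        weight = 1.0
        for k, lam_k in enumerate(scale_factors):
            if k != j:
                weight *= lam_k / (lam_k - lam_j)
        estimate += weight * y_j
    return estimate

def noisy_expectation(lam, ideal=1.0):
    """Toy noise model (assumption): value degrades quadratically with lam."""
    return ideal - 0.3 * lam + 0.05 * lam ** 2

scales = [1.0, 2.0, 3.0]                  # noise boosted by factors 1, 2, 3
measured = [noisy_expectation(s) for s in scales]
mitigated = richardson_extrapolate(scales, measured)
```

Here the three measured values 0.75, 0.60, and 0.55 are combined with weights 3, −3, and 1 to recover the ideal value 1.0. If each measured value carries the same statistical variance, the sum of squared weights shows that the mitigated estimate has 19 times that variance, which illustrates the sampling overhead mentioned above.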

Although extrapolation methods by design cannot fully mitigate physical errors [234, 235], probabilistic error cancellation in theory can obtain unbiased expectation values by inverting the noise process with additional probabilistic gate operations (if a complete characterization of noise is provided). Note that since an inverse map of physical errors is generally unphysical, it is necessary to post-process measurement outcomes according to applied recovery operations. In Ref. [234] this method was first introduced and in Ref. [235] it was found that gate set tomography is a suitable noise characterization strategy, and a set of operations was proposed which can compensate for general Markovian errors. Furthermore, based on probabilistic error cancellation, stochastic error mitigation which works for general continuous systems such as analog quantum simulators and digital quantum computers was introduced [239].

FIG. 7. Qubit trajectories on the Bloch sphere with the Zero-Noise Extrapolation (ZNE) technique. The accuracy of a noisy quantum computer can be improved with the ZNE error mitigation method. a. Here, one repeats a given calculation with different levels of noise. The green curve corresponds to a rotation on the Bloch sphere with a higher noise level than that leading to the red curve. b. Taking data from the red and green curves, ZNE can be used to estimate what the trajectory (blue) would be like in the absence of noise. Adapted from Ref. [106], Springer Nature Limited.

A different approach to QEM relies on the classical simulability of near-Clifford circuits. The basic idea behind this approach is to compare the classically computed exact expectation values for near-Clifford circuits with their noisy counterparts evaluated on actual hardware [240–242]. Taking this approach can allow one to implement a probabilistic error mitigation protocol without needing to construct a full error model for an experiment [240]. Alternatively, one can perform a simple regression with this Clifford data to estimate how the observables have been affected and invert this regression to estimate desired noise-free expectation values [241]. Finally, zero-noise extrapolation can be merged with this regression to have an extrapolation to zero-noise whose form is tuned via the Clifford data, reducing the risk of blind extrapolations [242].
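The regression-and-inversion idea can be sketched with an ordinary least-squares line. This is a minimal illustration, assuming a linear relation between noisy and exact values; the training pairs, the toy damping-plus-offset noise model, and the function names are hypothetical and are not taken from Ref. [241].

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit ys ~ slope * xs + offset."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical training set: exact values would come from classical simulation
# of near-Clifford circuits, noisy values from the same circuits on hardware.
exact_train = [-1.0, -0.5, 0.0, 0.5, 1.0]
noisy_train = [0.8 * e + 0.02 for e in exact_train]   # toy noise model

# Step 1: learn how noise maps exact values to noisy ones.
slope, offset = fit_line(exact_train, noisy_train)

def mitigate(noisy_value):
    """Step 2: invert the fitted map to estimate the noise-free value."""
    return (noisy_value - offset) / slope

mitigated = mitigate(0.42)   # noisy result of the circuit of interest
```

In this toy case the fit recovers a slope of 0.8 and an offset of 0.02, so a noisy value of 0.42 is mapped back to approximately 0.5. The quality of the correction on real hardware depends on how well the training circuits resemble the circuit of interest.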

Several additional QEM methods have been proposed. Symmetry verification is especially useful for ansatze that preserve symmetries such as particle and spin number [11, 243, 244]. Since physical errors break the symmetry, one can mitigate physical noise by measuring the symmetry and discarding the outcomes that violate it (similarly to error detection). Unlike other QEM methods, symmetry verification can recover the quantum state itself. One can also take a post-processing approach using the information of the symmetry with a larger sample number [244]. The use of symmetry verification to augment error extrapolation and probabilistic error cancellation was taken still further in Ref. [236].
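As a minimal sketch of the post-selection step, the fragment below discards measured bitstrings whose particle number differs from the conserved value and recomputes an expectation value from the remaining shots. It assumes an encoding in which the particle number equals the Hamming weight of the bitstring (as in a Jordan-Wigner mapping). The measurement counts and function names are hypothetical.

```python
def postselect(counts, n_particles):
    """Keep only bitstrings whose Hamming weight matches the conserved particle number."""
    return {bits: c for bits, c in counts.items() if bits.count("1") == n_particles}


def expectation_z0(counts):
    """Estimate <Z> on the first qubit from a dictionary of bitstring counts."""
    total = sum(counts.values())
    signed = sum((1 if bits[0] == "0" else -1) * c for bits, c in counts.items())
    return signed / total


# Hypothetical shots from a two-qubit, one-particle ansatz.
# "00" and "11" violate particle-number conservation and are attributed to noise.
counts = {"01": 450, "10": 450, "00": 60, "11": 40}

kept = postselect(counts, n_particles=1)
raw_value = expectation_z0(counts)
verified_value = expectation_z0(kept)
discarded_fraction = 1 - sum(kept.values()) / sum(counts.values())
```

The discarded fraction indicates the sampling overhead: the more shots violate the symmetry, the more repetitions are needed to reach a given statistical precision.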

In an alternative and complementary approach, the subspace expansion method was also shown to be useful for QEM in Ref. [245]. Here, using subspace expansion one can mitigate physical noise for eigenstates of the Hamiltonian as well as evaluate excited states, because the state is expanded in a larger subspace. Note that this method works better for coherent noise than for stochastic noise. A distinct approach was introduced in Refs. [246, 247] which comes at the cost of increasing the number of qubits. Here, by entangling and measuring M copies of a noisy state ρ, one can compute expectation values with respect to the state $\rho^M / \mathrm{Tr}[\rho^M]$. Under the assumption that the principal eigenvector of ρ is the desired state, this method can exponentially suppress errors with M. Finally, we remark that Ref. [248] introduced a method to mitigate expectation values against correlated measurement errors, whereas Ref. [249] implemented an error mitigation technique to suppress the effects of photon loss for a Gaussian Boson sampling device.
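The suppression with M can be checked numerically on a toy example. The sketch below forms the M-th matrix power of ρ classically and evaluates Tr[Oρ^M]/Tr[ρ^M]. It is a classical emulation for illustration only and does not implement the multi-copy entangling measurement used on hardware. The single-qubit state, the noise strength, and the function names are assumptions chosen for this example.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


def distilled_expectation(rho, obs, copies):
    """Return Tr[obs rho^M] / Tr[rho^M] for M = copies."""
    power = rho
    for _ in range(copies - 1):
        power = matmul(power, rho)
    return trace(matmul(obs, power)) / trace(power)


# Assumed noisy state: rho = (1 - eps)|+><+| + eps|-><-|, so the principal
# eigenvector is the desired state |+> and the ideal value of <X> is 1.
eps = 0.1
rho = [[0.5, 0.5 - eps], [0.5 - eps, 0.5]]
pauli_x = [[0.0, 1.0], [1.0, 0.0]]

# Deviation from the ideal value for M = 1, 2, 3 copies.
errors = [1.0 - distilled_expectation(rho, pauli_x, m) for m in (1, 2, 3)]
```

For this state the deviation equals 2ε^M / ((1 − ε)^M + ε^M), which decays roughly as (ε/(1 − ε))^M.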

V. OPPORTUNITIES FOR NEAR-TERM QUANTUM ADVANTAGE

VQAs are widely regarded as the best candidate for providing quantum advantage for practical applications. That is, it is expected that a VQA can solve a problem more efficiently than any classical state-of-the-art method. As discussed in the main text, tremendous effort has been dedicated to this goal with the development of efficient ansatz strategies, quantum-aware optimization methods, new VQAs, and error mitigation techniques. Although many challenges still remain to be addressed, such as the need for larger and better quantum devices, one can nevertheless ask which specific applications will provide the first quantum advantage in a practical scenario. In this section we discuss some of the most exciting possibilities where quantum advantage could arise.

A. Chemistry and material sciences

The ability to simulate and understand the static and dynamical properties of molecules and strongly correlated electronic systems is a fundamental task in many areas of science. For instance, this task is relevant in biology to understand protein folding dynamics, and in pharmaceutical sciences one could analyze drug-receptor interactions to improve drug discovery capabilities [250–252]. Similarly, analyzing the electronic structure of complex correlated materials is very important for studying high-temperature superconductivity or analyzing transition metal materials near a Mott transition.

1. Molecular structure

In the past few decades there have been great developments in the classical treatment of the structure of molecular systems. These include approximate methods such as Hartree-Fock or density functional theory, or methods closely connected to quantum information, like the density matrix renormalization group approach that utilizes matrix product states as an ansatz [253, 254]. However, even for these sophisticated approaches, systems of interest such as the FeMo cofactor are beyond the reach of an accurate description due to the entanglement structure of the electrons and orbitals. The electronic space in which correlations must be treated accurately for these systems is relatively modest, and for that reason, these may be good targets for near-term quantum computers to play a role. As discussed in the main text, the Variational Quantum Eigensolver algorithm [16] (and associated architectures) has shown promising advances towards the goal of performing molecular quantum chemistry on quantum computers [255], with large-scale implementations already executed [256].

2. Molecular dynamics

As for the dynamics of chemical and other quantum systems, there have been a number of strides in evaluating or compressing these evolutions using variational approaches [90, 91]. Much like variational principles connected to the ground state, there are a number of time-dependent variational principles that can be used to approximate time-dynamics. Here there are two timescales of interest. The first is the electronic timescale over which electrons rearrange upon excitation. The second, much slower than the first, is the rearrangement of nuclei that is induced by forces derived from the electrons in their respective configurations, excited or not. Generally speaking, treating the detailed dynamics of the electrons accurately has been extremely challenging for classical approaches despite its relevance in phenomena related to photovoltaics and light-emitting diodes [257, 258]. The separation between the two timescales has motivated the development of methods that treat them separately, often using a classical or semi-classical representation for the nuclei and a quantum representation for the electrons [259]. Variational methods can be applied incrementally in these cases, by stepping the electronic wavefunction forward with time-dependent variational principles [90, 91] and sampling the forces [260] to move the nuclei classically, resulting in a Born-Oppenheimer type molecular dynamics. Early test systems for quantum molecular dynamics often include photo-dissociation reactions and conical intersections of small molecular systems [261]. Ultimately, these methods may help unlock proton-coupled electron transfer mechanisms [262] in proteins and help with the design of novel organic photovoltaics [257] and related systems.

3. Materials science

Classical methods for materials simulations usually use density-functional theory coupled with approximation methods, such as the local density approximation [263], to tackle weakly correlated materials. However, many effects arising from strongly correlated systems are beyond the reach of such classical methods. Since long-term algorithms for material simulation require phase estimation [264–266], these lie beyond the scope of near-term devices. In contrast, near-term VQAs for analyzing strongly correlated problems aim to reduce the circuit depth by using smart initializations [267] or by optimizing the circuit structure itself [31, 32].

B. Nuclear and particle physics

1. Nuclear physics

Similar to the chemistry applications discussed above, VQAs have the potential to convey a quantum advantage in studying nuclear structure and dynamics. The most studied potential contribution is the utility of the VQE method to find nuclear ground states. This was first demonstrated for computing the deuteron (²H) binding energy [268], and has been extended to other light nuclei such as the triton (³H), ³He, and an alpha particle (⁴He) [269]. Additionally, using VQE to prepare the ground state of a triton has been an initial step as a demonstration of simulating neutrino-nucleon scattering [270]. Considering these low-energy applications along with the general progress towards studying higher energy nuclear interactions (quantum chromodynamics) via VQA lattice gauge theory approaches (discussed below) shows that VQAs have the potential to provide a significant advantage over classical methods for nuclear physics.

2. Particle physics

In particle physics many analytical tools have been developed to describe and study theories, but there are many areas that remain intractable. In particular, the study of important gauge theories such as quantum chromodynamics is often handled by mapping the problem onto a lattice to allow for numerical studies. One of the major drawbacks of such Lattice Gauge Theories (LGTs) for classical computation is that they exhibit the sign problem and as a result are usually not classically simulable. Although large scale, fault-tolerant quantum computers will eventually be able to handle this difficulty [271, 272], there is also the potential for achieving a significant quantum advantage in this area with VQAs in the Noisy Intermediate-Scale Quantum (NISQ) era [273]. Advances in this direction include work on VQAs for LGT simulation [13] and variational determinations of mass gaps, Green’s functions, and running coupling constants [274–276]. In addition, an approach using a VQA to determine interpolation operators to accelerate classical LGT computations has been proposed [277]. Finally, the impacts of decoherence by hardware noise on LGT calculations have been studied, finding that gauge violations caused by decoherence only grow linearly at short times, suggesting that short depth approaches may be possible [278]. Taken together, these results show that studying LGTs is a viable candidate for NISQ quantum advantage.

C. Optimization and machine learning

Although it is natural to consider that VQAs can bring an advantage on tasks which are inherently quantum in nature, the prospect of using quantum algorithms to solve classical problems is also an exciting one. Generally, the aim here is to use the large dimension of the Hilbert space to encode big problems or large amounts of data, with the premise that the quantum nature of the algorithm (such as coherence or entanglement between qubits [279]) helps in speeding up a given task.

1. Optimization

Many optimization problems can be encoded in relatively simple mathematical models such as the Max-Cut [128] or the Max-Sat [127] problems. These include tasks such as electronic circuitry layout design, state problems in statistical physics [280], and even automotive configuration [281]. Applying the Quantum Approximate Optimization Algorithm (QAOA) to classical optimization problems is widely considered to be one of the leading candidates for achieving quantum advantage on NISQ devices [131]. There are several reasons for this optimism. QAOA has provable performance guarantees [23, 282] for p = 1. In general, even the p = 1 QAOA ansatz cannot be efficiently simulated on any classical device [283]. At the same time, increasing p can only improve QAOA performance. It was also shown that the ‘bang-bang’ evolution that motivates the QAOA ansatz is the optimal approach given fixed quantum computation time [43]. However, there are problems for which a shallow QAOA ansatz does not perform well [284, 285], suggesting that p may have to grow with the problem size. Larger p requires improvements in the parametrization and optimization [204]. Similarly to quantum chemistry, large scale experiments of QAOA have already been implemented [286].
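To make the p = 1 QAOA ansatz concrete, the following sketch simulates it by brute force for Max-Cut on a four-node ring and scans the two angles on a coarse grid. It assumes the standard form of the ansatz from Ref. [23], a cost phase e^{-iγC} followed by the mixer e^{-iβΣX} applied to the uniform superposition. The graph, the grid resolution, and all function names are illustrative choices. The state-vector simulation scales exponentially with the number of qubits and is only feasible here because the instance is tiny.

```python
import cmath
import math


def cut_value(z, edges):
    """Number of edges cut by the bitstring encoded in the integer z."""
    return sum(((z >> u) & 1) != ((z >> v) & 1) for u, v in edges)


def qaoa_p1_expectation(n, edges, gamma, beta):
    """<C> in the state exp(-i beta sum_j X_j) exp(-i gamma C) |+>^n."""
    dim = 1 << n
    costs = [cut_value(z, edges) for z in range(dim)]
    # Uniform superposition followed by the diagonal cost phase.
    amp = [cmath.exp(-1j * gamma * cost) / math.sqrt(dim) for cost in costs]
    # Mixer: apply exp(-i beta X) to every qubit in turn.
    cos_b, sin_b = math.cos(beta), -1j * math.sin(beta)
    for q in range(n):
        bit = 1 << q
        for i in range(dim):
            if not i & bit:
                a0, a1 = amp[i], amp[i | bit]
                amp[i] = cos_b * a0 + sin_b * a1
                amp[i | bit] = sin_b * a0 + cos_b * a1
    return sum(abs(a) ** 2 * cost for a, cost in zip(amp, costs))


def grid_search(n, edges, steps=16):
    """Coarse scan of gamma in [0, 2 pi) and beta in [0, pi)."""
    best = (-1.0, 0.0, 0.0)
    for i in range(steps):
        for j in range(steps):
            gamma = 2 * math.pi * i / steps
            beta = math.pi * j / steps
            value = qaoa_p1_expectation(n, edges, gamma, beta)
            best = max(best, (value, gamma, beta))
    return best


ring = [(0, 1), (1, 2), (2, 3), (3, 0)]
best_value, best_gamma, best_beta = grid_search(4, ring)
approximation_ratio = best_value / 4  # the maximum cut of the ring has 4 edges
```

A grid scan is used only to keep the example short; in a VQA setting the angles would be trained by a classical optimizer using expectation values estimated on hardware.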

2. Machine Learning

In the past few decades, the use of machine learning has become common in most, if not all, areas of science. Although the problem of loading classical data on quantum computers is still an active topic of research, there have been significant efforts to use quantum algorithms for machine learning applications [154, 157, 174, 287]. For instance, it has been shown that quantum neural networks can achieve a significantly higher capacity, as measured by the effective dimension, than comparable classical neural networks [77], implying that the former can express a broader class of functions than the latter. Moreover, it has also been pointed out that quantum algorithms can outperform classical ones in deep learning problems [288] and can potentially provide an exponentially better ability to generalize when trained to predict the outcome of physical processes [289]. More recently, a VQA has been proposed for deep reinforcement learning [290]. An exciting prospect for using quantum neural networks is that certain architectures are immune to barren plateaus, and hence are trainable even for large problems [179, 180].

VI. OUTLOOK

In the quest for quantum advantage, analytical and heuristic scaling analysis of VQAs will be increasingly important. Better methods to analyze VQA scalability are anticipated in the future. This will likely include both gradient scaling and other scaling aspects, such as the density of local minima and the shape of the cost landscape. These fundamental results will help to guide the search for quantum advantage.

At the same time, the future will also see an improved toolbox for VQAs. Quantum-aware optimizers will exploit knowledge gained about the cost landscape. These improved optimizers will mitigate the impacts of small gradients and avoid local minima to facilitate rapid training of the parameters in VQAs. Moreover, commercial software packages will streamline the testing of VQAs and further speed up the parameter optimization [82, 291, 292].

Application-specific ansatzes will continue to be developed. Better ansatzes will enhance gradient magnitudes to improve trainability and they may also reduce the impact of noise on VQAs. This will likely include adaptive ansatz strategies, which appear promising. Hybrid quantum-classical models [72] are a natural extension of VQAs where one parameterizes both a classical (for example, neural network) and quantum ansatz, and such models could also facilitate near-term applications.

New error mitigation strategies are anticipated in the future. These will be crucial for obtaining accurate results from VQAs and will improve accuracy by orders of magnitude. Error mitigation will be hard-coded into cloud-based quantum computing platforms, to allow users to obtain accurate results with ease.

The future will also see better quantum hardware become available, both in terms of qubit count and noise levels. VQAs will certainly benefit from such improved hardware. Moreover, VQAs will play a central role in benchmarking the capabilities of these new platforms.

In the near future, VQAs will likely see a shift from the proposal and development phase to the implementation phase. Researchers will aim to implement larger, more realistic problems with VQAs instead of toy problems. These implementations will incorporate multiple state-of-the-art strategies for enhancing VQA performance. Combining strategies for improving the accuracy, trainability, and efficiency of VQAs will test their ultimate capabilities and will push the boundaries of NISQ devices, with the grand vision of obtaining quantum advantage.

In the more distant future, VQAs will find use even when the fault-tolerant era arrives. Transitioning from estimating expectation values from Hamiltonian averaging to phase estimation may be an important component here [114]. QAOA may be a good candidate VQA to find usage in the fault-tolerant era, albeit with caveats about the overhead [293]. Strategies that address challenges in the NISQ era, such as keeping circuit depth shallow and avoiding barren plateaus, could still play a role in the fault-tolerant era. Therefore, current research on VQAs will likely remain useful even when fault-tolerant quantum devices arrive.

VII. REFERENCES

[1] Peter W Shor, “Algorithms for quantum computation: discrete logarithms and factoring,” in Proceedings 35th annual symposium on foundations of computer science (IEEE, 1994) pp. 124–134.

[2] Seth Lloyd, “Universal quantum simulators,” Science 273, 1073–1078 (1996).

[3] Aram W Harrow, Avinatan Hassidim, and Seth Lloyd, “Quantum algorithm for linear systems of equations,” Physical Review Letters 103, 150502 (2009).

[4] “IBM Makes Quantum Computing Available on IBM Cloud to Accelerate Innovation,” (2016), press release at https://www-03.ibm.com/press/us/en/pressrelease/49661.wss.

[5] Adetokunbo Adedoyin, John Ambrosiano, Petr Anisimov, Andreas Bärtschi, William Casper, Gopinath Chennupati, Carleton Coffrin, Hristo Djidjev, David Gunter, Satish Karra, et al., “Quantum algorithm implementations for beginners,” arXiv preprint arXiv:1804.03719 (2018).

[6] J. Preskill, “Quantum computing in the NISQ era and beyond,” Quantum 2, 79 (2018).

[7] Frank Arute et al., “Quantum supremacy using a programmable superconducting processor,” Nature 574, 505–510 (2019).

[8] Han-Sen Zhong, Hui Wang, Yu-Hao Deng, Ming-Cheng Chen, Li-Chao Peng, Yi-Han Luo, Jian Qin, Dian Wu, Xing Ding, Yi Hu, et al., “Quantum computational advantage using photons,” Science 370, 1460–1463 (2020).

[9] A. Kandala, A. Mezzacapo, K. Temme, M. Takita, M. Brink, J. M. Chow, and J. M. Gambetta, “Hardware-efficient variational quantum eigensolver for small molecules and quantum magnets,” Nature 549, 242–246 (2017).

[10] Bryan T Gard, Linghua Zhu, George S Barron, Nicholas J Mayhall, Sophia E Economou, and Edwin Barnes, “Efficient symmetry-preserving state preparation circuits for the variational quantum eigensolver algorithm,” npj Quantum Information 6, 1–9 (2020).

[11] Matthew Otten, Cristian L Cortes, and Stephen K Gray, “Noise-resilient quantum dynamics using symmetry-preserving ansatzes,” arXiv preprint arXiv:1910.06284 (2019).

[12] Nikolay V Tkachenko, James Sud, Yu Zhang, Sergei Tretiak, Petr M Anisimov, Andrew T Arrasmith, Patrick J Coles, Lukasz Cincio, and Pavel A Dub, “Correlation-informed permutation of qubits for reducing ansatz depth in VQE,” arXiv preprint arXiv:2009.04996 (2020).

[13] Christian Kokail, Christine Maier, Rick van Bijnen, Tiff Brydges, Manoj K Joshi, Petar Jurcevic, Christine A Muschik, Pietro Silvi, Rainer Blatt, Christian F Roos, et al., “Self-verifying variational quantum simulation of lattice models,” Nature 569, 355–360 (2019).

[14] Carlos Bravo-Prieto, Josep Lumbreras-Zarapico, Luca Tagliacozzo, and José I Latorre, “Scaling of variational quantum circuit depth for condensed matter systems,” Quantum 4, 272 (2020).

[15] Andrew G. Taube and Rodney J. Bartlett, “New perspectives on unitary coupled-cluster theory,” International Journal of Quantum Chemistry 106, 3393–3401 (2006).

[16] A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, and J. L. O’Brien, “A variational eigenvalue solver on a photonic quantum processor,” Nature Communications 5, 4213 (2014).

[17] Sergey B Bravyi and Alexei Yu Kitaev, “Fermionic quantum computation,” Annals of Physics 298, 210–226 (2002).

[18] Joonho Lee, William J. Huggins, Martin Head-Gordon, and K. Birgitta Whaley, “Generalized unitary coupled cluster wave functions for quantum computation,” Journal of Chemical Theory and Computation 15, 311–324 (2019).

[19] Mario Motta, Erika Ye, Jarrod R McClean, Zhendong Li, Austin J Minnich, Ryan Babbush, and Garnet Kin Chan, “Low rank representations for quantum simulation of electronic structure,” arXiv preprint arXiv:1808.02625 (2018).

[20] Yuta Matsuzawa and Yuki Kurashige, “Jastrow-type decomposition in quantum chemistry for low-depth quantum circuits,” Journal of Chemical Theory and Computation 16, 944–952 (2020).

[21] Ian D Kivlichan, Jarrod McClean, Nathan Wiebe, Craig Gidney, Alán Aspuru-Guzik, Garnet Kin-Lic Chan, and Ryan Babbush, “Quantum simulation of electronic structure with linear depth and connectivity,” Physical Review Letters 120, 110501 (2018).

[22] Kanav Setia, Sergey Bravyi, Antonio Mezzacapo, and James D Whitfield, “Superfast encodings for fermionic quantum simulation,” Physical Review Research 1, 033033 (2019).

[23] Edward Farhi, Jeffrey Goldstone, and Sam Gutmann, “A quantum approximate optimization algorithm,” arXiv preprint arXiv:1411.4028 (2014).

[24] S. Hadfield, Z. Wang, B. O’Gorman, E. G. Rieffel, D. Venturelli, and R. Biswas, “From the quantum approximate optimization algorithm to a quantum alternating operator ansatz,” Algorithms 12, 34 (2019).

[25] Seth Lloyd, “Quantum approximate optimization is computationally universal,” arXiv preprint arXiv:1812.11075 (2018).

[26] Mauro ES Morales, JD Biamonte, and Zoltán Zimborás, “On the universality of the quantum approximate optimization algorithm,” Quantum Information Processing 19, 1–26 (2020).

[27] Zhihui Wang, Nicholas C Rubin, Jason M Dominy, and Eleanor G Rieffel, “XY mixers: Analytical and numerical results for the quantum alternating operator ansatz,” Physical Review A 101, 012320 (2020).

[28] Dave Wecker, Matthew B Hastings, and Matthias Troyer, “Progress towards practical quantum variational algorithms,” Physical Review A 92, 042303 (2015).

[29] Roeland Wiersema, Cunlu Zhou, Yvette de Sereville, Juan Felipe Carrasquilla, Yong Baek Kim, and Henry Yuen, “Exploring entanglement and optimization within the Hamiltonian variational ansatz,” Phys. Rev. X Quantum 1, 020319 (2020).

[30] Wen Wei Ho and Timothy H Hsieh, “Efficient variational simulation of non-trivial quantum states,” SciPost Phys. 6, 029 (2019).

[31] Harper R Grimsley, Sophia E Economou, Edwin Barnes, and Nicholas J Mayhall, “An adaptive variational algorithm for exact molecular simulations on a quantum computer,” Nature Communications 10, 1–9 (2019).

[32] Ho Lun Tang, VO Shkolnikov, George S Barron, Harper R Grimsley, Nicholas J Mayhall, Edwin Barnes, and Sophia E Economou, “qubit-ADAPT-VQE: An adaptive algorithm for constructing hardware-efficient ansatze on a quantum processor,” arXiv preprint arXiv:1911.10205 (2019).

[33] Yordan S Yordanov, V Armaos, Crispin HW Barnes, and David RM Arvidsson-Shukur, “Iterative qubit-excitation based variational quantum eigensolver,” arXiv preprint arXiv:2011.10540 (2020).

[34] Linghua Zhu, Ho Lun Tang, George S Barron, Nicholas J Mayhall, Edwin Barnes, and Sophia E Economou, “An adaptive quantum approximate optimization algorithm for solving combinatorial problems on a quantum computer,” arXiv preprint arXiv:2005.10258 (2020).

[35] Arthur G Rattew, Shaohan Hu, Marco Pistoia, Richard Chen, and Steve Wood, “A domain-agnostic, noise-resistant, hardware-efficient evolutionary variational quantum eigensolver,” arXiv preprint arXiv:1910.09694 (2019).

[36] D Chivilikhin, A Samarin, V Ulyantsev, I Iorsh, AR Oganov, and O Kyriienko, “MoG-VQE: Multiobjective genetic variational quantum eigensolver,” arXiv preprint arXiv:2007.04424 (2020).

[37] Lukasz Cincio, Kenneth Rudinger, Mohan Sarovar, and Patrick J Coles, “Machine learning of noise-resilient quantum circuits,” Phys. Rev. X Quantum 2, 010324 (2021).

[38] L. Cincio, Y. Subaşı, A. T. Sornborger, and P. J. Coles, “Learning the quantum algorithm for state overlap,” New Journal of Physics 20, 113022 (2018).

[39] Yuxuan Du, Tao Huang, Shan You, Min-Hsiu Hsieh, and Dacheng Tao, “Quantum circuit architecture search: error mitigation and trainability enhancement for variational quantum solvers,” arXiv preprint arXiv:2010.10217 (2020).

[40] Shi-Xin Zhang, Chang-Yu Hsieh, Shengyu Zhang, and Hong Yao, “Differentiable quantum architecture search,” arXiv preprint arXiv:2010.08561 (2020).

[41] M Bilkis, M Cerezo, Guillaume Verdon, Patrick J Coles, and Lukasz Cincio, “A semi-agnostic ansatz with variable structure for quantum machine learning,” arXiv preprint arXiv:2103.06712 (2021).

[42] Arthur G. Rattew, Shaohan Hu, Marco Pistoia, Richard Chen, and Steve Wood, “A domain-agnostic, noise-resistant, hardware-efficient evolutionary variational quantum eigensolver,” arXiv preprint arXiv:1910.09694 (2019).

[43] Zhi-Cheng Yang, Armin Rahmani, Alireza Shabani, Hartmut Neven, and Claudio Chamon, “Optimizing variational quantum algorithms using Pontryagin’s minimum principle,” Phys. Rev. X 7, 021027 (2017).

[44] Alicia B Magann, Christian Arenz, Matthew D Grace, Tak-San Ho, Robert L Kosut, Jarrod R McClean, Herschel A Rabitz, and Mohan Sarovar, “From pulses to circuits and back again: A quantum optimal control perspective on variational quantum algorithms,” Phys. Rev. X Quantum 2, 010101 (2021).

[45] Alexandre Choquette, Agustin Di Paolo, Panagiotis Kl Barkoutsos, David Sénéchal, Ivano Tavernelli, and Alexandre Blais, “Quantum-optimal-control-inspired ansatz for variational quantum algorithms,” arXiv preprint arXiv:2008.01098 (2020).

[46] Jun Li, Xiaodong Yang, Xinhua Peng, and Chang-Pu Sun, “Hybrid quantum-classical approach to quantum optimal control,” Phys. Rev. Lett. 118, 150503 (2017).

[47] Dawei Lu, Keren Li, Jun Li, Hemant Katiyar, Annie Jihyun Park, Guanru Feng, Tao Xin, Hang Li, Guilu Long, Aharon Brodutch, Jonathan Baugh, Bei Zeng, and Raymond Laflamme, “Enhancing quantum control by bootstrapping a quantum processor of 12 qubits,” npj Quantum Information 3, 45 (2017).

[48] Peter JJ O’Malley, Ryan Babbush, Ian D Kivlichan, Jonathan Romero, Jarrod R McClean, Rami Barends, Julian Kelly, Pedram Roushan, Andrew Tranter, Nan Ding, et al., “Scalable quantum simulation of molecular energies,” Physical Review X 6, 031007 (2016).

[49] Tyler Takeshita, Nicholas C. Rubin, Zhang Jiang, Eunseok Lee, Ryan Babbush, and Jarrod R. McClean, “Increasing the representation accuracy of quantum simulations of chemistry without extra quantum resources,” Phys. Rev. X 10, 011004 (2020).

[50] Leslie G. Valiant, “Quantum circuits that can be simulated classically in polynomial time,” SIAM Journal on Computing 31, 1229–1254 (2002).

[51] Barbara M. Terhal and David P. DiVincenzo, “Classical simulation of noninteracting-fermion quantum circuits,” Phys. Rev. A 65, 032325 (2002).

[52] Richard Jozsa and Akimasa Miyake, “Matchgates and classical simulation of quantum circuits,” Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 464, 3089–3106 (2008).


[53] Wataru Mizukami, Kosuke Mitarai, Yuya O. Nakagawa, Takahiro Yamamoto, Tennin Yan, and Yu-ya Ohnishi, “Orbital optimized unitary coupled cluster theory for quantum computer,” Phys. Rev. Research 2, 033421 (2020).

[54] Igor O. Sokolov, Panagiotis Kl. Barkoutsos, Pauline J. Ollitrault, Donny Greenberg, Julia Rice, Marco Pistoia, and Ivano Tavernelli, “Quantum orbital-optimized unitary coupled cluster methods in the strongly correlated regime: Can quantum algorithms outperform their classical equivalents?” The Journal of Chemical Physics 152, 124107 (2020).

[55] Jarrod R McClean, Mollie E Kimchi-Schwartz, Jonathan Carter, and Wibe A de Jong, “Hybrid quantum-classical hierarchy for mitigation of decoherence and determination of excited states,” Physical Review A 95, 042308 (2017).

[56] Robert M Parrish, Edward G Hohenstein, Peter L McMahon, and Todd J Martínez, “Quantum computation of electronic transitions using a variational quantum eigensolver,” Physical Review Letters 122, 230401 (2019).

[57] Robert M Parrish and Peter L McMahon, “Quantum filter diagonalization: Quantum eigendecomposition without full quantum phase estimation,” arXiv preprint arXiv:1909.08925 (2019).

[58] William J Huggins, Joonho Lee, Unpil Baek, Bryan O’Gorman, and K Birgitta Whaley, “A non-orthogonal variational quantum eigensolver,” New Journal of Physics 22, 073009 (2020).

[59] Nicholas H Stair, Renke Huang, and Francesco A Evangelista, “A multireference quantum Krylov algorithm for strongly correlated electrons,” Journal of Chemical Theory and Computation 16, 2236–2245 (2020).

[60] Kishor Bharti and Tobias Haug, “Iterative quantum assisted eigensolver,” arXiv preprint arXiv:2010.05638 (2020).

[61] Kishor Bharti and Tobias Haug, “Quantum assisted simulator,” arXiv preprint arXiv:2011.06911 (2020).

[62] Igor L. Markov and Yaoyun Shi, “Simulating quantum computation by contracting tensor networks,” SIAM Journal on Computing 38, 963–981 (2008).

[63] Isaac H Kim and Brian Swingle, “Robust entanglement renormalization on a noisy quantum computer,” arXiv preprint arXiv:1711.07500 (2017).

[64] Isaac H Kim, “Holographic quantum simulation,” arXiv preprint arXiv:1702.02093 (2017).

[65] Jin-Guo Liu, Yi-Hong Zhang, Yuan Wan, and Lei Wang, “Variational quantum eigensolver with fewer qubits,” Phys. Rev. Research 1, 023025 (2019).

[66] Fergus Barratt, James Dborin, Matthias Bal, Vid Stojevic, Frank Pollmann, and Andrew G Green, “Parallel quantum simulation of large systems on small quantum computers,” arXiv preprint arXiv:2003.12087 (2020).

[67] Xiao Yuan, Jinzhao Sun, Junyu Liu, Qi Zhao, and You Zhou, “Quantum simulation with hybrid tensor networks,” arXiv preprint arXiv:2007.00958 (2020).

[68] Keisuke Fujii, Kosuke Mitarai, Wataru Mizukami, and Yuya O Nakagawa, “Deep variational quantum eigensolver: a divide-and-conquer method for solving a larger problem with smaller size quantum computers,” arXiv preprint arXiv:2007.10917 (2020).

[69] Guglielmo Mazzola, Pauline J. Ollitrault, Panagiotis Kl.Barkoutsos, and Ivano Tavernelli, “Nonunitary opera-

tions for ground-state calculations in near-term quan-tum computers,” Phys. Rev. Lett. 123, 130501 (2019).

[70] John Martyn and Brian Swingle, “Product spectrumansatz and the simplicity of thermal states,” Phys. Rev.A 100, 032107 (2019).

[71] Nobuyuki Yoshioka, Yuya O Nakagawa, Kosuke Mi-tarai, and Keisuke Fujii, “Variational quantum algo-rithm for nonequilibrium steady states,” Physical Re-view Research 2, 043289 (2020).

[72] Guillaume Verdon, Jacob Marks, Sasha Nanda, StefanLeichenauer, and Jack Hidary, “Quantum hamiltonian-based models and the variational quantum thermalizeralgorithm,” arXiv preprint arXiv:1910.02071 (2019).

[73] JinGuo Liu, Liang Mao, Pan Zhang, and Lei Wang,“Solving quantum statistical mechanics with variationalautoregressive networks and quantum circuits,” Ma-chine Learning: Science and Technology 2, 025011(2021).

[74] Sukin Sim, Peter D Johnson, and Alán Aspuru-Guzik,“Expressibility and entangling capability of parameter-ized quantum circuits for hybrid quantum-classical al-gorithms,” Advanced Quantum Technologies 2, 1900070(2019).

[75] Kouhei Nakaji and Naoki Yamamoto, “Expressibility ofthe alternating layered ansatz for quantum computa-tion,” arXiv preprint arXiv:2005.12537 (2020).

[76] Maria Schuld, Ryan Sweke, and Johannes Jakob Meyer,“The effect of data encoding on the expressive power ofvariational quantum machine learning models,” arXivpreprint arXiv:2008.08605 (2020).

[77] Amira Abbas, David Sutter, Christa Zoufal, AurélienLucchi, Alessio Figalli, and Stefan Woerner, “Thepower of quantum neural networks,” arXiv preprintarXiv:2011.00027 (2020).

[78] Zoë Holmes, Kunal Sharma, M. Cerezo, and Patrick J Coles, “Connecting ansatz expressibility to gradient magnitudes and barren plateaus,” arXiv preprint arXiv:2101.02138 (2021).

[79] Gian Giacomo Guerreschi and Mikhail Smelyanskiy, “Practical optimization for hybrid quantum-classical algorithms,” arXiv preprint arXiv:1701.01450 (2017).

[80] K. Mitarai, M. Negoro, M. Kitagawa, and K. Fujii, “Quantum circuit learning,” Phys. Rev. A 98, 032309 (2018).

[81] Maria Schuld, Ville Bergholm, Christian Gogolin, Josh Izaac, and Nathan Killoran, “Evaluating analytic gradients on quantum hardware,” Phys. Rev. A 99, 032331 (2019).

[82] Ville Bergholm, Josh Izaac, Maria Schuld, Christian Gogolin, M Sohaib Alam, Shahnawaz Ahmed, Juan Miguel Arrazola, Carsten Blank, Alain Delgado, Soran Jahangiri, et al., “Pennylane: Automatic differentiation of hybrid quantum-classical computations,” arXiv preprint arXiv:1811.04968 (2018).

[83] Andrea Mari, Thomas R Bromley, and Nathan Killoran, “Estimating the gradient and higher-order derivatives on quantum hardware,” Physical Review A 103, 012405 (2021).

[84] M. Cerezo and Patrick J Coles, “Impact of barren plateaus on the hessian and higher order derivatives,” arXiv preprint arXiv:2008.07454 (2020).

[85] Ken M Nakanishi, Keisuke Fujii, and Synge Todo, “Sequential minimal optimization for quantum-classical hybrid algorithms,” Physical Review Research 2, 043158 (2020).

[86] Bálint Koczor and Simon C Benjamin, “Quantum analytic descent,” arXiv preprint arXiv:2008.13774 (2020).

[87] Sam McArdle, Tyson Jones, Suguru Endo, Ying Li, Simon C Benjamin, and Xiao Yuan, “Variational ansatz-based quantum simulation of imaginary time evolution,” npj Quantum Information 5, 1–6 (2019).

[88] James Stokes, Josh Izaac, Nathan Killoran, and Giuseppe Carleo, “Quantum natural gradient,” Quantum 4, 269 (2020).

[89] Bálint Koczor and Simon C Benjamin, “Quantum natural gradient generalised to non-unitary circuits,” arXiv preprint arXiv:1912.08660 (2019).

[90] Ying Li and Simon C Benjamin, “Efficient variational quantum simulator incorporating active error minimization,” Physical Review X 7, 021050 (2017).

[91] Xiao Yuan, Suguru Endo, Qi Zhao, Ying Li, and Simon C Benjamin, “Theory of variational quantum simulation,” Quantum 3, 191 (2019).

[92] Suguru Endo, Jinzhao Sun, Ying Li, Simon C Benjamin, and Xiao Yuan, “Variational quantum simulation of general processes,” Physical Review Letters 125, 010501 (2020).

[93] Kosuke Mitarai and Keisuke Fujii, “Methodology for replacing indirect measurements with direct measurements,” Phys. Rev. Research 1, 013006 (2019).

[94] Lennart Bittel and Martin Kliesch, “Training variational quantum algorithms is np-hard–even for logarithmically many qubits and free fermionic systems,” arXiv preprint arXiv:2101.07267 (2021).

[95] Diederik P Kingma and Jimmy Ba, “Adam: A method for stochastic optimization,” in Proceedings of the 3rd International Conference on Learning Representations (ICLR) (2015).

[96] Jonas M Kübler, Andrew Arrasmith, Lukasz Cincio, and Patrick J Coles, “An adaptive optimizer for measurement-frugal variational algorithms,” Quantum 4, 263 (2020).

[97] Ryan Sweke, Frederik Wilde, Johannes Jakob Meyer, Maria Schuld, Paul K Fährmann, Barthélémy Meynard-Piganeau, and Jens Eisert, “Stochastic gradient descent for hybrid quantum-classical optimization,” Quantum 4, 314 (2020).

[98] Max Wilson, Sam Stromswold, Filip Wudarski, Stuart Hadfield, Norm M Tubman, and Eleanor Rieffel, “Optimizing quantum heuristics with meta-learning,” arXiv preprint arXiv:1908.03185 (2019).

[99] James C Spall, “Multivariate stochastic approximation using a simultaneous perturbation gradient approximation,” IEEE transactions on automatic control 37, 332–341 (1992).

[100] Robert M Parrish, Joseph T Iosue, Asier Ozaeta, and Peter L McMahon, “A jacobi diagonalization and anderson acceleration algorithm for variational quantum algorithm parameter optimization,” arXiv preprint arXiv:1904.03206 (2019).

[101] Patrick Huembeli and Alexandre Dauphin, “Characterizing the loss landscape of variational quantum circuits,” Quantum Science and Technology 6, 025011 (2021).

[102] Aram Harrow and John Napp, “Low-depth gradient measurements can improve convergence in variational hybrid quantum-classical algorithms,” arXiv preprint arXiv:1901.05374 (2019).

[103] Jacob Biamonte, “Universal variational quantum computation,” Phys. Rev. A 103, L030401 (2021).

[104] Daniel S. Abrams and Seth Lloyd, “Quantum algorithm providing exponential speed increase for finding eigenvalues and eigenvectors,” Phys. Rev. Lett. 83, 5162–5165 (1999).

[105] Alán Aspuru-Guzik, Anthony D Dutoi, Peter J Love, and Martin Head-Gordon, “Simulated quantum computation of molecular energies,” Science 309, 1704–1707 (2005).

[106] Abhinav Kandala, Kristan Temme, Antonio D Córcoles, Antonio Mezzacapo, Jerry M Chow, and Jay M Gambetta, “Error mitigation extends the computational reach of a noisy quantum processor,” Nature 567, 491–495 (2019).

[107] Oscar Higgott, Daochen Wang, and Stephen Brierley, “Variational quantum computation of excited states,” Quantum 3, 156 (2019).

[108] Harry Buhrman, Richard Cleve, John Watrous, and Ronald De Wolf, “Quantum fingerprinting,” Physical Review Letters 87, 167902 (2001).

[109] Tyson Jones, Suguru Endo, Sam McArdle, Xiao Yuan, and Simon C Benjamin, “Variational quantum algorithms for discovering hamiltonian spectra,” Physical Review A 99, 062304 (2019).

[110] Ken M Nakanishi, Kosuke Mitarai, and Keisuke Fujii, “Subspace-search variational quantum eigensolver for excited states,” Physical Review Research 1, 033062 (2019).

[111] Jarrod R McClean, Matthew P Harrigan, Masoud Mohseni, Nicholas C Rubin, Zhang Jiang, Sergio Boixo, Vadim N Smelyanskiy, Ryan Babbush, and Hartmut Neven, “Low depth mechanisms for quantum optimization,” arXiv preprint arXiv:2008.08615 (2020).

[112] A Garcia-Saez and JI Latorre, “Addressing hard classical problems with adiabatically assisted variational quantum eigensolvers,” arXiv preprint arXiv:1806.02287 (2018).

[113] M. Cerezo, Kunal Sharma, Andrew Arrasmith, and Patrick J Coles, “Variational quantum state eigensolver,” arXiv preprint arXiv:2004.01372 (2020).

[114] Daochen Wang, Oscar Higgott, and Stephen Brierley, “Accelerated variational quantum eigensolver,” Physical Review Letters 122, 140504 (2019).

[115] Guoming Wang, Dax Enshan Koh, Peter D Johnson, and Yudong Cao, “Minimizing estimation runtime on noisy quantum computers,” PRX Quantum 2, 010346 (2021).

[116] Guoming Wang, Dax Enshan Koh, Peter D Johnson, and Yudong Cao, “Bayesian inference with engineered likelihood functions for robust amplitude estimation,” arXiv preprint arXiv:2006.09350 (2020).

[117] M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information: 10th Anniversary Edition, 10th ed. (Cambridge University Press, New York, NY, USA, 2011).

[118] AD McLachlan, “A variational solution of the time-dependent schrodinger equation,” Molecular Physics 8, 39–44 (1964).

[119] Yong-Xin Yao, Niladri Gomes, Feng Zhang, Thomas Iadecola, Cai-Zhuang Wang, Kai-Ming Ho, and Peter P. Orth, “Adaptive variational quantum dynamics simulations,” arXiv preprint arXiv:2011.00622 (2020).

[120] Zi-Jian Zhang, Jinzhao Sun, Xiao Yuan, and Man-Hong Yung, “Low-depth hamiltonian simulation by adaptive product formula,” arXiv preprint arXiv:2011.05283 (2020).

[121] Kentaro Heya, Ken M Nakanishi, Kosuke Mitarai, and Keisuke Fujii, “Subspace variational quantum simulator,” arXiv preprint arXiv:1904.08566 (2019).

[122] Cristina Cirstoiu, Zoe Holmes, Joseph Iosue, Lukasz Cincio, Patrick J Coles, and Andrew Sornborger, “Variational fast forwarding for quantum simulation beyond the coherence time,” npj Quantum Information 6, 1–10 (2020).

[123] Joe Gibbs, Kaitlin Gili, Zoë Holmes, Benjamin Commeau, Andrew Arrasmith, Lukasz Cincio, Patrick J Coles, and Andrew Sornborger, “Long-time simulations with high fidelity on quantum hardware,” arXiv preprint arXiv:2102.04313 (2021).

[124] S. Khatri, R. LaRose, A. Poremba, L. Cincio, A. T. Sornborger, and P. J. Coles, “Quantum-assisted quantum compiling,” Quantum 3, 140 (2019).

[125] Benjamin Commeau, M. Cerezo, Zoë Holmes, Lukasz Cincio, Patrick J Coles, and Andrew Sornborger, “Variational hamiltonian diagonalization for dynamical quantum simulation,” arXiv preprint arXiv:2009.02559 (2020).

[126] Nikolaj Moll, Panagiotis Barkoutsos, Lev S Bishop, Jerry M Chow, Andrew Cross, Daniel J Egger, Stefan Filipp, Andreas Fuhrer, Jay M Gambetta, Marc Ganzhorn, et al., “Quantum optimization using variational algorithms on near-term quantum devices,” Quantum Science and Technology 3, 030503 (2018).

[127] Cedric Yen-Yu Lin and Yechao Zhu, “Performance of qaoa on typical instances of constraint satisfaction problems with bounded degree,” arXiv preprint arXiv:1601.01744 (2016).

[128] Z. Wang, S. Hadfield, Z. Jiang, and E. G. Rieffel, “Quantum approximate optimization algorithm for MaxCut: A fermionic view,” Phys. Rev. A 97, 022304 (2018).

[129] Ruslan Shaydulin, Ilya Safro, and Jeffrey Larson, “Multistart methods for quantum approximate optimization,” in 2019 IEEE High Performance Extreme Computing Conference (HPEC) (IEEE, 2019) pp. 1–8.

[130] Jonathan Romero, Ryan Babbush, Jarrod R McClean, Cornelius Hempel, Peter J Love, and Alán Aspuru-Guzik, “Strategies for quantum computing molecular energies using the unitary coupled cluster ansatz,” Quantum Science and Technology 4, 014008 (2018).

[131] Gavin E Crooks, “Performance of the quantum approximate optimization algorithm on the maximum cut problem,” arXiv preprint arXiv:1811.08419 (2018).

[132] Dave Wecker, Matthew B Hastings, and Matthias Troyer, “Training a quantum optimizer,” Physical Review A 94, 022309 (2016).

[133] Sami Khairy, Ruslan Shaydulin, Lukasz Cincio, Yuri Alexeev, and Prasanna Balaprakash, “Learning to optimize variational quantum circuits to solve combinatorial problems,” Proceedings of the AAAI Conference on Artificial Intelligence 34, 2367–2375 (2020).

[134] A Ambainis, “Variable time amplitude amplification and a faster quantum algorithm for solving systems of linear equations,” in 29th Int. Symp. Theoretical Aspects of Computer Science (STACS 2012), Vol. 14 (2012) pp. 636–47.

[135] Y. Subaşı, R. D. Somma, and D. Orsucci, “Quantum algorithms for systems of linear equations inspired by adiabatic quantum computing,” Phys. Rev. Lett. 122, 060504 (2019).

[136] A. Childs, R. Kothari, and R. Somma, “Quantum algorithm for systems of linear equations with exponentially improved dependence on precision,” SIAM J. Computing 46, 1920–1950 (2017).

[137] Shantanav Chakraborty, András Gilyén, and Stacey Jeffery, “The Power of Block-Encoded Matrix Powers: Improved Regression Techniques via Faster Hamiltonian Simulation,” in 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), Leibniz International Proceedings in Informatics (LIPIcs), Vol. 132 (2019) pp. 33:1–33:14.

[138] A. Scherer, B. Valiron, S.-C. Mau, S. Alexander, E. Van den Berg, and T. E. Chapuran, “Concrete resource analysis of the quantum linear-system algorithm used to compute the electromagnetic scattering cross section of a 2D target,” Quantum Information Processing 16, 60 (2017).

[139] Carlos Bravo-Prieto, Ryan LaRose, M. Cerezo, Yigit Subasi, Lukasz Cincio, and Patrick J Coles, “Variational quantum linear solver: A hybrid algorithm for linear systems,” arXiv preprint arXiv:1909.05820 (2019).

[140] Xiaosi Xu, Jinzhao Sun, Suguru Endo, Ying Li, Simon C Benjamin, and Xiao Yuan, “Variational algorithms for linear algebra,” arXiv preprint arXiv:1909.03898 (2019).

[141] Hsin-Yuan Huang, Kishor Bharti, and Patrick Rebentrost, “Near-term quantum algorithms for linear systems of equations,” arXiv preprint arXiv:1909.07344 (2019).

[142] Yiğit Subaşı, Rolando D Somma, and Davide Orsucci, “Quantum algorithms for systems of linear equations inspired by adiabatic quantum computing,” Physical Review Letters 122, 060504 (2019).

[143] Michael Lubasch, Jaewoo Joo, Pierre Moinier, Martin Kiffner, and Dieter Jaksch, “Variational quantum algorithms for nonlinear problems,” Physical Review A 101, 010301 (2020).

[144] Oleksandr Kyriienko, Annie E Paine, and Vincent E Elfving, “Solving nonlinear differential equations with differentiable quantum circuits,” arXiv preprint arXiv:2011.10395 (2020).

[145] Eric Anschuetz, Jonathan Olson, Alán Aspuru-Guzik, and Yudong Cao, “Variational quantum factoring,” in International Workshop on Quantum Technology and Optimization Problems (Springer, 2019) pp. 74–85.

[146] Seth Lloyd, Masoud Mohseni, and Patrick Rebentrost, “Quantum principal component analysis,” Nature Physics 10, 631–633 (2014).

[147] Ryan LaRose, Arkin Tikku, Étude O’Neel-Judy, Lukasz Cincio, and Patrick J Coles, “Variational quantum state diagonalization,” npj Quantum Information 5, 1–10 (2019).

[148] Kentaro Heya, Yasunari Suzuki, Yasunobu Nakamura, and Keisuke Fujii, “Variational quantum gate optimization,” arXiv preprint arXiv:1810.12745 (2018).

[149] Tyson Jones and Simon C Benjamin, “Quantum compilation and circuit optimisation via energy dissipation,” arXiv preprint arXiv:1811.03147 (2018).

[150] Kunal Sharma, Sumeet Khatri, M. Cerezo, and Patrick J Coles, “Noise resilience of variational quantum compiling,” New Journal of Physics 22, 043006 (2020).

[151] Jacques Carolan, Masoud Mohseni, Jonathan P Olson, Mihika Prabhu, Changchen Chen, Darius Bunandar, Murphy Yuezhen Niu, Nicholas C Harris, Franco NC Wong, Michael Hochberg, et al., “Variational quantum unsampling on a quantum photonic processor,” Nature Physics 16, 322–327 (2020).

[152] Peter D Johnson, Jonathan Romero, Jonathan Olson, Yudong Cao, and Alán Aspuru-Guzik, “Qvector: an algorithm for device-tailored quantum error correction,” arXiv preprint arXiv:1711.02249 (2017).

[153] Xiaosi Xu, Simon C Benjamin, and Xiao Yuan, “Variational circuit compiler for quantum error correction,” arXiv preprint arXiv:1911.05759 (2019).

[154] Jacob Biamonte, Peter Wittek, Nicola Pancotti, Patrick Rebentrost, Nathan Wiebe, and Seth Lloyd, “Quantum machine learning,” Nature 549, 195–202 (2017).

[155] Edward Farhi and Hartmut Neven, “Classification with quantum neural networks on near term processors,” arXiv preprint arXiv:1802.06002 (2018).

[156] Maria Schuld, Alex Bocharov, Krysta M Svore, and Nathan Wiebe, “Circuit-centric quantum classifiers,” Physical Review A 101, 032308 (2020).

[157] Maria Schuld and Nathan Killoran, “Quantum machine learning in feature hilbert spaces,” Physical Review Letters 122, 040504 (2019).

[158] Vojtěch Havlíček, Antonio D Córcoles, Kristan Temme, Aram W Harrow, Abhinav Kandala, Jerry M Chow, and Jay M Gambetta, “Supervised learning with quantum-enhanced feature spaces,” Nature 567, 209–212 (2019).

[159] Edwin Stoudenmire and David J Schwab, “Supervised learning with tensor networks,” in Advances in Neural Information Processing Systems (2016) pp. 4799–4807.

[160] Seth Lloyd, Maria Schuld, Aroosa Ijaz, Josh Izaac, and Nathan Killoran, “Quantum embeddings for machine learning,” arXiv preprint arXiv:2001.03622 (2020).

[161] Adrián Pérez-Salinas, Alba Cervera-Lierta, Elies Gil-Fuster, and José I Latorre, “Data re-uploading for a universal quantum classifier,” Quantum 4, 226 (2020).

[162] Takeru Kusumoto, Kosuke Mitarai, Keisuke Fujii, Masahiro Kitagawa, and Makoto Negoro, “Experimental quantum kernel machine learning with nuclear spins in a solid,” arXiv preprint arXiv:1911.12021 (2019).

[163] J. Romero, J. P. Olson, and A. Aspuru-Guzik, “Quantum autoencoders for efficient compression of quantum data,” Quantum Science and Technology 2, 045001 (2017).

[164] Kwok Ho Wan, Oscar Dahlsten, Hlér Kristjánsson, Robert Gardner, and MS Kim, “Quantum generalisation of feedforward neural networks,” npj Quantum Information 3, 36 (2017).

[165] Guillaume Verdon, Jason Pye, and Michael Broughton, “A universal training algorithm for quantum deep learning,” arXiv preprint arXiv:1806.09729 (2018).

[166] M. Cerezo, Akira Sone, Tyler Volkoff, Lukasz Cincio, and Patrick J Coles, “Cost function dependent barren plateaus in shallow parametrized quantum circuits,” Nature Communications 12, 1–12 (2021).

[167] Chenfeng Cao and Xin Wang, “Noise-assisted quantum autoencoder,” arXiv preprint arXiv:2012.08331 (2020).

[168] Alex Pepper, Nora Tischler, and Geoff J Pryde, “Experimental realization of a quantum autoencoder: The compression of qutrits via machine learning,” Physical Review Letters 122, 060501 (2019).

[169] Guillaume Verdon, Michael Broughton, and Jacob Biamonte, “A quantum algorithm to train neural networks using low-depth circuits,” arXiv preprint arXiv:1712.05304 (2017).

[170] Marcello Benedetti, Delfina Garcia-Pintos, Oscar Perdomo, Vicente Leyton-Ortega, Yunseong Nam, and Alejandro Perdomo-Ortiz, “A generative modeling approach for benchmarking and training shallow quantum circuits,” npj Quantum Information 5, 1–9 (2019).

[171] Yuxuan Du, Min-Hsiu Hsieh, Tongliang Liu, and Dacheng Tao, “Expressive power of parametrized quantum circuits,” Physical Review Research 2, 033125 (2020).

[172] Jin-Guo Liu and Lei Wang, “Differentiable learning of quantum circuit born machines,” Physical Review A 98, 062324 (2018).

[173] Brian Coyle, Daniel Mills, Vincent Danos, and Elham Kashefi, “The born supremacy: Quantum advantage and training of an ising born machine,” npj Quantum Information 6, 1–11 (2020).

[174] Jonathan Romero and Alán Aspuru-Guzik, “Variational quantum generators: Generative adversarial quantum machine learning for continuous distributions,” arXiv preprint arXiv:1901.00848 (2019).

[175] MV Altaisky, “Quantum neural network,” arXiv preprint quant-ph/0107012 (2001).

[176] Kerstin Beer, Dmytro Bondarenko, Terry Farrelly, Tobias J Osborne, Robert Salzmann, Daniel Scheiermann, and Ramona Wolf, “Training deep quantum neural networks,” Nature communications 11, 1–6 (2020).

[177] Iris Cong, Soonwon Choi, and Mikhail D Lukin, “Quantum convolutional neural networks,” Nature Physics 15, 1273–1278 (2019).

[178] Lukas Franken and Bogdan Georgiev, “Explorations in quantum neural networks with intermediate measurements,” in Proceedings of ESANN (2020).

[179] Arthur Pesah, M. Cerezo, Samson Wang, Tyler Volkoff, Andrew T Sornborger, and Patrick J Coles, “Absence of barren plateaus in quantum convolutional neural networks,” arXiv preprint arXiv:2011.02966 (2020).

[180] Kaining Zhang, Min-Hsiu Hsieh, Liu Liu, and Dacheng Tao, “Toward trainability of quantum neural networks,” arXiv preprint arXiv:2011.06258 (2020).

[181] Wojciech Hubert Zurek, “Quantum darwinism,” Nature physics 5, 181–188 (2009).

[182] A. Arrasmith, L. Cincio, A. T. Sornborger, W. H. Zurek, and P. J. Coles, “Variational consistent histories as a hybrid algorithm for quantum foundations,” Nature communications 10, 3438 (2019).

[183] Robert B Griffiths, Consistent quantum theory (Cambridge University Press, 2003).

[184] Zoë Holmes, Andrew Arrasmith, Bin Yan, Patrick J Coles, Andreas Albrecht, and Andrew T Sornborger, “Barren plateaus preclude learning scramblers,” arXiv preprint arXiv:2009.14808 (2020).

[185] Patrick Hayden and John Preskill, “Black holes as mirrors: quantum information in random subsystems,” Journal of high energy physics 2007, 120 (2007).

[186] Mark M Wilde, Quantum information theory (Cambridge University Press, 2013).

[187] B. Rosgen and J. Watrous, “On the hardness of distinguishing mixed-state quantum computations,” in 20th Annual IEEE Conference on Computational Complexity (CCC’05) (2005) pp. 344–354.

[188] M. Cerezo, Alexander Poremba, Lukasz Cincio, and Patrick J Coles, “Variational quantum fidelity estimation,” Quantum 4, 248 (2020).

[189] Carlos Bravo-Prieto, Diego García-Martín, and José I Latorre, “Quantum singular value decomposer,” Physical Review A 101, 062310 (2020).

[190] Bálint Koczor, Suguru Endo, Tyson Jones, Yuichiro Matsuzaki, and Simon C Benjamin, “Variational-state quantum metrology,” New Journal of Physics 22, 083038 (2020).

[191] Raphael Kaubruegger, Pietro Silvi, Christian Kokail, Rick van Bijnen, Ana Maria Rey, Jun Ye, Adam M Kaufman, and Peter Zoller, “Variational spin-squeezing algorithms on programmable quantum sensors,” Physical Review Letters 123, 260505 (2019).

[192] Ziqi Ma, Pranav Gokhale, Tian-Xing Zheng, Sisi Zhou, Xiaofei Yu, Liang Jiang, Peter Maurer, and Frederic T Chong, “Adaptive circuit learning for quantum metrology,” arXiv preprint arXiv:2010.08702 (2020).

[193] Jacob L Beckey, M. Cerezo, Akira Sone, and Patrick J Coles, “Variational quantum algorithm for estimating the quantum fisher information,” arXiv preprint arXiv:2010.10488 (2020).

[194] Jarrod R McClean, Sergio Boixo, Vadim N Smelyanskiy, Ryan Babbush, and Hartmut Neven, “Barren plateaus in quantum neural network training landscapes,” Nature communications 9, 4812 (2018).

[195] Andrew Arrasmith, M. Cerezo, Piotr Czarnik, Lukasz Cincio, and Patrick J Coles, “Effect of barren plateaus on gradient-free optimization,” arXiv preprint arXiv:2011.12245 (2020).

[196] Aram W Harrow and Richard A Low, “Random quantum circuits are approximate 2-designs,” Communications in Mathematical Physics 291, 257–302 (2009).

[197] Fernando GSL Brandao, Aram W Harrow, and Michał Horodecki, “Local random quantum circuits are approximate polynomial-designs,” Communications in Mathematical Physics 346, 397–434 (2016).

[198] Alexey Uvarov, Jacob D. Biamonte, and Dmitry Yudin, “Variational quantum eigensolver for frustrated quantum systems,” Phys. Rev. B 102, 075104 (2020).

[199] Alexey Uvarov and Jacob Biamonte, “On barren plateaus and cost function locality in variational quantum algorithms,” arXiv preprint arXiv:2011.10530 (2020).

[200] Kunal Sharma, M. Cerezo, Lukasz Cincio, and Patrick J Coles, “Trainability of dissipative perceptron-based quantum neural networks,” arXiv preprint arXiv:2005.12458 (2020).

[201] Carlos Ortiz Marrero, Mária Kieferová, and Nathan Wiebe, “Entanglement induced barren plateaus,” arXiv preprint arXiv:2010.15968 (2020).

[202] Samson Wang, Enrico Fontana, M. Cerezo, Kunal Sharma, Akira Sone, Lukasz Cincio, and Patrick J Coles, “Noise-induced barren plateaus in variational quantum algorithms,” arXiv preprint arXiv:2007.14384 (2020).

[203] Daniel Stilck Franca and Raul Garcia-Patron, “Limitations of optimization algorithms on noisy quantum devices,” arXiv preprint arXiv:2009.05532 (2020).

[204] Leo Zhou, Sheng-Tao Wang, Soonwon Choi, Hannes Pichler, and Mikhail D Lukin, “Quantum approximate optimization algorithm: Performance, mechanism, and implementation on near-term devices,” Physical Review X 10, 021067 (2020).

[205] Edward Grant, Leonard Wossnig, Mateusz Ostaszewski, and Marcello Benedetti, “An initialization strategy for addressing barren plateaus in parametrized quantum circuits,” Quantum 3, 214 (2019).

[206] Tyler Volkoff and Patrick J Coles, “Large gradients via correlation in random parameterized quantum circuits,” Quantum Science and Technology 6, 025008 (2021).

[207] Andrea Skolik, Jarrod R McClean, Masoud Mohseni, Patrick van der Smagt, and Martin Leib, “Layerwise learning for quantum neural networks,” arXiv preprint arXiv:2006.14904 (2020).

[208] Ernesto Campos, Aly Nasrallah, and Jacob Biamonte, “Abrupt transitions in variational quantum circuit training,” Physical Review A 103, 032607 (2021).

[209] Guillaume Verdon, Michael Broughton, Jarrod R McClean, Kevin J Sung, Ryan Babbush, Zhang Jiang, Hartmut Neven, and Masoud Mohseni, “Learning to learn with quantum neural networks via classical neural networks,” arXiv preprint arXiv:1907.05415 (2019).

[210] Abhinav Anand, Matthias Degroote, and Alán Aspuru-Guzik, “Natural evolutionary strategies for variational quantum computation,” arXiv preprint arXiv:2012.00101 (2020).

[211] Zhenyu Cai, “Resource estimation for quantum variational simulations of the hubbard model,” Phys. Rev. Applied 14, 014059 (2020).

[212] Chris Cade, Lana Mineh, Ashley Montanaro, and Stasja Stanisic, “Strategies for solving the fermi-hubbard model on near-term quantum computers,” Physical Review B 102, 235122 (2020).

[213] Andrew Jena, Scott Genin, and Michele Mosca, “Pauli partitioning with respect to gate sets,” arXiv preprint arXiv:1907.07859 (2019).

[214] Ophelia Crawford, Barnaby van Straaten, Daochen Wang, Thomas Parks, Earl Campbell, and Stephen Brierley, “Efficient quantum measurement of pauli operators in the presence of finite sampling error,” Quantum 5, 385 (2021).

[215] Vladyslav Verteletskyi, Tzu-Ching Yen, and Artur F Izmaylov, “Measurement optimization in the variational quantum eigensolver using a minimum clique cover,” The Journal of Chemical Physics 152, 124114 (2020).

[216] Artur F Izmaylov, Tzu-Ching Yen, Robert A Lang, and Vladyslav Verteletskyi, “Unitary partitioning approach to the measurement problem in the variational quantum eigensolver method,” Journal of Chemical Theory and Computation 16, 190–195 (2019).

[217] Andrew Zhao, Andrew Tranter, William M Kirby, Shu Fay Ung, Akimasa Miyake, and Peter J Love, “Measurement reduction in variational quantum algorithms,” Physical Review A 101, 062322 (2020).

[218] Tzu-Ching Yen, Vladyslav Verteletskyi, and Artur F Izmaylov, “Measuring all compatible operators in one series of single-qubit measurements using unitary transformations,” Journal of Chemical Theory and Computation 16, 2400–2409 (2020).

[219] Pranav Gokhale and Frederic T Chong, “O(N^3) measurement cost for variational quantum eigensolver on molecular hamiltonians,” arXiv preprint arXiv:1908.11857 (2019).

[220] Jarrod R McClean, Jonathan Romero, Ryan Babbush, and Alán Aspuru-Guzik, “The theory of variational hybrid quantum-classical algorithms,” New Journal of Physics 18, 023023 (2016).





[221] William J Huggins, Jarrod R McClean, Nicholas C Rubin, Zhang Jiang, Nathan Wiebe, K Birgitta Whaley, and Ryan Babbush, “Efficient and noise resilient measurements for quantum chemistry on near-term quantum computers,” npj Quantum Information 7, 1–9 (2021).

[222] Nicholas C Rubin, Ryan Babbush, and Jarrod McClean, “Application of fermionic marginal constraints to hybrid quantum algorithms,” New Journal of Physics 20, 053020 (2018).

[223] Andrew Arrasmith, Lukasz Cincio, Rolando D Somma, and Patrick J Coles, “Operator sampling for shot-frugal optimization in variational algorithms,” arXiv preprint arXiv:2004.06252 (2020).

[224] Barnaby van Straaten and Bálint Koczor, “Measurement cost of metric-aware variational quantum algorithms,” arXiv preprint arXiv:2005.05172 (2020).

[225] Hsin-Yuan Huang, Richard Kueng, and John Preskill, “Predicting many properties of a quantum system from very few measurements,” Nature Physics 16, 1050–1057 (2020).

[226] Charles Hadfield, Sergey Bravyi, Rudy Raymond, and Antonio Mezzacapo, “Measurements of quantum hamiltonians with locally-biased classical shadows,” arXiv preprint arXiv:2006.15788 (2020).

[227] Giacomo Torlai, Guglielmo Mazzola, Giuseppe Carleo, and Antonio Mezzacapo, “Precise measurement of quantum observables with neural-network estimators,” Physical Review Research 2, 022060 (2020).

[228] Laura Gentini, Alessandro Cuccoli, Stefano Pirandola, Paola Verrucchi, and Leonardo Banchi, “Noise-resilient variational hybrid quantum-classical optimization,” Physical Review A 102, 052414 (2020).

[229] Enrico Fontana, M. Cerezo, Andrew Arrasmith, Ivan Rungger, and Patrick J Coles, “Optimizing parametrized quantum circuits via noise-induced breaking of symmetries,” arXiv preprint arXiv:2011.08763 (2020).

[230] Cheng Xue, Zhao-Yun Chen, Yu-Chun Wu, and Guo-Ping Guo, “Effects of quantum noise on quantum approximate optimization algorithm,” arXiv preprint arXiv:1909.02196 (2019).

[231] Jeffrey Marshall, Filip Wudarski, Stuart Hadfield, and Tad Hogg, “Characterizing local noise in QAOA circuits,” IOP SciNotes 1, 025208 (2020).

[232] Isaac H Kim, “Noise-resilient preparation of quantum many-body ground states,” arXiv preprint arXiv:1703.00032 (2017).

[233] Suguru Endo, Zhenyu Cai, Simon C Benjamin, and Xiao Yuan, “Hybrid quantum-classical algorithms and quantum error mitigation,” Journal of the Physical Society of Japan 90, 032001 (2021).

[234] Kristan Temme, Sergey Bravyi, and Jay M Gambetta, “Error mitigation for short-depth quantum circuits,” Physical Review Letters 119, 180509 (2017).

[235] Suguru Endo, Simon C Benjamin, and Ying Li, “Practical quantum error mitigation for near-future applications,” Physical Review X 8, 031027 (2018).

[236] Zhenyu Cai, “Multi-exponential error extrapolation andcombining error mitigation techniques for nisq applica-tions,” arXiv preprint arXiv:2007.01265 (2020).

[237] Matthew Otten and Stephen K Gray, “Recovering noise-free quantum observables,” Physical Review A 99,012338 (2019).

[238] Suguru Endo, Qi Zhao, Ying Li, Simon Benjamin, andXiao Yuan, “Mitigating algorithmic errors in a hamilto-nian simulation,” Physical Review A 99, 012334 (2019).

[239] Jinzhao Sun, Xiao Yuan, Takahiro Tsunoda, Vlatko Ve-dral, Simon C Benjamin, and Suguru Endo, “Mitigat-ing realistic noise in practical noisy intermediate-scalequantum devices,” Physical Review Applied 15, 034026(2021).

[240] Armands Strikis, Dayue Qin, Yanzhu Chen, Simon CBenjamin, and Ying Li, “Learning-based quantum errormitigation,” arXiv preprint arXiv:2005.07601 (2020).

[241] Piotr Czarnik, Andrew Arrasmith, Patrick J Coles, andLukasz Cincio, “Error mitigation with clifford quantum-circuit data,” arXiv preprint arXiv:2005.10189 (2020).

[242] Angus Lowe, Max Hunter Gordon, Piotr Czarnik, An-drew Arrasmith, Patrick J Coles, and Lukasz Cincio,“Unified approach to data-driven quantum error miti-gation,” arXiv arXiv:2011.01157 (2020).

[243] Sam McArdle, Xiao Yuan, and Simon Benjamin,“Error-mitigated digital quantum simulation,” PhysicalReview Letters 122, 180501 (2019).

[244] Xavi Bonet-Monroig, Ramiro Sagastizabal, M Singh,and TE O’Brien, “Low-cost error mitigation by symme-try verification,” Physical Review A 98, 062339 (2018).

[245] Jarrod R McClean, Zhang Jiang, Nicholas C Rubin,Ryan Babbush, and Hartmut Neven, “Decoding quan-tum errors with subspace expansions,” Nature Commu-nications 11, 1–9 (2020).

[246] Bálint Koczor, “Exponential error suppressionfor near-term quantum devices,” arXiv preprintarXiv:2011.05942 (2020).

[247] William J Huggins, Sam McArdle, Thomas E O’Brien,Joonho Lee, Nicholas C Rubin, Sergio Boixo, K BirgittaWhaley, Ryan Babbush, and Jarrod R McClean, “Vir-tual distillation for quantum error mitigation,” arXivpreprint arXiv:2011.07064 (2020).

[248] Sergey Bravyi, Sarah Sheldon, Abhinav Kandala,David C Mckay, and Jay M Gambetta, “Mitigatingmeasurement errors in multi-qubit experiments,” arXivpreprint arXiv:2006.14044 (2020).

[249] Daiqin Su, Robert Israel, Kunal Sharma, Haoyu Qi,Ish Dhand, and Kamil Brádler, “Error mitigation ona near-term quantum photonic device,” arXiv preprintarXiv:2008.06670 (2020).

[250] Yudong Cao, Johnathan Romero, and Alán Aspuru-Guzik, “Potential of quantum computing for drug dis-covery,” IBM Journal of Research and Development 62,6–1 (2018).

[251] Yudong Cao, Jonathan Romero, Jonathan P Olson,Matthias Degroote, Peter D Johnson, Mária Kieferová,Ian D Kivlichan, Tim Menke, Borja Peropadre, Nico-las PD Sawaya, et al., “Quantum chemistry in the ageof quantum computing,” Chemical reviews 119, 10856–10915 (2019).

[252] Carlos Outeiral, Martin Strahm, Jiye Shi, Garrett MMorris, Simon C Benjamin, and Charlotte M Deane,“The prospects of quantum computing in computa-tional molecular biology,” Wiley Interdisciplinary Re-views: Computational Molecular Science , e1481 (2020).

[253] Steven R White, “Density matrix formulation for quan-tum renormalization groups,” Physical Review Letters69, 2863 (1992).

[254] Garnet Kin-Lic Chan and Sandeep Sharma, “The den-sity matrix renormalization group in quantum chem-








https://doi.org/10.1038/s41567-020-0932-7

https://doi.org/10.1038/s41567-020-0932-7










https://doi.org/10.1088%2F2633-1357%2Fabb0d7



https://journals.jps.jp/doi/10.7566/JPSJ.90.032001

https://journals.jps.jp/doi/10.7566/JPSJ.90.032001







https://journals.aps.org/prapplied/abstract/10.1103/PhysRevApplied.15.034026

https://journals.aps.org/prapplied/abstract/10.1103/PhysRevApplied.15.034026

















https://dl.acm.org/doi/10.1147/JRD.2018.2888987

https://dl.acm.org/doi/10.1147/JRD.2018.2888987

https://pubs.acs.org/doi/abs/10.1021/acs.chemrev.8b00803

https://pubs.acs.org/doi/abs/10.1021/acs.chemrev.8b00803

https://onlinelibrary.wiley.com/doi/full/10.1002/wcms.1481

https://onlinelibrary.wiley.com/doi/full/10.1002/wcms.1481



31

istry,” Annual Review of Physical Chemistry 62, 465–481 (2011).

[255] Sam McArdle, Suguru Endo, Alan Aspuru-Guzik, Si-mon C Benjamin, and Xiao Yuan, “Quantum com-putational chemistry,” Reviews of Modern Physics 92,015003 (2020).

[256] Frank Arute, Kunal Arya, Ryan Babbush, Dave Ba-con, Joseph C. Bardin, Rami Barends, Sergio Boixo,et al., “Hartree-fock on a superconducting qubit quan-tum computer,” Science 369, 1084–1089 (2020).

[257] Artem A Bakulin, Stoichko D Dimitrov, Akshay Rao,Philip CY Chow, Christian B Nielsen, Bob C Schroeder,Iain McCulloch, Huib J Bakker, James R Durrant, andRichard H Friend, “Charge-transfer state dynamics fol-lowing hole and electron transfer in organic photovoltaicdevices,” The Journal of Physical Chemistry Letters 4,209–215 (2013).

[258] Markus Gross, David C Müller, Heinz-Georg Nothofer,Ulrich Scherf, Dieter Neher, Christoph Bräuchle, andKlaus Meerholz, “Improving the performance of dopedπ-conjugated polymers for use in organic light-emittingdiodes,” Nature 405, 661–665 (2000).

[259] JR Schmidt, Priya V Parandekar, and John C Tully,“Mixed quantum-classical equilibrium: Surface hop-ping,” The Journal of Chemical Physics 129, 044104(2008).

[260] Thomas E O’Brien, Bruno Senjean, Ramiro Sagas-tizabal, Xavier Bonet-Monroig, Alicja Dutkiewicz,Francesco Buda, Leonardo DiCarlo, and Lucas Viss-cher, “Calculating energy derivatives for quantum chem-istry on a quantum computer,” npj Quantum Informa-tion 5, 1–12 (2019).

[261] John C Tully and Richard K Preston, “Trajectory sur-face hopping approach to nonadiabatic molecular col-lisions: the reaction of h+ with d2,” The Journal ofChemical Physics 55, 562–572 (1971).

[262] David RWeinberg, Christopher J Gagliardi, Jonathan FHull, Christine Fecenko Murphy, Caleb A Kent,Brittany C Westlake, Amit Paul, Daniel H Ess,Dewey Granville McCafferty, and Thomas J Meyer,“Proton-coupled electron transfer,” Chemical Reviews112, 4016–4093 (2012).

[263] Walter Kohn and Lu Jeu Sham, “Self-consistent equa-tions including exchange and correlation effects,” Phys-ical review 140, A1133 (1965).

[264] Bela Bauer, Dave Wecker, Andrew J Millis, Matthew BHastings, and Matthias Troyer, “Hybrid quantum-classical approach to correlated materials,” Physical Re-view X 6, 031045 (2016).

[265] Ryan Babbush, Craig Gidney, Dominic W Berry,Nathan Wiebe, Jarrod McClean, Alexandru Paler,Austin Fowler, and Hartmut Neven, “Encoding elec-tronic spectra in quantum circuits with linear t com-plexity,” Physical Review X 8, 041015 (2018).

[266] Dominic W Berry, Mária Kieferová, Artur Scherer, Yu-val R Sanders, Guang Hao Low, Nathan Wiebe, CraigGidney, and Ryan Babbush, “Improved techniques forpreparing eigenstates of fermionic hamiltonians,” npjQuantum Information 4, 1–7 (2018).

[267] Pierre-Luc Dallaire-Demers, Jonathan Romero, LiborVeis, Sukin Sim, and Alán Aspuru-Guzik, “Low-depth circuit ansatz for preparing correlated fermionicstates on a quantum computer,” arXiv preprintarXiv:1801.01053 (2018).

[268] Eugene F Dumitrescu, Alex J McCaskey, Gaute Ha-gen, Gustav R Jansen, Titus D Morris, T Papenbrock,Raphael C Pooser, David Jarvis Dean, and PavelLougovski, “Cloud quantum computing of an atomic nu-cleus,” Physical Review Letters 120, 210501 (2018).

[269] Hsuan-Hao Lu, Natalie Klco, Joseph M Lukens, Ti-tus D Morris, Aaina Bansal, Andreas Ekström, GauteHagen, Thomas Papenbrock, Andrew M Weiner, Mar-tin J Savage, et al., “Simulations of subatomic many-body physics on a quantum frequency processor,” Phys-ical Review A 100, 012320 (2019).

[270] Alessandro Roggero, Andy CY Li, Joseph Carlson, Ra-jan Gupta, and Gabriel N Perdue, “Quantum comput-ing for neutrino-nucleus scattering,” Physical Review D101, 074038 (2020).

[271] Julian Bender, Erez Zohar, Alessandro Farace, andJ Ignacio Cirac, “Digital quantum simulation of latticegauge theories in three spatial dimensions,” New Jour-nal of Physics 20, 093001 (2018).

[272] Mari Carmen Banuls, Rainer Blatt, Jacopo Catani,Alessio Celi, Juan Ignacio Cirac, Marcello Dalmonte,Leonardo Fallani, Karl Jansen, Maciej Lewenstein, Si-mone Montangero, et al., “Simulating lattice gaugetheories within quantum technologies,” The Europeanphysical journal D 74, 1–42 (2020).

[273] John Preskill, “Simulating quantum field theory witha quantum computer,” arXiv preprint arXiv:1811.10085(2018).

[274] Suguru Endo, Iori Kurata, and Yuya O Nakagawa,“Calculation of the green’s function on near-term quan-tum computers,” Physical Review Research 2, 033281(2020).

[275] Chinmay Mishra, Shane Thompson, Raphael Pooser,and George Siopsis, “Quantum computation of an in-teracting fermionic model,” Quantum Science and Tech-nology 5, 035010 (2020).

[276] Danny Paulson, Luca Dellantonio, Jan F Haase, AlessioCeli, Angus Kan, Andrew Jena, Christian Kokail, Rickvan Bijnen, Karl Jansen, Peter Zoller, et al., “Towardssimulating 2d effects in lattice gauge theories on aquantum computer,” arXiv preprint arXiv:2008.09252(2020).

[277] A Avkhadiev, PE Shanahan, and RD Young, “Accel-erating lattice quantum field theory calculations via in-terpolator optimization using noisy intermediate-scalequantum computing,” Physical Review Letters 124,080501 (2020).

[278] Jad C Halimeh, Valentin Kasper, and Philipp Hauke,“Fate of lattice gauge theories under decoherence,”arXiv preprint arXiv:2009.07848 (2020).

[279] Kunal Sharma, M. Cerezo, Zoë Holmes, Lukasz Cincio,Andrew Sornborger, and Patrick J Coles, “Reformu-lation of the no-free-lunch theorem for entangled datasets,” arXiv preprint arXiv:2007.04900 (2020).

[280] Francisco Barahona, Martin Grötschel, Michael Jünger,and Gerhard Reinelt, “An application of combinatorialoptimization to statistical physics and circuit layout de-sign,” Operations Research 36, 493–513 (1988).

[281] Wolfgang Küchlin and Carsten Sinz, “Proving consis-tency assertions for automotive product data manage-ment,” Journal of Automated Reasoning 24, 145–163(2000).

[282] Edward Farhi, Jeffrey Goldstone, and Sam Gutmann,“A Quantum Approximate Optimization Algorithm Ap-

https://www.annualreviews.org/doi/abs/10.1146/annurev-physchem-032210-103338

https://www.annualreviews.org/doi/abs/10.1146/annurev-physchem-032210-103338

https://doi.org/10.1103/RevModPhys.92.015003

https://doi.org/10.1103/RevModPhys.92.015003

http://dx.doi.org/10.1126/science.abb9811

https://pubs.acs.org/doi/10.1021/jz301883y

https://pubs.acs.org/doi/10.1021/jz301883y

https://www.nature.com/articles/35015037



https://doi.org/10.1038/s41534-019-0213-4

https://doi.org/10.1038/s41534-019-0213-4



https://pubs.acs.org/doi/10.1021/cr200177j

https://pubs.acs.org/doi/10.1021/cr200177j

https://journals.aps.org/pr/abstract/10.1103/PhysRev.140.A1133

https://journals.aps.org/pr/abstract/10.1103/PhysRev.140.A1133









https://doi.org/10.1103/PhysRevA.100.012320

https://doi.org/10.1103/PhysRevA.100.012320

https://doi.org/10.1103/PhysRevD.101.074038

https://doi.org/10.1103/PhysRevD.101.074038

http://dx.doi.org/ 10.1088/1367-2630/aadb71

http://dx.doi.org/ 10.1088/1367-2630/aadb71

http://dx.doi.org/10.1140/epjd/e2020-100571-8

http://dx.doi.org/10.1140/epjd/e2020-100571-8



https://doi.org/10.1103/PhysRevResearch.2.033281

https://doi.org/10.1103/PhysRevResearch.2.033281

https://doi.org/10.1088/2058-9565/ab8f63

https://doi.org/10.1088/2058-9565/ab8f63



https://doi.org/10.1103/PhysRevLett.124.080501

https://doi.org/10.1103/PhysRevLett.124.080501



https://www.jstor.org/stable/170992?seq=1

https://link.springer.com/article/10.1023/A:1006370506164

https://link.springer.com/article/10.1023/A:1006370506164

32

plied to a Bounded Occurrence Constraint Problem,”arXiv preprint arXiv:1412.6062 (2014).

[283] Edward Farhi and Aram W Harrow, “Quantumsupremacy through the quantum approximate opti-mization algorithm,” arXiv preprint arXiv:1602.07674(2016).

[284] Matthew B Hastings, “Classical and quantum boundeddepth approximation algorithms,” arXiv preprintarXiv:1905.07047 (2019).

[285] Sergey Bravyi, Alexander Kliesch, Robert Koenig, andEugene Tang, “Obstacles to variational quantum opti-mization from symmetry protection,” Physical ReviewLetters 125, 260505 (2020).

[286] Matthew P Harrigan, Kevin J Sung, Matthew Neeley,Kevin J Satzinger, Frank Arute, Kunal Arya, Juan Ata-laya, Joseph C Bardin, Rami Barends, Sergio Boixo,et al., “Quantum approximate optimization of non-planar graph problems on a planar superconducting pro-cessor,” Nature Physics , 1–5 (2021).

[287] Maria Schuld, Ilya Sinayskiy, and Francesco Petruc-cione, “An introduction to quantum machine learning,”Contemporary Physics 56, 172–185 (2015).

[288] Nathan Wiebe, Ashish Kapoor, and Krysta MSvore, “Quantum deep learning,” arXiv preprintarXiv:1412.3489 (2014).

[289] Hsin-Yuan Huang, Richard Kueng, and John Preskill,“Information-theoretic bounds on quantum advantagein machine learning,” arXiv preprint arXiv:2101.02464(2021).

[290] Samuel Yen-Chi Chen, Chao-Han Huck Yang, Jun Qi,Pin-Yu Chen, Xiaoli Ma, and Hsi-Sheng Goan, “Vari-ational quantum circuits for deep reinforcement learn-ing,” IEEE Access 8, 141007–141024 (2020).

[291] Michael Broughton, Guillaume Verdon, Trevor Mc-Court, Antonio J Martinez, Jae Hyeon Yoo, Sergei VIsakov, Philip Massey, Murphy Yuezhen Niu, RaminHalavati, Evan Peters, et al., “Tensorflow quantum:A software framework for quantum machine learning,”arXiv preprint arXiv:2003.02989 (2020).

[292] Xiu-Zhe Luo, Jin-Guo Liu, Pan Zhang, and Lei Wang,“Yao. jl: Extensible, efficient framework for quantumalgorithm design,” Quantum 4, 341 (2020).

[293] Yuval R Sanders, Dominic W Berry, Pedro CS Costa,Louis W Tessler, Nathan Wiebe, Craig Gidney, Hart-mut Neven, and Ryan Babbush, “Compilation of fault-tolerant quantum heuristics for combinatorial optimiza-tion,” Physical Review X Quantum 1, 020312 (2020).

ACKNOWLEDGEMENTS

MC is thankful to Kunal Sharma for helpful discussions. MC was initially supported by the Laboratory Directed Research and Development (LDRD) program of Los Alamos National Laboratory (LANL) under project number 20180628ECR, and later supported by the Center for Nonlinear Studies at LANL. AA was initially supported by the LDRD program of LANL under project number 20200056DR, and later supported by the U.S. Department of Energy (DOE), Office of Science, Office of High Energy Physics QuantISED program under Contract Nos. DE-AC52-06NA25396 and KA2401032. SCB acknowledges financial support from EPSRC Hub grants under the agreement numbers EP/M013243/1 and EP/T001062/1, and from EU H2020-FETFLAG-03-2018 under the grant agreement No 820495 (AQTION). SE was supported by MEXT Quantum Leap Flagship Program (MEXT QLEAP) Grant Numbers JPMXS0120319794 and JPMXS0118068682, and by JST ERATO Grant Number JPMJER1601. KF was supported by JSPS KAKENHI Grant No. 16H02211, JST ERATO JPMJER1601, and JST CREST JPMJCR1673. KM was supported by JST PRESTO Grant No. JPMJPR2019 and JSPS KAKENHI Grant No. 20K22330. KM and KF were also supported by MEXT Quantum Leap Flagship Program (MEXT QLEAP) Grant Numbers JPMXS0118067394 and JPMXS0120319794. XY acknowledges support from the Simons Foundation. LC was initially supported by the LDRD program of LANL under project number 20190065DR, and later supported by the U.S. DOE, Office of Science, Office of Advanced Scientific Computing Research under the Quantum Computing Application Teams (QCAT) program. PJC was initially supported by the LANL ASC Beyond Moore’s Law project, and later supported by the U.S. DOE, Office of Science, Office of Advanced Scientific Computing Research, under the Accelerated Research in Quantum Computing (ARQC) program. Most recently, MC, LC, and PJC were supported by the Quantum Science Center (QSC), a National Quantum Information Science Research Center of the U.S. Department of Energy (DOE).

AUTHOR CONTRIBUTIONS

All authors have read, discussed and contributed to the writing of the manuscript.

COMPETING INTERESTS

The authors declare no competing interests.

KEY POINTS:

• Variational quantum algorithms (VQAs) are the leading proposal for achieving quantum advantage using near-term quantum computers.

• VQAs have been developed for a wide range of applications including finding ground states of molecules, simulating dynamics of quantum systems, and solving linear systems of equations, among others.

• VQAs share a common structure, where a task is encoded into a parameterized cost function that is evaluated using a quantum computer, and a classical optimizer trains the parameters in the VQA.

• The adaptive nature of VQAs is well suited to handle the constraints of near-term quantum computers.

• Trainability, accuracy, and efficiency are three challenges that arise when applying VQAs to large-scale applications, and strategies are currently being developed to address these challenges.
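
The common structure named in the key points (a parameterized cost function evaluated on the quantum side, with parameters trained by a classical optimizer) can be illustrated with a minimal sketch. Everything in it is an illustrative assumption rather than a method taken from this section: the one-qubit ansatz Ry(θ)|0⟩, the cost C(θ) = ⟨Z⟩, the use of the parameter-shift rule with plain gradient descent, the hypothetical function names, and the NumPy statevector calculation standing in for a quantum computer.

```python
import numpy as np

# Observable whose expectation value defines the cost (assumed for illustration).
Z = np.array([[1.0, 0.0], [0.0, -1.0]])


def ansatz_state(theta):
    """Statevector of the one-parameter ansatz Ry(theta)|0>."""
    return np.array([np.cos(theta / 2.0), np.sin(theta / 2.0)])


def cost(theta):
    """C(theta) = <psi(theta)|Z|psi(theta)>.

    On hardware this value would be estimated from repeated measurements;
    here it is computed exactly by classical simulation.
    """
    psi = ansatz_state(theta)
    return float(psi @ Z @ psi)


def parameter_shift_gradient(theta):
    """Gradient of the cost from two shifted cost evaluations."""
    shift = np.pi / 2.0
    return 0.5 * (cost(theta + shift) - cost(theta - shift))


def train(theta0=0.4, learning_rate=0.4, steps=100):
    """Classical outer loop: gradient descent on the ansatz parameter."""
    theta = theta0
    for _ in range(steps):
        theta -= learning_rate * parameter_shift_gradient(theta)
    return theta, cost(theta)


theta_opt, cost_opt = train()
print(f"theta = {theta_opt:.4f}, cost = {cost_opt:.6f}")
```

In this toy setting the cost reduces to cos θ, so the loop should drive θ toward π, where the cost reaches its minimum of -1. In a realistic VQA the cost would be estimated from a finite number of measurement shots on a noisy device, which is where the trainability, accuracy, and efficiency challenges listed above enter.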
