Variational Quantum Optimization with Continuous Bandits

Variational Quantum Optimization with Continuous Bandits
Notice: This research summary and analysis were automatically generated using AI technology. For absolute accuracy, please refer to the [Original Paper Viewer] below or the Original ArXiv Source.

We introduce a novel approach to variational Quantum algorithms (VQA) via continuous bandits. VQA are a class of hybrid Quantum-classical algorithms where the parameters of Quantum circuits are optimized by classical algorithms. Previous work has used zero and first order gradient based methods, however such algorithms suffer from the barren plateau (BP) problem where gradients and loss differences are exponentially small. We introduce an approach using bandits methods which combine global exploration with local exploitation. We show how VQA can be formulated as a best arm identification problem in a continuous space of arms with Lipschitz smoothness. While regret minimization has been addressed in this setting, existing methods for pure exploration only cover discrete spaces. We give the first results for pure exploration in a continuous setting and derive a fixed-confidence, information-theoretic, instance specific lower bound. Under certain assumptions on the expected payoff, we derive a simple algorithm, which is near-optimal with respect to our lower bound. Finally, we apply our continuous bandit algorithm to two VQA schemes: a PQC and a QAOA quantum circuit, showing that we significantly outperform the previously known state of the art methods (which used gradient based methods).


💡 Research Summary

This paper tackles one of the most pressing challenges in variational quantum algorithms (VQAs): the barren‑plateau phenomenon, which renders gradient‑based optimizers ineffective as the number of parameters grows. The authors recast VQA parameter optimization as a best‑arm identification problem in a continuous‑armed bandit setting. Each point x in the unit interval corresponds to a set of quantum circuit parameters, and pulling arm x yields a noisy estimate of the circuit’s expected objective value µ(x). Assuming that µ is L‑Lipschitz and the noise is sub‑Gaussian, the task becomes to locate an ε‑optimal arm with probability at least 1 – δ while using as few circuit evaluations as possible.

The theoretical contribution consists of two parts. First, the authors derive an information‑theoretic, instance‑specific lower bound on the expected sample complexity of any (ε, δ)‑PAC learner. The bound takes the form
E


Comments & Academic Discussion

Loading comments...

Leave a Comment