Grover 확산 연산자는 어떻게 작동하며 왜 최적입니까?

에서 이 대답 , 그로버의 알고리즘을 설명한다. 설명은 알고리즘이 Grover Diffusion Operator 에 크게 의존 하지만이 연산자의 내부 작동에 대한 세부 사항은 제공하지 않음을 나타냅니다 .

간단히 말하면, Grover Diffusion Operator는 '단계에 대한 역전'을 만들어 초기 단계의 작은 차이를 측정 할 수있을만큼 충분히 크게 만듭니다.

질문은 지금 :

Grover 확산 연산자는 어떻게 이것을 달성합니까?
정렬되지 않은 데이터베이스를 검색하는 데 총 시간이 걸리는 결과 가 왜 최적입니까? $O(\sqrt{n})$

algorithm grovers-algorithm

— 이산 도마뱀
소스

두 번째 질문에 대한 의견입니다. Grover 알고리즘의 상태 트랙이 알고리즘의 초기 상태와 대상 상태를 연결하는 측지선을 정확하게 따른다는 것을 보여주는 작품이 있습니다. 최적입니다.

— XXDD

답변:

$\newcommand{\bra}[1]{\left<#1\right|}\newcommand{\ket}[1]{\left|#1\right>}\newcommand{\braket}[2]{\left<#1\middle|#2\right>}\newcommand{\bke}[3]{\left<#1\middle|#2\middle|#3\right>}\newcommand{\proj}[1]{\left|#1\right>\left<#1\right|}$ 원래 질문 은 평신도의 설명에 관한 것이 었 으므로 나는 지속적인 시간 진화에 기초하여 이해하기 쉬운 (배경에 따라 다름) 약간 다른 해결책을 제시합니다. (그러나 나는 그것이 평신도에게 적합하다고 주장하지 않습니다.)

우리는 모든 국가의 균일 중첩는 초기 상태에서 시작, 우리가 상태를 발견하는 것을 목표로하고 있습니다, 오히려 모든 요구하는 것보다. 우리가이 상태들로부터 직교 정상적인 기초를 만든다면 설명하기가 더 쉽다.

| ψ ⟩ = \frac{1}{\sqrt{2^{n}}} \sum_{y \in {0, 1}^{n}} | y ⟩

$\ket{\psi}=\frac{1}{\sqrt{2^n}}\sum_{y\in\{0,1\}^n}\ket{y}$

정답으로 인식 할 수있다 (이 일반화 될 수 있지만, 이러한 하나 개의 상태가 정확하게 가정). 이를 위해 우리는 해밀턴의 행동에 따라 시간이 지남에 따라 진화합니다

그로버 검색의 가장 아름다운 기능은이 시점에서 수학을 두 상태의 하위 공간으로 줄일 수 있다는 것입니다

| x ⟩

$\ket{x}$

H = | x ⟩ ⟨ x | + | ψ ⟩ ⟨ ψ | .

$H=\proj{x}+\proj{\psi}.$

{| x ⟩, | ψ ⟩}

$\{\ket{x},\ket{\psi}\}$

2^{n}

$2^n$

곳

{| x ⟩, | ψ^{⊥} ⟩}

$\{\ket{x},\ket{\psi^\perp}\}$

이 기초를 사용하여 시간 진화

으로 작성 가능

| ψ^{⊥} ⟩ = \frac{1}{\sqrt{2^{n} - 1}} \sum_{y \in {0, 1}^{n} : y \neq x} | y ⟩ .

$\ket{\psi^{\perp}}=\frac{1}{\sqrt{2^n-1}}\sum_{y\in\{0,1\}^n:y\neq x}\ket{y}.$

e^{- i H t} | ψ ⟩

$e^{-iHt}\ket{\psi}$

및

표준 파울리 행렬이다. 이것은

e^{- i t (I + 2^{- n} Z + \frac{\sqrt{2^{n} - 1}}{2^{n}} X)} \cdot (\begin{matrix} \frac{1}{\sqrt{2^{n}}} \\ \sqrt{1 - \frac{1}{2^{n}}} \end{matrix}),

$e^{-it\left(\mathbb{I}+2^{-n}Z+\frac{\sqrt{2^n-1}}{2^{n}}X\right)}\cdot\left(\begin{array}{c}\frac{1}{\sqrt{2^n}} \\ \sqrt{1-\frac{1}{2^n}} \end{array}\right),$

X

$X$

Z

$Z$

그래서, 우리는 시간 동안 진화 경우

e^{- i t} (I \cos (\frac{t}{2^{n / 2}}) - i \frac{1}{2^{n / 2}} \sin (\frac{t}{2^{n / 2}}) (Z + X \sqrt{2^{n} - 1})) (\begin{matrix} \frac{1}{\sqrt{2^{n}}} \\ \sqrt{1 - \frac{1}{2^{n}}} \end{matrix}) .

$e^{-it}\left(\mathbb{I}\cos\left(\frac{t}{2^{n/2}}\right)-i\frac{1}{2^{n/2}}\sin\left(\frac{t}{2^{n/2}}\right)\left(Z+X\sqrt{2^n-1}\right)\right)\left(\begin{array}{c}\frac{1}{\sqrt{2^n}} \\ \sqrt{1-\frac{1}{2^n}} \end{array}\right).$

이며 전역 단계를 무시하면 최종 상태는

t = \frac{π}{2} 2^{n / 2}

$t=\frac{\pi}{2}2^{n/2}$

다시 말해, 확률이 1이면 상태

우리가 찾고 있다고. Grover의 검색에 대한 일반적인 회로 기반 설명은 실제로이 연속적인 시간의 진화가 불연속적인 단계로 나뉘어져 있으며, 일반적으로 결과에 대한 확률 1을 정확히 얻을 수 없다는 점과 약간의 단점이 있습니다.

\frac{1}{2^{n / 2}} (Z + X \sqrt{2^{n} - 1}) (\begin{matrix} \frac{1}{\sqrt{2^{n}}} \\ \sqrt{1 - \frac{1}{2^{n}}} \end{matrix}) = (\begin{matrix} \frac{1}{2^{n}} \\ - \frac{\sqrt{2^{n} - 1}}{2^{n}} \end{matrix}) + (\begin{matrix} 1 - \frac{1}{2^{n}} \\ \frac{\sqrt{2^{n} - 1}}{2^{n}} \end{matrix}) = (\begin{matrix} 1 \\ 0 \end{matrix}) .

$\frac{1}{2^{n/2}}\left(Z+X\sqrt{2^n-1}\right)\left(\begin{array}{c}\frac{1}{\sqrt{2^n}} \\ \sqrt{1-\frac{1}{2^n}} \end{array}\right)=\left(\begin{array}{c}\frac{1}{2^n} \\ -\frac{\sqrt{2^n-1}}{2^n} \end{array}\right)+\left(\begin{array}{c} 1-\frac{1}{2^n} \\ \frac{\sqrt{2^n-1}}{2^n}\end{array}\right)=\left(\begin{array}{c} 1 \\ 0 \end{array}\right).$

| x ⟩

$\ket{x}$

$\tilde H=5H$ $\tilde H$ $2^{n/2}$ $2^{n/2}$ $k$ $1/k$

$\ket{x}$ $\ket{y}$

— 다 프트 울리
소스

$D = -H^{\otimes n}U_0H^{\otimes n}$ $U_0$

U_{0} | 0^{\otimes n} ⟩ = - | 0^{\otimes n} ⟩, U_{0} | x ⟩ = | x ⟩ for | x ⟩ \neq | 0^{\otimes n} ⟩ .

$U_0\left|0^{\otimes n}\right> = -\left|0^{\otimes n}\right>,\,U_0\left|x\right> = \left|x\right>\,\text{for} \left|x\right>\neq\left|0^{\otimes n}\right>.$

$U_0$ $U_0 = I-2\left|0^{\otimes n}\rangle\langle0^{\otimes n}\right|$ , giving

D = 2 | + ⟩ ⟨ + | - I,

$D= 2\left|+\rangle\langle+\right| - I,$ where

| + ⟩ = 2^{- n / 2} {(| 0 ⟩ + | 1 ⟩)}^{\otimes n}

$\left|+\right> = 2^{-n/2}\left(\left|0\right> + \left|1\right>\right)^{\otimes n}$ .

This gives² that the diffusion operator is a reflection about $\left|+\right>$

As the other part of Grover's algorithm is also a reflection, these combine to rotate the current state closer to the 'searched-for' value $x_0$ . This angle decreases linearly with the number of rotations (until it overshoots the searched-for value), giving that the probability of correctly measuring the correct value increases quadratically.

Bennet et. al. showed that this is optimal. By taking a classical solution to an NP-problem, Grover's algorithm can be used to quadratically speed this up. However, taking a language $\mathcal L_A = \left\lbrace y:\exists x\, A\left(x\right) = y\right\rbrace$ for a length preserving function $A$ (here, an oracle), any bounded-error oracle based quantum turing machine cannot accept this language in a time $T\left(n\right)=\mathcal o\left(2^{n/2}\right)$ .

This is achieved by taking a set of oracles where $\left|1\right>^{\otimes n}$ has no inverse (so is not contained in the language). However, this is contained in some new language $\mathcal L_{A_y}$ by definition. The difference in probabilities of a machine accepting $\mathcal L_A$ and a different machine accepting $\mathcal L_{A_y}$ in time $T\left(n\right)$ is then less than $1/3$ and so neither language is accepted and Grover's algorithm is indeed asymptotically optimal.³

Zalka later showed that Grover's algorithm is exactly optimal.

^{1 In Grover's algorithm, minus signs can be moved round, so where the minus sign is, is somewhat arbitrary and doesn't necessarily have to be in the definition of the diffusion operator}

^{2 alternatively, defining the diffusion operator without the minus sign gives a reflection about $\left|+^\perp\right>$}

^{3 Defining the machine using the oracle $A$ as $M^A$ and the machine using oracle $A_y$ as $M^{A_y}$ , this is a due to the fact that there is a set $S$ of bit strings, where the states of $M^A$ and $M^{A_y}$ at a time $t$ are $\epsilon$ -close⁴, with a cardinality $<2T^2/\epsilon^2$ . Each oracle where $M^A$ correctly decides if $\left|1\right>^{\otimes n}$ is in $\mathcal L_A$ can be mapped to $2^n - \text{Card}\left(S\right)$ oracles where $M^A$ fails to correctly decide if $\left|1\right>^{\otimes n}$ is in that oracle's language. However, it must give one of the other $2^n-1$ potential answers and so if $T\left(n\right)=\mathcal o\left(2^{n/2}\right)$ , the machine is unable to determine membership of $\mathcal L_A$ .}

^{4 Using the Euclidean distance, twice the trace distance}

— Mithrandir24601
소스