Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Entanglement Entropy
Candidato: Relatore:
Federico Grasselli Prof. Gianluca Grignani
Matricola 256291
1 Density Operator 3
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Nomenclature . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Pure States . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Mixed States . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.5 Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.6 Time Evolution of a mixed state . . . . . . . . . . . . . . . . 10
1.7 Density Matrices . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.8 Reduced Density Operator . . . . . . . . . . . . . . . . . . . . 13
4 Conclusions 46
Appendices 48
1
C Shannon and von Neumann Entropy 52
Bibliography 54
2
Chapter 1
Density Operator
1.1 Introduction
Let’s consider a quantum system: we usually identify it through its state, i.e.
its wave function. In this way, we assume it is possible to define the system
completely and build the function which represents it. However, this is not
always feasible. For instance, in an electron-target scattering experiment we
use the electron wave function to compute the cross section of the process;
while if we want to estimate it experimentally, an electron beam –made
of many identical electrons prepared in the same initial state– is needed.
Clearly, the experimental apparatus won’t produce electrons in the exact
same conditions, hence considering a single electron as representative of the
whole beam implies a process of idealization. Although such a process is
acceptable in the majority of the cases, sometimes it is not: suppose the
beam was prepared without taking into account electron spin, therefore a
measure of Sz over all electrons will statistically result in 50% h̄/2 and
50% −h̄/2 (neglecting the presence of polarizers). Consequently, we cannot
specify the spin wave function more than:
1
ψ = √ χ+ + eiα χ−
2
where α is unknown. A spin function like the one just written, or in general
a function like
ψ = aχ+ + bχ−
(where a and b are fixed complex numbers satisfying |a|2 +|b|2 = 1), describe1
an electron spin oriented in a particular direction n = n (θ, ϕ) with θ =
2 arccos |a|, ϕ = arg b − arg a. This means that ψ can represent an electron
beam polarized along n while it cannot represent an unpolarized beam
such as the one previously hypothesized. Moreover, since the latter is an
1
proof in appendix A
3
incoherent superposition of states, it cannot be represented by any single
state of which it is composed, contradicting the assumption made at the
beginning.
In these cases, that is when the lack of information about a collection
of single states prevents us from describing them one by one completely, we
can still describe such collection through a statistical analysis, in particular
by means of the density operator [1].
1.2 Nomenclature
• Given a quantum system, if it can be identified by a complete set of
commuting observables, its state is called a pure state. We call pure
states even those states which are a coherent superposition – thus
completely determined– of such states. In other words, every state
which is represented by a definite wave function is a pure state;
Now we will go through the description of pure and mixed states made
by the density operator [1].
where
cn = hϕn |si. (1.3)
The probability of obtaining fn as a result of the measure of F on the system
in its state |si is:
P(fn ) = |cn |2 (1.4)
4
which may also be written, introducing the projector Pn of the nth eigen-
vector: Pn = |ϕn ihϕn | , as:
and since trace is invariant with respect to a change of basis2 , the (1.9)
has general validity. As a matter of fact, computing the trace of ρF in an
arbitrary orthonormal basis {|αk i} we obtain hF is :
X X
Tr(ρF ) = hαk |sihs|F |αk i = hs|F |αk ihαk |si = hs|F |si = hF is (1.10)
k k
5
respective probabilities pi (for i = 1, ..., n). The “state” of such a quantum
system is called a mixed state.
Note that the only assumption on the states |s(i) i is the normalization
to one, with no orthogonality
P required; obviously the statistical weights pi
satisfy: 0 ≤ pi ≤ 1 and i pi = 1.
The definition of the mean value of an observable F over a mixed state
is quite obvious: X
hF i = pi hF ii (1.12)
i
2 2
where P (i) (fn ) = |cn (i) | = |hϕn |s(i) i| and |s(i) i = n cn (i) |ϕn i.
P
As done in the previous section, we will now define the density operator
of a mixed state by making explicit the expectation value of F in the state
|s(i) i:
X 2 X
hF ii = hs(i) |F |s(i) i = fn |hϕn |s(i) i| = fn hϕn |s(i) ihs(i) |ϕn i (1.14)
n n
which substituted in hF i:
X X X X
hF i = pi fn hϕn |s(i) ihs(i) |ϕn i = hϕn | pi |s(i) ihs(i) |F |ϕn i (1.15)
i n n i
6
By means of the density operator one can also compute the probability
P(fn ) as in (1.11):
X
P(fn ) = pi P (i) (fn )
i
X 2
= pi |hϕn |s(i) i|
i
X
= pi hs(i) |ϕn ihϕn |s(i) i
i
X
= pi hs(i) |Pn |s(i) i
i
X
= pi hPn ii
i
= hPn i [choosing F = Pn ]
1.5 Properties
Now let’s list some properties of density operators. Each property is satis-
fied both by density operators of pure and mixed states, unless otherwise
specified.
1. ρ = ρ† : follows from the definition in (1.16) since pi ∈ < ∀i ;
2. Tr(ρ) = 1 :
X X X X 2
Tr(ρ) = hαk | pi |s(i) ihs(i) |αk i = pi |hs(i) |αk i| =
k i i k
2
(Parseval’s equation assures us that k |hs(i) |αk i| = hs(i) |s(i) i = 1)
P
X
= pi = 1;
i
7
In conclusion we have seen that: 0 ≤ λk ≤ 1 ∀k.
N.B.: if the eigenvalue λk̄ is equal to 1, then (1.19) shows that all
the others are 0; this is the particular case of a pure state, as a
matter of fact, writing ρ through its spectral representation: ρ =
P
k λk |λk ihλk | = |λk̄ ihλk̄ | which is the definition of pure state.
Tr(ρ2 ) ≤ 1,
where equality occurs for pure states: recalling previous N.B., all eigen-
values are zero except for one which equals 1, hence
Tr(ρ2 ) = Tr(ρ) = 1.
Therefore relations
ρ2 = ρ and Tr(ρ2 ) = 1
8
Such a difference may be better visualized through an ideal experiment:
consider many identically prepared systems, such as electrons, directed nor-
mally toward a Young double-slit interferometer, and, if not absorbed, sub-
sequently detected on an opaque screen. Let a1 (x) be the probability am-
plitude –i.e. the wave function– corresponding to the passage through the
first slit toward the spatial point x on the measurement screen, which, like
the interferometer, is perpendicular to the direction of the initial beam.
Similarly, let a2 (x) be the probability amplitude corresponding to passage
through the second slit toward the spatial point x. The amplitude for sys-
tems being found at x when both slits are passable, so that either slit
may be entered on the way to the screen, is:
1
a12 (x) = √ (a1 (x) + a2 (x)) (1.20)
2
√
according to the superposition principle (the factor 1/ 2 confers proper
normalization on the total probability). The probability density of finding
the quantum systems at point x on the collection screen upon measurement
is the complex square of the amplitude:
1h i
p12 (x) = |a12 (x)|2 = |a1 (x)|2 + |a2 (x)|2 + 2Re(a1 (x)∗ a2 (x)) (1.21)
2
and refers to the coherent superposition of quantum states a1 (x) and a2 (x),
which clearly interfere.
Suppose now that the experimental apparatus is modified so that only
one slit is available at a time, with both being made available in the
full course of the experiment on the entire ensemble of systems. Assuming
that diffraction from each slit is negligible, interference previously detected
vanishes. As a matter of fact in this case there is no quantum superposition
of states, but rather a mixing of them: the modified apparatus implies that
a certain population p of quantum systems –arbitrarily determined by the
experimentalist– is in state a1 (x), while the remaining population 1 − p is in
state a2 (x). The probability density of finding one of the quantum systems
at point x on the collection screen upon measurement is3 :
X
ρxx = pi |ai (x)|2 (1.22)
i
9
We would like to point out one last important fact regarding density
matrices. When talking about statistical ensembles of quantum systems,
we necessarily deal with two distinct kinds of probability: statistical prob-
abilities pi which appear explicitly in ρ’s definition, and those (P (i) (fn ))
which characterize the expectation valuePof observables measured on each
pure subsystem of the mixture (hF ii = n fn P (i) (fn )) and are implicit in
the vectors |s(i) i. Consequently, when we compute traces in the density op-
erator formalism, we refer to particular representations of it, dealing with
density matrices depending on both kinds of probabilities indistinguishably.
As a result, while one can uniquely define a density operator starting from
a given mixture of states, one cannot associate a definite mixture of sys-
tems to a given density matrix; therefore there can be different system
ensembles described by the same density matrix [1].
which yields:
dρ 1
= [H, ρ]. (1.27)
dt ih̄
This last equation is called the von Neumann equation or Liouville-von Neu-
mann equation and describes how a density operator evolves in time. This
equation is the quantum analogous of the Liouville equation valid in classical
statistical mechanics:
dρcl
= {H, ρcl },
dt
where ρcl (p, q) is the phase space distribution of the system. Note that,
although identical up to a minus sign, the von Neumann equation and time
evolution of a Heisenberg operator A:
dA 1
= [A, H]
dt ih̄
10
are completely disconnected: while the former was obtained in the Schrödin-
ger picture, the latter arises in a context where operators are, in general,
time-dependent, whereas states are not; this means that in the Heisenberg
picture the density operator –as defined in (1.24)– is time-independent as
well [1].
At last, one can wonder whether (1.27) and the Ehrenfest theorem are
reconcilable, since both belong to the Schrödinger picture. One can imme-
diately verify they are self-consistent by deducing one through the other:
d d
hAi = [using (1.17)] Tr(ρA)
dt dt
dρ 1
= Tr A = Tr([H, ρ]A)
dt ih̄
1 1
= Tr(HρA − ρHA) = Tr(ρAH − ρHA)
ih̄ ih̄
1 1
= Tr(ρ[A, H]) = h[ρ, H]i
ih̄ ih̄
which is the Ehrenfest theorem.
In case of a time-independent Hamiltonian, the integrated form of the
von Neumann equation is:
and it can be easily proved by recalling time evolution of a state for time-
independent Hamiltonians:
11
noting that X
|s(i) i = ck (i) |αk i with ck (i) = hαk |s(i) i
k
we can rewrite it: X ∗
ρkl = pi ck (i) cl (i) . (1.29)
i
2
• The diagonal elements are ρkk = i pi |ck (i) | ; each ρkk is given by the
P
sum of products between probability pi that a generic system belonging
2
to the mixture is in state |s(i) i and probability |ck (i) | of finding the
state |αk i in the state |s(i) i. Hence, ρkk is the global probability of
finding a generic element of the mixture in the state |αk i. For this
reason, diagonal Pelements are called relative populations of the states
|αk i. Obviously k ρkk = 1 due to property 2 of density operators.
∗
• Regarding off-diagonal matrix elements, coefficients ck (i) cl (i) express
interference effects between the states |αk i and |αl i which can arise if
the pure state |s(i) i is a linear coherent superposition of such states.
Therefore, the element ρkl represents the average of these interference
effects over the mixed ensemble, and can result in zero even if singu-
∗
larly each coefficient ck (i) cl (i) is not. In a statistical mixture of states
the averaging can cancel out interference effects. Vice versa, ρkl 6= 0
means that even at the mixture level there are residual effects of co-
herence between |αk i and |αl i. For this reason, off-diagonal matrix
elements are called coherences.
Clearly, both populations and coherences depend on the chosen basis {|αk i}.
Coherences are bounded from above by populations in the following way:
where we also notice that coherences oscillate in time with Bohr frequencies.
12
1. In a completely mixed state, all states must result equally populated
and there are no coherences. By naming the dimension of H as N
({|αk i} ∈ H), it follows that the population of each state is 1/N ,
hence ρ is the identity matrix multiplied by 1/N (ρkk = 1/N and
ρkl = 0 for l 6= k);
it follows that: X
ρ= ak al ∗ |αk ihαl |
k,l
ρkl = ak al ∗ . (1.33)
13
Denoting ρA the partial trace computed in the subspace containing subset
B, i.e.: X
ρA = TrB ρ = hβl |ρ|βl i (1.34)
l
we have:
X
hFA i = hαk |ρA |αj ihαj |FA |αk i (completeness of {|αj i})
kj
X
= hαk |ρA FA |αk i = Tr(ρA FA ) (1.35)
k
As one can see, the expectation value of FA has been manipulated achieving
an analogous form of (1.17) by means of the operator ρA , named reduced
density operator. The reduced density operator works only on the subspace
containing subset A and satisfies all the properties already listed for a generic
density operator [1].
14
Chapter 2
In this chapter we will briefly show how all the postulates of Q.M. given
in the state vector language can be reformulated in terms of the density
operator language, since some of them will be useful afterwards. The two
formulations are equivalent and both arise in the context of Schrödinger’s
picture, however it is sometimes much easier to approach problems from one
point of view rather than the other [3].
One knows that closed quantum systems evolve according to unitary evo-
lution; nevertheless, when an experimentalist observes the system to carry
out a measure of some physical magnitude, the interaction which follows
makes the system no longer closed, thus not subjected to unitary evolution.
The following postulate provides a means for describing the effects of mea-
surements on quantum systems: (number 3 refers to the numeration we will
give at the end of the chapter)
15
and the state of the system after the measurement is:
Mm |ψi
|ψ 0 i = p (2.2)
p(m)
Inserting Mm = Pm in (2.1):
= |c(m)|2
Mm |ψi |mihm|ψi
|ψ 0 i = p = = ±|mi (2.4)
p(m) |hm|ψi|
16
2.2 Time Evolution and Measurements for Den-
sity Operators
What we are going to show is that the postulates of Q.M. related to unitary
evolution and measurement can be rephrased in the language of density
operators. In the next section we complete this rephrasing by giving an
intrinsic characterization of the density operator that doesn’t rely on the
idea of a state vector.
What is the density operator of the system after obtaining the measurement
result m? If the initial state was |ψi i then the state after getting m is (2.2):
Mm |ψi i
|ψi m i = p (2.8)
p(m|i)
17
operator ρm is:
X
ρm = p(i|m)|ψi m ihψi m | =
i
X Mm |ψi ihψi |Mm †
= p(i|m) =
p(m|i)
i
Before moving to the next section we want to remark that stating that
a quantum system is prepared in the state P ρi with probability pi means it is
described by the density operator ρ = i pi ρi . Indeed, although prepared
in ρi with probability pi , there is a probability 1 − pi of finding it in many
other states ρi0 with respective probabilities pi0 .
ρ0 = U ρU † (2.10)
18
Postulate 3 (Generalized Measurements) Quantum measurements are
described by a collection {Mm } of measurement operators acting on the state
space of the system being measured. The index m refers to the measurement
outcomes that may occur in the experiment. If the state of the quantum
system is ρ immediately before the measurement, then the probability that
result m occurs is given by (2.7):
Mm ρMm †
ρm = (2.12)
Tr(Mm † Mm ρ)
19
Chapter 3
Entanglement and
Entanglement Entropy
3.1 Introduction
Entanglement is arguably the most fundamental – according to Schrödinger
– and potentially disturbing characteristic distinguishing the quantum from
the classical world. One could describe it as follows: it implies that the
measurement of an observable of a subsystem may affect drastically and
instantaneously the possible outcome of a measurement on another part of
the system, no matter how far apart it is spatially. This is to be distinguished
from the phenomenon of classical correlation, where the distribution of the
possible outcomes of a measurement on one part of the system may depend
on the outcome of a previous measurement elsewhere, but is strictly limited
by the signal propagation speed [7].
In order to better understand the above description, let’s immediately
present a concrete example of entanglement: the singlet state of two spin- 12
particles
1
|s0 i = √ (| ↑i1 ⊗ | ↓i2 − | ↓i1 ⊗ | ↑i2 ) (3.1)
2
where subscripts 1,2 refer respectively to the first and second particle. If an
observer measures the spin of the first particle and obtains a particular value,
say σz 1 = +1/2, the spin value of the second particle is automatically fixed
to σz 2 = −1/2, since after the measurement the system has collapsed into
its eigenstate | ↑i1 ⊗ | ↓i2 . This phenomenon implies a sort of non-classical
correlation between the two particles, i..e. entanglement. The corresponding
state is called entangled. Obviously entanglement is not always guaranteed
in all two-spin systems, for instance in the triplet:
measuring the spin of the first particle doesn’t give additional information
20
about the state of the second particle, hence the two particles are not en-
tangled, and their state is called separable.
Entanglement can be defined for both pure and mixed states. In the first
case the definition of entanglement poses no difficulty and can be naturally
understood in terms of standard correlations. In the second, however, the
definition is subtle, as correlations do not directly imply entanglement. At
the same time, for two objects, entanglement can be easily defined and quan-
tified, while for many objects this is not so simple: new correlations may
appear, which cannot be reduced to the case of two objects. We will intro-
duce separately entanglement for pure and mixed states, considering in each
the case of bipartite systems (i.e. two objects) and multipartite ones. More
details can be found in [6].
21
I acting on B. Pauli operators acting on qubits will often appear; they are
defined as follows:
σx = |0ih1| + |1ih0|
σy = i(|0ih1| − |1ih0|)
σz = |1ih1| − |0ih0| = −iσx σy
22
a generic pure state Φ representing two subsystems A and B it is always
possible to find an orthonormal basis {|un i} in HA and {|vm i} in HB such
that: X
|Φi = dk |uk , vk i (3.7)
k≤dA ,dB
where
P {|niA } and {|miB } are orthonormal bases and having defined |ñiB =
m n,m |miB which are not necessarily orthonormal. Now suppose that
c
{|niA } is an eigenbasis1 of ρA (reduced density operator) with eigenvalues
{dn 2 }. Then spectral representation for ρA is:
X
ρA = dn 2 |niA hn|A (3.9)
n
23
1. Every pure state can be decomposed
P according to SD, however while
this decomposition: |Ψi = n,m cn,m |n, mi can be realized with any
orthonormal basis, the basis used in SD depends on the pure state to
be decomposed, as the proof highlights.
3. Every pure state is associated with its Schmidt number, which is the
number of non-zero eigenvalues of ρA , hence the number of terms com-
posing the state’s SD. If the Schmidt number is equal to 1, then the
state is a product state, also called a separable state. If the Schmidt
number is greater than 1, then the state is entangled, since it cannot
be written as a tensor product of two vectors.
4. It is important to stress the fact that if the pure state representing the
whole system is separable, then the reduced density matrices ρA and
ρB describe pure subsystems (because they are in the form (1.7));
if the pure state is entangled, then the reduced density matrices
describe mixed subsystems (because they are in the form (1.16)).
Φ is already written in the the SD form. The eigenvalues of the reduced den-
sity operator are cos(θ)2 , sin(θ)2 ; for θ = 0 we have a product state, whereas
for θ = π/4 we have the Bell state Φ+ , which is a maximally entangled
state, as explained in the previous paragraph. In parallel, the reduced den-
sity operators get more and more mixed as one increases θ from 0 to π/4
(note that the purity of a mixed state ρ is related to the distribution if its
eigenvalues when considered as probabilities).
24
A natural way of measuring the latter2 is the von Neumann entropy:
25
maximally entangled for p = 1/2, as we expected.
26
Proof
!
pi λji log pi λji
X X
S pi ρi = −
i ij
3. As in the previous proof, let pi λji and |eji iB be the eigenvalues and
corresponding eigenvectors of pi ρi . Clearly, the normalized eigenvector
j j
of the operator |iihi| is |iiA with eigenvalue 1. Hence pi λP i and |iiA |ei iB
are the eigenvalues and eigenvectors of the operator i pi |iihi| ⊗ ρi ,
which means that:
!
pi λji log pi λji = H(pi ) +
X X X
S pi |iihi| ⊗ ρi = − pi S(ρi ).
i ij i
27
entropy:
!
pi λji ηik log(pi λji ηik )
X X
S pi ρi ⊗ σi = −
i ijk
Some properties of the Shannon entropy H(X) fail to hold for the von Neu-
mann entropy, for instance for random variables X and Y , the inequality
H(X) ≤ H(X, Y ) holds. This makes intuitive sense: surely we cannot be
more uncertain about the sate of X than we are about the joint state of X
and Y . This intuition fails for quantum states. Consider a system AB of
two qubits in the entangled state Φ+ . This is a pure state, so S(A, B) = 0.
On the other hand, system A has density operator I/2, and thus has entropy
equal to one. Another way of stating this result is that, for this system, the
quantity S(B|A) = S(A, B) − S(A) is negative. In other words, the subsys-
tems of the entangled system may exhibit more disorder than the system as
a whole. In the classical world this never happens.
28
Property 5 Suppose |ABi is a pure state of a composite system. Then
|ABi is entangled ⇐⇒ S(B|A) < 0.
Proof
5. S(A, B) = S(B, A) = 0 since |A, Bi is pure. If |ABi is entangled,
then ρA is a mixed state, for which S(A) > 0, consequently S(B|A) =
S(B, A) − S(A) = −S(A) < 0.
If S(B|A) < 0 then S(A) > 0 indicating that ρA is a mixed state,
which means that |A, Bi is entangled.
29
Applying the sub-additivity inequality S(A, B) ≤ S(A) + S(B), we
obtain: !
X X
pi S(ρi ) ≤ S pi ρi
i i
which proves the theorem.
However we still have to justify equations (3.23) and (3.24). For the
former we can say that:
S(A) = S(ρA ) = S(TrB (ρAB )) where
!
X
TrB (ρAB ) = TrB pi ρi ⊗ |iihi| =
i
(we choose basis {|ii} to compute the partial trace)
XX XX
= pi ρi hk|iihi|ki = pi ρi δik 2 =
k i k i
X
= pi ρi .
i
30
which yields X
TrA (ρAB ) = pi |iihi|
i
31
optimize
2n
1X
Ē = pd log2 (d)
n
d=1
32
if there exist |ϕX i ∈ HX such that
otherwise we say that we have an entangled state. This definition was given
by Schrödinger.
33
mentioned state within the pure state in which it was prepared. Hence
the definition of entanglement has broadened beyond Schrödinger’s original
definition in order to include such mixed states [2].
ρ = ρA ⊗ ρB
Without loss of generality, we can assume that ρA i and ρB i are all pure
ensembles of the appropriate subsystems, i.e. ρA i = |aiha|i and ρB i = |bihb|i ;
indeed, if they were mixed states, we could make them explicit through (1.16)
obtaining a sum, over 3 indexes, of pure states and we could then reduce such
sum to only one index:
X X
ρ= pi |aiha|i ⊗ |bihb|i = pi |ai bi ihai bi |
i i
requiring that ρ is separable means that every operator |Ψi ihΨi |, which acts
on HAB , is a product state: |Ψi ihΨi | = ρA i ⊗ ρB i , or equivalently that every
state |Ψi i is a product state |ai i|bi i. However it is not always possible to
tell whether or not a given mixed state is separable; this problem is known
as the separability problem.
What differentiates pure and mixed states is that non-entangled mixed
states may contain correlations, while non-entangled pure states do not. For
example, the mixed separable state:
1 1
ρ = (|0ih0| ⊗ |0ih0| + |1ih1| ⊗ |1ih1|) = (|0, 0ih0, 0| + |1, 1ih1, 1|)
2 2
34
fulfills hσzA ⊗σzB i = Tr(σzA ⊗σzB ρ) = ... = 1 whereas hσzA,B i = Tr(σzA,B ρA,B ) =
... = 0. These correlations are, however, very trivial. If we had a classical
system we could also have them. Only entangled states may display non-
classical correlations.
A subtle point is that a state may look entangled even though it is
separable. Let us take, for instance:
1
σ = (|Φ+ ihΦ+ | + |Φ− ihΦ− |)
2
According to this formula, we can prepare σ by mixing two maximally en-
tangled states. However, there is another way of preparing the same state
σ which does not require using entangled states at all. This immediately
follows from the fact that σ = ρ (one has just to replace the definition of
the Bell states in this formula), and thus one can prepare it by mixing two
product states. Therefore σ is a separable state as well. This simple exam-
ple illustrates the difficulty of finding out whether a state is entangled or
not: we have to check all the possible decompositions and only if none of
them involves exclusively product states, we will have an entangled state.
Unfortunately there exist infinite decompositions, so that this task is hope-
less. Fortunately, in some special cases there are shortcuts, which can give
us the right answer with much less effort. We will now go through them.
(i) 2
since pi ≥ 0, |ckl | ≥ 0, hkl|W |kli ≥ 0. Thus, a negative expectation value
of a witness indicates the presence of entanglement. Note, however, that
the converse is not necessarily true: if the expectation value of a witness is
35
positive, this does not imply that the corresponding state is not entangled.
Hence the important question one can ask about entanglement witnesses
regards their optimality. We say that an entanglement witness W1 is finer
than W2 if and only if the entanglement of any ρ detected by W2 is also
detected by W1 . Horodecki showed ([5]) that for any entangled state there
always exists a witness that detects it. Sadly, there is no simple way of find-
ing out such a witness, which makes the problem of detecting entanglement
in mixed states rather non-trivial.
Let’s consider the following example. For a system of two qubits, W =
2 − S, where S = σ1A ⊗ (σ1B + σ2B ) + σ2A ⊗ (σ1B − σ2B ) and the sigma’s are
Pauli operators, is an entanglement witness. This can be shown by noticing
that for a product state: |hSi| ≤ |hσ1B i + hσ2B i| + |hσ1B i − hσ2B i| ≤ 2, since
hσi ≤ 1. By choosing the sigma’s as follows: σ1 = σ~n , σ2 = σm ~ (where σ~ n,m
~
are those in subsection (3.2.1)), we see that for ρ(p) = (1 − p) 14 + p|Φ+ ihΦ+ |,
entanglement is detected for p > √12 .
(ρTAB
B
)mµ,nν = hm|hµ|ρTAB
B
|ni|νi ≡ hm|hν|ρAB |ni|µi
36
they share the same Characteristic polynomial: det(A − λI) =
det (A − λI)T = det(AT − λI)– the spectrum of ρTAB
B
is the same
TB
as the spectrum of ρAB , and in particular ρAB must still be
positive semidefinite, which concludes the demonstration.
Comparing these properties with the ones fulfilled by the von Neumann
entropy, one immediately notices that while S(ρ) is concave, E(ρ) is con-
vex. This striking difference is basically the reason for which the entropy
37
of entanglement fails to hold as a satisfactory measure of entanglement over
mixed states.
This statement
P can be better visualized by considering a generic mixed state
ρ: ρ = i pi |Ψi ihΨi | and by evaluating its entanglement through the en-
tropy of entanglement:
(equality occurs only when all ρA i are equal). The last inequality is clearly
in contradiction with the axiom of convexity:
!
X X
E(ρ) = E pi |Ψi ihΨi | ≤ pi E(Ψi )
i i
Inequalities (3.25) and (3.26) show that ES (ρ) 6= E(ρ), hence the entropy
of entanglement can not be taken as a measure of entanglement for mixed
states.
A possible measure of entanglement that satisfies the above axioms and that
can be determined in practice is the entanglement of formation:
X
EF (ρ) = min pi E(Ψi )
i
38
where F (Ψ) is the fidelity previously defined and pi are the coefficients of
the convex combination representing ρ.
Finally, entanglement can also be measured using the definition of partial
transposition. One defines negativity as
where the 1-norm is given by the sum of the absolute values of the eigen-
values. Negativity can be positive only if we have an entangled state, so it’s
a sufficient condition but it is not necessary: there exist entangled states
for which it is zero. In fact, we are going to prove that N (ρ) > 0 ⇒ ρ is
entangled:
1. S(A : B) ≥ 0; S(A : B) = 0 ⇔ ρ = ρA ⊗ ρB
The first indicates that it is zero only for product states, i.e. when there are
no correlations. The second indicates that it decreases whenever we discard
a subsystem (in this case a), but it cannot decrease by more than twice the
entropy of such a system.
Proof
39
1. At first let’s show that there are no correlations for product states
such as ρ = ρA ⊗ ρB :
X
hA ⊗ Bi = Tr(ρA ⊗ B) = hkl|ρA ⊗ B|ki|li =
kl
X
= hkl|ρA ⊗ ρB |A|kiB|li =
kl
X
= hkl|ρA A|kiρB B|li =
kl
X
= hk|ρA A|kihl|ρB B|li =
kl
= Tr(ρA A)Tr(ρB B) =
obviously for product states the reduced density matrix 3 for system
A,B is ρA,B , resulting in:
hA ⊗ Bi = Tr(ρA A)Tr(ρB B) = hAihBi (3.29)
which confirms the absence of correlations.
In order to prove the property we will use the so called strong sub-
additivity of the von Neumann entropy: given three subsystems X, Y, Z
then
S(X, Y ) + S(X, Z) ≥ S(X, Y, Z) + S(X)
If we choose Y = A, Z = B and X a system which is pure (S(X) = 0)
and disentangled from the others (i.e. XY and XZ are product states)
we immediately obtain the first property, since S(X, Y ) = S(ρX ⊗
ρY ) = recall the von Neumann property number 4 = S(X) + S(Y ) =
S(Y ) and S(X, Z) = S(ρX ⊗ρZ ) = S(X)+S(Z) = S(Z). The equality
holds when ρ = ρA ⊗ ρB : S(A, B) = S(ρA ⊗ ρB ) = S(ρA ) + S(ρB ) =
S(A) + S(B).
2. By choosing X = A, Y = B and Z such that the whole state Ψ is
pure and the reduced state in AB is our state ρ, we obtain S(A, B) ≥
S(A)−S(A, Z); using property 1 of the von Neumann entropy we have
S(A, Z) = S(B) (being ABZ a pure state), which yields: S(A, B) ≥
S(A) − S(B).
We prove S(A : B) ≤ S((AZ) : B), which is equivalent to S(A) +
S(B) − S(A, B) ≤ S(A, Z) + S(B) − S(Z, A, B), by simply choosing
X = A, Y = B in the strong sub-additivity inequality.
We prove S((AZ) : B) ≤ S(A, B)+2S(Z) making it explicit: S(A, Z)+
S(B) − S(A, Z, B) ≤ S(A) + S(B) − S(A, B) + 2S(Z) and using in-
equalities: S(A, Z, B) ≥ S(A, B) − S(Z) (obtained from S(A, B) ≥
S(A) − S(B)) and S(Z) + S(A) ≥ S(A, Z) (obtained from property 6
of the von Neumann entropy).
3
TrB (ρ) = TrB (ρA ⊗ ρB ) = ρA TrB (ρB ) = ρA
40
Moreover, quantum mutual information for pure states Ψ reduces to the
entropy of entanglement: S(A, B) = 0 and S(A) = S(B), which yield:
S(A : B) = 2S(A) = 2E(Ψ).
We end up with a table in which for each partition we declare whether the
state is entangled or not. In order to check the entanglement for each parti-
tion, we have to, for example, find the appropriate witness, whose definition
follows very naturally the one for bipartite systems. Such a table may con-
tain some redundancies since, for instance, if a tripartite state is completely
separable, i.e. separable with respect to the partition (A)(B)(C), it is auto-
matically separable for any other partition, and thus not entangled at all.
This is obviously a general fact: if a multipartite state is completely
separable, then it is not entangled with respect to any possible
partition.
41
3.4 An interesting example
In this section we will show that the von Neumann entropy and the state
function entropy known in thermodynamics have something in common. In
particular, we will compute ([8, page 188]) the von Neumann entropy of an
ensemble in thermal equilibrium and show ([4]) that it coincides with the
above-mentioned thermodynamic entropy.
First we obtain the density operator ρ describing the ensemble in thermal
equilibrium. The basic assumption we make is that nature tends to maximize
the von Neumann entropy of the ensemble S(ρ), subject to the constraint
that the ensemble average of the Hamiltonian has a certain prescribed value,
since it is in equilibrium. Once thermal equilibrium is established, we expect:
dρ
=0 (3.30)
dt
otherwise hHi = Tr(ρH) would be time-dependent, in contradiction with
the hypothesis of equilibrium. Because of (1.27), this means that ρ and
H can be simultaneously
P diagonalized. Therefore, the eigenstates |ki used
to trace: S(ρ) = − k xk log xk , which are ρ’s eigenstates (and xk are its
eigenvalues), are chosen to be also energy eigenstates. With this choice, the
fractional population of the state |ki expresses the probability of finding a
mixture element with energy Ek .
Thus we are trying to maximize S(ρ), taking into account the constraints:
X N
X
−βEk −γ−1
e =1 ⇒Z≡ e−βEk = eγ+1
k k
42
where we recognize the partition function Z, that can also be written Z =
Tr(e−βH ); in conclusion
e−βEk
xk = (3.33)
Z
where the constant β can also be determined in terms of U through the first
constraint.
Had we attempted to maximize S(ρ) without the internal-energy con-
straint, we would have obtained instead:
1
xk = (3.34)
N
which is the density matrix eigenvalue appropriate for a completely random
ensemble, as we stated in section 1.7. Comparing (3.33) with (3.34), we
infer that a completely random ensemble can be regarded as the β → 0
limit (physically the high-temperature limit) of a canonical ensemble.
Now, since ρ and H are simultaneously diagonalized with respect to basis
{|ki}, one can write ρ through its spectral decomposition:
X X e−βEk
ρ = xk |kihk| = |kihk| =
Z
k k
X e−βH e−βH X
= |kihk| = |kihk| =
Z Z
k k
e−βH
= (3.35)
Z
The last expression for ρ, with β = 1/kB T , is known in statistical mechanics
to describe a canonical ensemble.
Now it’s time to compute the von Neumann entropy of the canonical ensem-
ble: !
e−H/kB T e−H/kB T
S(ρcan ) = −Tr log
Z Z
−H/k T
where log e Z B = log e−H/kB T − log Z and log e−H/kB T = −H/kB T as far
as the exponent is a Hermitian operator. Thus:
1
S(ρcan ) = Tr(ρcan H) + log ZTr(ρcan ) =
kB T
recalling that Helmholtz free energy F = −kB T log Z
hHi − F
=
kB T
recalling F ’s definition: F = U − T Stherm and that hHi = U
= Stherm /kB
We have shown that, up to a factor, the von Neumann entropy of a canonical
ensemble is the state function entropy in the thermodynamic sense!
43
3.5 Monogamy and Area Law
In this section we will present, from a qualitative point of view, two interest-
ing properties fulfilled by many-body quantum systems: the area law ([6])
and the monogamy of entanglement ([4]).
e−H/kB T
ρ=
Z
where the Hamiltonian H expresses the short range interaction involving
only a few spins close to each other. The area law can be obtained both at
zero and finite temperature.
In the first case, the state ρ reduces to a projector onto the ground
subspace, whose vector basis {|Ψi i} fulfills H|Ψi i = E0 |Ψi i where E0 is the
ground state energy. If the ground state is not degenerate, which we will
assume, we will just use a pure state |Ψ0 i to denote the ground state. We
consider a connected region A of the lattice with a smooth boundary and the
complementary region B, and concentrate on the amount of entanglement
between these two regions, and its growth as we make region A larger and
larger. Since we are dealing with a bipartite system of a pure state |Ψ0 i,
we can evaluate such amount through the entropy of entanglement E(Ψ0 ) =
S(ρA ). In general, since entropy is an extensive quantity, one would expect
that it scales with the number of spins in region A. However, for ground
states of Hamiltonians as we are considering here, this seems not to be the
case. Instead, the entanglement (entropy of entanglement) scales not with
the volume of region A, but with its boundary area, and thus the name area
law.
In the second case, the one with finite temperature, ρ is in general a
bipartite mixed state, hence its amount of entanglement cannot be mea-
sured by the entropy of entanglement (see discussion in subsection 3.3.1).
Nevertheless, one can still derive an analogous area law by measuring the
amount of correlations between regions A and B through the quantum mu-
tual information S(A : B). Note that at zero temperature, i.e. pure states,
the quantum mutual information reduces (up to a factor two) to the entan-
glement entropy, providing us with a suitable generalization of the previous
area law.
44
decay with distance, but there’s no upper bound to the total number of
correlations a single particle can establish with the others of the system. In-
stead, in 2006 Osborne and Verstraete proved the Coffman-Kundu-Wootters
(CKW) monogamy inequality, which states that given a multipartite system
with subsystems (qubits) A, B1 , B2 , . . . then it results:
45
Chapter 4
Conclusions
In this paper we have briefly given insight into the world of entanglement
and entanglement entropy.
In the first chapter we introduced the notion of density operator and ana-
lyzed its properties in order to show, in the following chapter, how Quantum
Mechanics could be reformulated in the density operator picture. As we
highlighted, this new formulation is equivalent to the most-known one; how-
ever, the density operator approach really shines for two applications: the
description of quantum systems whose state is not known, and the descrip-
tion of subsystems of a composite quantum system. Since the entanglement
of a system arises as a quantum correlation between two or more subsystems,
the density operator picture is fundamental in its description.
In the third chapter we presented the basic concepts regarding entangle-
ment of bipartite and multipartite systems, both for pure and mixed states.
As we have argued, whereas entanglement for bipartite systems is well es-
tablished, for multipartite ones there exist many possibilities of defining
entanglement measures. It may turn out that one state is more entangled
according to one measure, but not according to a different one.
Nevertheless, we have highlighted two measures of entanglement/correlations
which have a clear physical meaning. The first one, the entropy of entan-
glement, applies to bipartite pure states and is given by the von Neumann
entropy of the reduced density operator of one of the subsystems. The
second one, the quantum mutual information, measures the correlations be-
tween the two subsystems and for pure states it reduces to the entropy of
entanglement. One can apply these two measures to multi-spin systems in
lattices by separating all the spins into two disjoint regions which are then
considered as a bipartite system. In the particular case where we deal with
ground or thermal states of short range interaction Hamiltonians, the appli-
cation of those measurements gives rise to area laws. These laws state that
the quantum mutual information, between a region A and its complemen-
tary, scales with the number of spins at the boundary of A (and not, as one
46
would expect, with the total number of spins in A).
Another peculiar characteristic of entanglement that distinguishes it from
classical correlations is its monogamy, according to which each subsystem
can be strongly entangled only with one other subsystem, as there is an
upper bound to the sum of its quantum correlations.
Entanglement and its quantification have been primary issues in the
study of black holes by means of quantum field theory and in the science
of quantum information, for which entanglement is an important resource.
Since these two subjects naturally overlap in the context of many-body
systems (in particular close to criticality), it soon became apparent that
there is a rich structure to be uncovered by studying entanglement entropy
in this context, and by putting together ideas from field theory and quantum
information science; much has been done, much more is yet to come.
47
Appendices
48
Appendix A
The most general spin state of a 12 -spin particle is represented, in the eigen-
basis of operator Sz , by the spinor:
a
χ= = aχ+ + bχ− (A.1)
b
where a and b are two arbitrary complex numbers. Requiring the normal-
ization to one, a and b have to satisfy the relation: |a|2 + |b|2 = 1. The
remarkable fact is that spinor (A.1) represents a state in which the spin is
parallel to a precise direction, defined by the versor n̂ = n̂(ϑ, ϕ), where ϑ
and ϕ are obtained as follows:
In order to verify the validity of this assertion, let’s start by observing that
it’s not very meaningful to talk about a precise direction defined by the
two coefficients a and b, since they are probability amplitudes. What we
can actually show is that the spinor χ is an eigenvector of the operator
σn = ~σ · n̂ with corresponding eigenvalue +1. This means that measuring σn
(i.e. the spin) along direction n̂ yields the result +1 with √ 100% probability
of success; however, being the modulus of the spin vector 31 , we can only
conclude that the spinor is situated on a cone with axis n̂, and not precisely
along n̂. Consequently it is not χ’s direction that is fixed, but rather the
direction on which the projection of the spin vector has the fixed value +1.
We could interpret n̂ as the average direction of spinor χ.
Let’s determine the eigenvector of σn and show that it coincides with (A.1).
Given the following components for versor n̂:
49
and using the explicit form of Pauli’s matrices, it results:
cos ϑ sin ϑe−iϕ
σn =
sin ϑeiϕ − cos ϑ
Note that σn 2 = I, hence according to Dirac’s theorem the operator σn has
eigenvalues λ = ±1. This means that measuring the spin of a 12 -particle
along an arbitrary direction n̂, the only possible results are +h̄/2 and −h̄/2.
We define the eigenspinor associated to the eigenvalue λ = 1 as:
+ a
χn = (A.3)
b
and impose that:
cos ϑ sin ϑe−iϕ
a a
=
sin ϑeiϕ − cos ϑ b b
which implies the following equalities:
a cos ϑ + b sin ϑe−iϕ = a
a sin ϑeiϕ − b cos ϑ = b
and therefore:
sin ϑ iϕ sin(ϑ/2) iϕ
b=a e =a e (A.4)
1 + cos ϑ cos(ϑ/2)
This, substituted in the normalization condition yields:
|a|2 (1 + tan2 (ϑ/2)) = 1
and thus:
|a| = cos(ϑ/2) (A.5)
We then can choose a = cos(ϑ/2) which implies b = sin(ϑ/2)eiϕ thanks
to (A.4). In this way, up to an arbitrary phase factor, the spinor (A.3) is
completely determined:
χ+ iϕ
n = cos(ϑ/2)χ+ + sin(ϑ/2)e χ−
The fact that any spinor like (A.1) represents a spin state with a definite
polarization has an obvious explanation: the spinor is identified by two
complex numbers, or equivalently four real numbers, but only two of these
are essential in defining the state described by χ. One is eliminated by
the normalization condition, and another is incorporated in an arbitrary
phase factor and doesn’t play any role. The two remaining parameters can
always be put in biunivocal correspondence with two polar angles ϑ and
ϕ, associating to the state a definite polarization direction identified by the
versor n̂ = n̂(ϑ, ϕ) ([1] for further reading).
50
Appendix B
51
Appendix C
52
(in quantum information theory the natural logarithm is substituted with
log2 ). For instance, a constant distribution maximizes H(X) since each
outcome is equally probable, while a delta distribution pk = δk,k0 minimizes
H(X) since there’s only one possible issue for a measure. An event that can
never occur should not contribute to the entropy, so by convention we agree
that 0 ln 0 = 0. We could immediately use Shannon Entropy to quantify the
uncertainty about the distribution of pure states composing the mixed state
ρ, that is, the “amount of mixedness” of such state, hence making Shan-
non Entropy a measure of the entanglement of the whole system of which ρ
represents a subsystem. However, since the same operator ρ can be written
in many different ways –due to the non-uniqueness of the decomposition
of mixed states as convex combinations of pure states– we would face the
problem of deciding which decomposition provides the classical probability
distribution employed in computing the Shannon Entropy of the selected
mixed state. This problem is solved by defining a generalization of Shan-
non Entropy that is suitable for quantum systems, with density operators
replacing probability distributions, the von Neumann Entropy:
S(ρ) = −Tr(ρ ln ρ)
X = {d1 2 , d2 2 , ...dn 2 }
X
H(X) = − dk 2 ln dk 2
k
S(ρ) = −Tr(ρ ln ρ) = (choosing for ρ the diagonalized matrix: δkj dk 2 )
X
= − dk 2 ln dk 2
k
⇒ S(ρ) = H(X)
53
Bibliography
54