Abstract
We study the asymptotic growth of the entanglement entropy of ground states of non-interacting (spinless) fermions in \({{\mathbb {R}}}^3\) subject to a constant magnetic field perpendicular to a plane. As for the case with no magnetic field we find, to leading order \(L^2\ln (L)\), a logarithmically enhanced area law of this entropy for a bounded, piecewise Lipschitz region \(L\Lambda \subset {{\mathbb {R}}}^3\) as the scaling parameter L tends to infinity. This is in contrast to the two-dimensional case since particles can now move freely in the direction of the magnetic field, which causes the extra \(\ln (L)\) factor. The explicit expression for the coefficient of the leading order contains a surface integral similar to the Widom–Sobolev formula in the non-magnetic case. It differs, however, in the sense that the dependence on the boundary, \(\partial \Lambda \), is not solely on its area but on the “surface perpendicular to the direction of the magnetic field”. We utilize a two-term asymptotic expansion by Widom (up to an error term of order one) of certain traces of one-dimensional Wiener–Hopf operators with a discontinuous symbol. This leads to an improved error term of the order \(L^2\) of the relevant trace for piecewise \(\textsf{C}^{1,\alpha }\) smooth surfaces \(\partial \Lambda \).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In recent years, entanglement entropy (EE) has become an important and intensively studied quantity of states of many-particle quantum systems. For an introduction to this topic, we refer to [3, 6, 15]. In this paper, we study the EE of ground states of the ideal Fermi gas in a magnetic field in three-dimensional Euclidean space, \({{\mathbb {R}}}^3\), see [18]. The two-dimensional Fermi gas in a constant magnetic field was recently analyzed in [5, 21], starting from the earlier work in [31]. Here, a strict area-law holds, while for the free Fermi gas in any dimension d a logarithmically enhanced area-law is valid, see [13, 20]. Stability of these area-laws has been proved in [25, 26] for \(d\ge 2\) and in [29] in the sense that adding a “small” electric or magnetic potential to the Hamiltonian does not change the leading asymptotics of the entropy. The one-dimensional case seems to be still open (for the \(\gamma \)-Rényi entropy with \(\gamma \le 1\)).
There is an extensive literature on EE by now with many fascinating connections and implications to related fields. Here, we only mention and refer to a small fraction of mathematical results. In [30], an enhanced area-law was proved for the one-dimensional free Fermi gas in a periodic potential; the higher-dimensional case remains an open problem. By the work in [8, 24, 28], we understand EE in Anderson-type models on the lattice. An extension to the EE of positive temperature equilibrium states (of the ideal Fermi gas) was presented in [22, 23, 35]. Finally, we mention results on the XY and XXZ quantum spin chain [1, 4, 7, 11, 12, 16].
By a (strict) area-law for a ground state of the infinitely extended Fermi gas, say in \({{\mathbb {R}}}^d\) with spatial dimension \(d\in {\mathbb {N}}\), we mean that the entanglement (or local) entropy of this state reduced to the scaled (bounded) region \(L\Lambda \) grows to leading order like \(L^{d-1}{\mathcal {H}}^2(\partial \Lambda )\) as the dimensionless real parameter L tends to infinity. Here, \({\mathcal {H}}^2(\partial \Lambda )\) is the (Hausdorff) surface area of the boundary \(\partial \Lambda \). If there is an extra \(\ln (L)\) factor in this leading asymptotics, then we call it a logarithmically enhanced area-law.
Whether one should expect a strict area-law or an enhanced area-law is related to the spectral properties of the one-particle Hamiltonian of the non-interacting many-particle Fermi gas. If the off-diagonal part of the integral kernel of the corresponding spectral (Fermi) projection has a fast decay (e.g., exponential), then we expect a strict area-law to hold. It is not difficult to argue for that (see [28]) but to compute and finally prove the precise leading coefficient has only been accomplished in special cases. On the other hand, if the decay of the off-diagonal part of the integral kernel is weak (e.g., inverse linear), then we can expect an enhanced area-law. In the present model, we have a mixture. Namely, we have an exponential decay in the planar coordinate (orthogonal to the magnetic field) and a \(1/|\cdot |\) decay in the longitudinal coordinate along the magnetic field. The latter prevails and leads to a logarithmically enhanced area-law. Our main result is formulated in Theorem 4.1.
As in previous proofs there are two parts to proving such a result. Firstly, we prove a two-term asymptotic expansion for polynomials (see Theorem 2.3). Due to the product structure of the ground state, see (2.7), we can dimensionally reduce the asymptotics of a three-dimensional problem to an asymptotic expansion of a one-dimensional problem with localizing sets \(L\Lambda _{x^\perp }\subset {{\mathbb {R}}}\) and with the spectral projection of the one-dimensional Laplacian, see Lemma 3.2. The corresponding asymptotic expansion was already proved by Landau and Widom [19] and then improved by Widom [36]. But here we need to take care of the error term which depends on the planar coordinate \(x^\perp \in {{\mathbb {R}}}^2\) and integrate over \(x^\perp \). To this end, we show that the error term is of order one and is integrable as a function of \(x^\perp \) under some assumptions on \(\Lambda \). We believe that the precise description of the error term for the one-dimensional free case in terms of the finite collection of intervals \(\Lambda _{x^\perp }\) is of independent interest and we provide a proof in Appendix C. This dimensional reduction is also the strategy of Widom [37] and of Sobolev in the proof of the Widom conjecture in [32]. In fact, due to the fast (exponential) decay in the planar direction error estimates are simpler to obtain than in the case with no magnetic field. This and the improved Landau–Widom (or Widom) asymptotics allows us to prove for \(\textsf{C}^{1,\alpha }\) (smooth) regions \(\Lambda \) an error term (for polynomials as in Theorem 2.3) of the order \(L^2\) rather than merely of lower order than \(L^2\ln (L)\) in [32, Theorem 2.9].
Secondly, in Sect. 4 we make the transition in the asymptotic expansion from polynomials to the entropy function. This requires certain Schatten–von Neumann quasi-norm bounds presented in Sect. 5, which in turn are based on bounds obtained in previous papers [20, 21] and notably by Sobolev [34].
The smoothness conditions on the region \(\Lambda \) to prove our two-term asymptotic result with error term \(o(L^2\ln (L))\) are rather weak; namely, we require \(\Lambda \) to be only piecewise Lipschitz smooth. For a smooth region \(\Lambda \), one would expect the next lower order term to be of the order \(L^2\). This is indeed true if the boundary \(\partial \Lambda \) is piecewise \({\textsf{C}}^{1,\alpha }\) smooth. We also present regions with weaker regularity on the boundary for which the error term (for a quadratic polynomial) can be arbitrarily close to the leading \(L^2\ln (L)\)-term. This may also be of independent interest and is the content of Sect. 6.
A note on our notation: As L, \(L\ge 1\), is our scaling parameter that tends to infinity, we use the “big-O” and “small-o” notation in the sense that for two functions f and g on \({{\mathbb {R}}}^+\), \(f = O(g)\) if \(\limsup _L f(L)/g(L) <\infty \) and \(f = o(g)\) if \(\limsup _L f(L)/g(L) =0\). By C with or without indices, we denote various positive, finite constants, whose precise values is of no importance, and may even change from line to line.
2 Setup
We consider a nonzero constant magnetic field in \({{\mathbb {R}}}^3\) of strength B which is perpendicular to a plane. We assume without loss of generality that this constant magnetic field points in the positive z-direction with \(B>0\).
We denote the Euclidean norm in \({{\mathbb {R}}}^d\), \(d\in {\mathbb {N}}\), or the norm in the Hilbert-space \(\text{ L}^2({{\mathbb {R}}}^d)\) of complex-valued, square-integrable functions on \({{\mathbb {R}}}^d\) by the same symbol \(\Vert \cdot \Vert \). For \(x \in {{\mathbb {R}}}\), let \(\langle x \rangle :=\sqrt{ 1+ x^2}\) denote the Japanese bracket. For a Borel set \(\Omega \subset {{\mathbb {R}}}^d\) and \(k <d\), let \({\mathcal {H}}^k(\Omega )\) be the k-dimensional Hausdorff measure of \(\Omega \), \(\#\Omega ={\mathcal {H}}^0(\Omega )\) its counting measure, and let \(|\Omega |\) be its d-dimensional Lebesgue measure/volume. By \(\mathbb {1}_{\Omega }\) we denote the multiplication operator on \({\textsf{L}}^2({{\mathbb {R}}}^d)\) by the indicator function \(1_\Omega \) of the set \(\Omega \). As usual, we write for the complement .
For \(r>0\), \(x\in {{\mathbb {R}}}^d\), and a set \(X\subset {{\mathbb {R}}}^d\) we denote by
the open ball of radius r with center x and the (open) r-neighborhood of the set \(X\subset {{\mathbb {R}}}^{d}\) of width r, respectively. In most cases, the dimension, d, is clear from the context and we omit it in the definition; if not, we write \(B^{(d)}_r(x)\). We denote the closed ball of radius r with center x by \(\overline{B}_r^{(d)}(x)\).
For a point \(x\in {{\mathbb {R}}}^3\), we write \(x = (x^\perp ,x^\parallel )\) with (planar coordinate) \(x^\perp \in {{\mathbb {R}}}^2\) and (longitudinal coordinate) \(x^\parallel \in {{\mathbb {R}}}\), and \(\nabla = (\nabla ^\perp ,\nabla ^\parallel )\), where \(\nabla ^\perp \) and \(\nabla ^\parallel \) are the gradients in the respective Cartesian coordinates.
By our assumption, the magnetic field is equal to \(B\cdot e_3\) with \(e_3 :=(0,0,1)\). We use the symmetric gauge \(a:{{\mathbb {R}}}^2\rightarrow {{\mathbb {R}}}^2\) defined as \(a(x^\perp ) :=B/2\,(-x^\perp _2,x^\perp _1)\) so that the curl
The one-particle Hamiltonian of the ideal Fermi gas in three-dimensional Euclidean space \({{\mathbb {R}}}^3\) subject to the magnetic field \(B\cdot e_3\) is informally given by
We use physical units such that Planck’s constant \(\hbar = 1\), the mass is equal to 1/2 and the charge of the particles is equal to one. \(\textrm{H}_B\) is well defined as a self-adjoint operator on a suitable domain in the one-particle Hilbert space \(\textsf{L}^2({{\mathbb {R}}}^3)\).
The ground state of free fermions with one-particle Hamiltonian \(\textrm{H}_B\) is described by the spectral projection (or Fermi projection) \(\textrm{D}_\mu :=\mathbb {1}(\textrm{H}_B\le \mu ) :=1_{(-\infty ,\mu ]}(\textrm{H}_B)\) of \(\textrm{H}_B\) below some so-called Fermi energy (or chemical potential) \(\mu \in {{\mathbb {R}}}\). As is well-known, we have [10, 18]
with explicitly known (infinite-dimensional) eigenprojections \(\textrm{P}_\ell \) on \(\textsf{L}^2({{\mathbb {R}}}^2)\). In order to write down these projections, let us introduce the Laguerre polynomials, \(\mathcal {L}_\ell (t) :=\sum _{j=0}^\ell \frac{(-1)^j}{j!}\, \left( {\begin{array}{c}\ell \\ \ell - j\end{array}}\right) \, t^j\), \(t\ge 0\), of degree \(\ell \in {\mathbb {N}}_0\). Then, the integral kernel of \(\textrm{P}_\ell \) is given by the function
Here, \(\wedge \) refers to the exterior or wedge product on \({{\mathbb {R}}}^2\). The explicit description of this kernel is not relevant for this paper. We only use the exponential decay in \(\Vert x^\perp -y^\perp \Vert ^2\) and \(p_\ell (x^\perp ,x^\perp )=B/(2 \pi )\). In the z-direction, we meet the spectral projection \(\mathbb {1}((-\nabla ^\parallel )^2\le \mu )\) with (sine) integral kernel, \(\mathbb {1}((-\nabla ^\parallel )^2\le \mu )(z,z')=k_\mu (z-z')\),
The following factorization of spectral projections is crucial, which stems from the fact that the magnetic field is pointing in the z-direction. We work with the identification \({\textsf{L}}^2({{\mathbb {R}}}^2) \otimes {\textsf{L}}^2({{\mathbb {R}}})={\textsf{L}}^2({{\mathbb {R}}}^3)\). Since the spectrum of \(\textrm{H}_{B}\) is the set \([B,\infty )\), we may always consider \(\mu > B\) since for smaller values of \(\mu \) the ground state is zero. If \(B<\mu \le 3B\) then \(\textrm{D}_\mu = \textrm{P}_0 \otimes \mathbb {1}[(-\textrm{i}\nabla ^\parallel )^2 \le \mu - B]\). For higher values of \(\mu \), let \(\nu :=\lceil \frac{1}{2}(\mu /B -1)\rceil \in {\mathbb {N}}\) be the smallest integer larger or equal to \(\frac{1}{2}(\mu /B -1)\), and let us set \(\mu (\ell ) :=\mu - B(2\ell +1)\). Then,
with integral kernel (\(x=(x^\perp ,x^\parallel ), y=(y^\perp ,y^\parallel )\))
For any Borel subset \(\Lambda \subset {{\mathbb {R}}}^3\) we define the spatial reduction (or truncation) of \(\textrm{D}_\mu \) to \(\Lambda \) by
Before we define the main object in this paper, we introduce for any \(\gamma >0\) the \(\gamma \)-Rényi entropy function, \(h_\gamma :[0,1]\rightarrow [0,\ln (2)]\),
Now, for a ground state described by the projection \(\textrm{D}_\mu =\mathbb {1}(\textrm{H}_B\le \mu )\) as above, a Borel subset \(\Lambda \subset {{\mathbb {R}}}^3\), and localized ground-state projection, \(\textrm{D}_\mu (\Lambda ) \), we define the \(\gamma \)-Rényi entanglement entropy of the ground state at Fermi energy \(\mu \) localized (in space) to \(\Lambda \) by
Here, \(\textrm{tr}\,\) refers to the (usual Hilbert space) trace on \(\textsf{L}^2({{\mathbb {R}}}^d)\). For bounded \(\Lambda \), \(h_\gamma (\textrm{D}_\mu (\Lambda ))\) is trace-class by the same arguments as in the proof of Lemma 7 in [21]; thus, the entanglement entropy \(\mathrm S_\gamma (\Lambda )\) is trivially a positive number. This entropy is a rather complicated function of \(\Lambda \), but there is a chance to describe it for large regions. To this end, we scale a fixed set \(\Lambda \) by \(L, L \ge 1\), and we determine the leading growth (scaling) of the entropy \(\mathrm S_\gamma (L\Lambda )\) as \(L\rightarrow \infty \).
As there does not seem to be a common definition for regions with piecewise differentiable boundary, we will now provide the one used in this paper.
Definition 2.1
Let \(0< \alpha <1, d\in {\mathbb {N}}\). A region \(\Lambda \subset {{\mathbb {R}}}^{d+1}\) is a finite union of bounded, open, connected sets in \({{\mathbb {R}}}^{d+1}\) such that their closures (denoted by \(\bar{\cdot }\)) are disjoint. The boundary \(\partial \Lambda \) is the set \(\bar{\Lambda }{\setminus }\Lambda \). We assume that the closures \(\overline{\Lambda }\) and are topological manifolds with boundary \(\partial \Lambda \).
We call a bi-LipschitzFootnote 1 map \(\Psi :[0,1]^d \rightarrow \partial \Lambda \) a Lipschitz chart of \(\partial \Lambda \) if \(\Psi ((0,1)^d) \subset \partial \Lambda \) is relatively open. If in addition \(\Psi \in \textsf{C}^1((0,1)^d)\) and its differential \(D\Psi \) satisfies the Hölder condition
for some constant C, we say that \(\Psi \) is a \(\textsf{C}^{1,\alpha }\) chart. A finite set of charts \((\Psi _i)_{i \in I}\) is called a piecewise atlas of \(\partial \Lambda \) if \( \partial \Lambda =\bigcup _{i\in I} \Psi _i([0,1]^d)\), and a global atlas of \(\partial \Lambda \) if \( \partial \Lambda =\bigcup _{i\in I} \Psi _i((0,1)^d)\). We say an atlas is a Lipschitz atlas (resp. \(\textsf{C}^{1,\alpha }\)) if it consists of Lipschitz (resp. \(\textsf{C}^{1,\alpha }\)) charts.
We say that \(\Lambda \) is a piecewise Lipschitz region (resp. global Lipschitz region) if \(\partial \Lambda \) admits a piecewise Lipschitz atlas \((\Psi _{\text {pL},i})_{i \in I}\) (resp. global Lipschitz atlas \((\Psi _{\text {gL},i})_{i \in I}\)). We call \(\Lambda \) a piecewise \(\textsf{C}^{1,\alpha }\) region if it admits both a global Lipschitz atlas \((\Psi _{\text {gL},j})_{j \in J}\) and a piecewise \(\textsf{C}^{1,\alpha }\) atlas \((\Psi _{\text {pC},i})_{i \in I}\).
For a piecewise \(\textsf{C}^{1,\alpha }\) region \(\Lambda \), we fix a piecewise \(\textsf{C}^{1,\alpha }\) atlas \((\Psi _{\text {pC},i})_{i \in I}\) and define the set of all edges, \(\Gamma \) by
Remarks 2.2
-
(i)
Any global Lipschitz region is obviously a piecewise Lipschitz region.
-
(ii)
Our definition of a global Lipschitz region is a bit more general than the usual notion of a strong Lipschitz region (see [2, Pages 66–67]), where every \(v \in \partial \Lambda \) has a neighborhood \(U_v \subset {{\mathbb {R}}}^{d+1}\) such that, after an affine-linear transformation, the set \(\Lambda \cap U_v\) looks like the graph below a Lipschitz function \(\Psi _v :(0,1)^d \rightarrow {{\mathbb {R}}}\). To get to our definition from this, one can choose the graph function \(x \mapsto (x, \Psi _v(x))\) on \((0,1)^d\) as the bi-Lipschitz function needed in our definition. (As a Lipschitz function, it naturally extends to all of \([0,1]^d\).)
-
(iii)
For a piecewise Lipschitz region \(\Lambda \subset {{\mathbb {R}}}^{d+1}\) and for \(v \in \partial \Lambda \), let n(v) be the unit outward normal vector at v. This is only well defined up to null sets with respect to the d-dimensional Hausdorff (surface) measure \({\mathcal {H}}^{d}\) on \(\partial \Lambda \), see Lemma A.6.
-
(iv)
As the set of edges, \(\Gamma \), depends on the piecewise \(\textsf{C}^{1,\alpha }\) atlas \(\Psi _{\textrm{pC},i}\) it may be a different set depending on the atlas.
For a continuous function \(f:[0,1]\rightarrow {\mathbb {C}}\) with \(f(0)=0\) and being Hölder continuous at the two endpoints 0 and 1, we introduce the linear functional
By our assumption, \(|{\textsf{I}}(f)|<\infty \). We note for later use two special cases. Namely, \(\textsf{I}(m):=\mathsf I((\cdot )^m) = -1/(4\pi ^2)\,\sum _{r=1}^{m-1}r^{-1}\); as usual we interpret the sum on the right-hand side as zero if \(m=1\), which coincides with the vanishing of \(\textsf{I}\) on affine linear functions. The second example concerns the \(\gamma \)-Rényi entropy function \(h_\gamma \) defined in (2.10). Here, \(\mathsf I(h_\gamma ) = (1+\gamma )/(24\gamma )\), see [20].
Our first main result is the following theorem, which we prove in the next section.
Theorem 2.3
Let \(f:[0,1]\rightarrow {\mathbb {C}}\) be a polynomial with \(f(0)=0\), let \(\Lambda \subset {{\mathbb {R}}}^3 , \mu>B>0, \nu :=\lceil \frac{1}{2}(\mu /B -1)\rceil \in {\mathbb {N}}\), the smallest integer larger or equal to \(\frac{1}{2}(\mu /B -1)\), and \(\mu (\ell ) :=\mu - (2\ell +1)B\). Let \( \textrm{D}_\mu (L\Lambda ) \) be the operator defined in (2.9).
-
(i)
If \(\Lambda \) is a piecewise Lipschitz region (see Definition 2.1), then we have the asymptotic expansion of the trace on \(\textsf{L}^2({{\mathbb {R}}}^3)\),
$$\begin{aligned} \textrm{tr}\,f(\textrm{D}_\mu ({L\Lambda }))&= L^3 \frac{B}{2\pi ^2} \,\sum _{\ell =0}^{\nu -1} \sqrt{\mu (\ell )} f(1) |\Lambda | \nonumber \\&+ L^2\ln (L) \nu B \,\textsf{I}(f) \,\frac{1}{\pi } \int _{\partial \Lambda } \textrm{d} {\mathcal {H}}^2(v) \,|n(v) \cdot e_3 |+ o(L^2\ln (L)) , \end{aligned}$$(2.16)as \(L\rightarrow \infty \). Here, n(v) is the unit normal outward vector at \(v\in \partial \Lambda \), which is well defined for almost every \(v \in \partial \Lambda \), and \({\mathcal {H}}^2\) is the two-dimensional (surface) Hausdorff measure on \(\partial \Lambda \).
-
(ii)
If \(\Lambda \) is a piecewise \(\textsf{C}^{1,\alpha }\) region (see Definition 2.1), then the error term is \(O(L^2)\) instead of \(o(L^2\ln (L))\).
Remarks 2.4
-
(i)
The condition \(f(0)=0\) is no restriction in the sense that in general the operator on the left-hand side has to be replaced by \(f(\textrm{D}_\mu ({L\Lambda })) - f(0)\textrm{D}_\mu ({L\Lambda })\) and \(\textsf{I}(f)\) on the right-hand side by \(\textsf{I}(\tilde{f})\) with \(\tilde{f}(t):= f(t) - (1-t)f(0)\).
-
(ii)
For the ideal Fermi gas with one-particle Hamiltonian \(\textrm{H}_0 = -\Delta \) on \(\textsf{L}^2({{\mathbb {R}}}^3)\), Fermi energy \(\mu >0\), ground state Fermi projection \(\textrm{D}_\mu = \mathbb {1}(-\Delta \le \mu )\) and Fermi sea \(\Gamma :=\{p\in {{\mathbb {R}}}^3 : p^2\le \mu \}\) it was proved in [20] that
$$\begin{aligned} \textrm{tr}\,f(\textrm{D}_\mu (L\Lambda )) = L^3 f(1) |\Gamma /(2\pi ) ||\Lambda |+ L^2\ln (L) \,\frac{\mu }{2\pi }\,\textsf{I}(f) \, {\mathcal {H}}^2(\partial \Lambda ) + o(L^2\ln (L))\nonumber \\ \end{aligned}$$(2.17)as \(L\rightarrow \infty \). To this end, note that \(|\Gamma | = \frac{4\pi }{3} \mu ^{3/2}\) and that our functional \(\textsf{I}\) here is the same as the functional I in [20]. The double-surface integral \(J(\partial \Gamma ,\partial \Lambda )\) [20, (2)] equals \(\frac{\mu }{2\pi }\mathcal H^{2}(\partial \Lambda )\). Letting B tend to zero in (2.16) but keeping the Fermi energy \(\mu \) fixed, the prefactor \(\nu B\) tends to \(\mu /2\). The remaining integral over \(\partial \Lambda \) is independent of the strength B and remains fixed. For the volume term, we have in this limit
$$\begin{aligned}&\frac{B}{2\pi ^2}\sum _{\ell =0}^{\nu -1} \sqrt{\mu - (2\ell +1)B} \sim \frac{\mu ^{3/2}}{4\pi ^2 \nu } \\&\quad \sum _{\ell =0}^\nu \sqrt{1-\ell /\nu } \sim \frac{\mu ^{3/2}}{4\pi ^2} \int _0^1\text{ d }x\,\sqrt{x} = \frac{\mu ^{3/2}}{6\pi ^2} . \end{aligned}$$In this limit the volume term equals the above volume term at \(B=0\) as in (2.17). To summarize, we obtain
$$\begin{aligned} \lim _{B\downarrow 0} \text{ rhs } \text{ of } (2.16)&= L^3 f(1) \frac{\mu ^{3/2}}{6\pi ^2}|\Lambda | + L^2\ln (L) \frac{\mu }{2\pi } \,\mathsf I(f) \int _{\partial \Lambda } \textrm{d}{\mathcal {H}}^2(v) \,|n(v)\cdot e_3| \\&\quad + o(L^2\ln (L)) , \end{aligned}$$which is identical to the right-hand side (rhs) of (2.17) except for the prefactor depending on \(\partial \Lambda \).
-
(iii)
There is no ’level mixing’ at the order in \(L^2\ln (L)\) in the sense that each Landau level enters individually in the numerical coefficient. In [21], we proved that level mixing occurs in the two-dimensional setting at the next-to-leading order, namely at the order L. We expect level mixing to occur in the present case at the order \(L^2\). This is certainly possible to prove, say for a cylindrical region, but it requires a three-term expansion in the \(x^\parallel \)-coordinate and the by now proved two-term expansion in the \(x^\perp \)-coordinate [21]. The caveat for us to proceed with this question is that the mentioned three-term expansion has not been proved so far for the entropy function. This is an interesting open problem.
-
(iv)
For (2.16) to hold we require only weak regularity of the boundary \(\partial \Lambda \) like in the proof in [20] for the ideal Fermi gas. In contrast, the proof of the corresponding two-term asymptotics for the two-dimensional model in [21] required \(\textsf{C}^3\) smooth regions. This smoothness was a technical condition and may not be necessary. On the other hand and more importantly, only the leading contribution of the two-dimensional Hamiltonian enters and the extra logarithm stems from an expansion in the longitudinal direction, where weaker conditions suffice.
3 Proof of Theorem 2.3
We split the proof into two steps. The first one is the lemma below, which reduces the computation of the trace to an integral of the trace of the projection operator \(\mathbb {1}[(-\textrm{i}\nabla ^\parallel )^2 \le \mu ]\) localized to the sets \(L\Lambda _{x^\perp }\subset {{\mathbb {R}}}\) with respect to \(x^\perp \in {{\mathbb {R}}}^2\). The second step starts from there, proves an asymptotic expansion of this trace, and finishes the proof of Theorem 2.3.
Definition 3.1
For any Borel set \(E\subset {{\mathbb {R}}}^3\) and any \(x^\perp \in {{\mathbb {R}}}^2\) we define \(E_{x^\perp } :=\{x^\parallel \in {{\mathbb {R}}}: (x^\perp ,x^\parallel )\in E\}\) to collect the third components of the intersection \(E\cap (\{x^\perp \}\times {{\mathbb {R}}})\).
Lemma 3.2
Let \(m \in {\mathbb {N}}\) with \(m \ge 2\). Then, under the same conditions as in Theorem 2.3(i), there is a constant C depending only on B, m and \(\mu \) such that
where the \(\Lambda \) dependent constant \({\mathcal {K}}(\Lambda )\) is defined in Lemma A.3; it is positive and finite for any piecewise Lipschitz region \(\Lambda \). Note that \(L \Lambda _{x^\perp } :=L \left( \Lambda _{x^\perp } \right) \) is (in general) different from \((L \Lambda )_{x ^\perp }\).
Proof
We utilize the same changes of coordinates in the first two components (that is, for the planar \(x_0^\perp \)-coordinates) as in [21]. For the convenience of the reader we repeat all steps.
As \(\Lambda \) is bounded, the operator \(\mathbb {1}_{L\Lambda } \textrm{D}_\mu \) is Hilbert–Schmidt and therefore \(\textrm{D}_\mu (L \Lambda )\) is trace-class. We may write
with integral kernel
as in (2.8). Therefore, the trace is of the form
We begin by approximating \(1_{L\Lambda }(x_j)\) by \(1_{L\Lambda }(x_0^\perp ,x_j^\parallel )\). We call the resulting approximate term \(T(L\Lambda )\). This means
As the second line is independent of \(x_j^\perp \), the integrals over \(x_1^\perp , \ldots , x_{m-1}^\perp \) can be easily resolved and yield the diagonal of the integral kernel of the operator \(\textrm{P}_{\ell _1} \cdots \textrm{P}_{\ell _{m}}\) at \(x_0^\perp \), which is \(B/(2\pi )\), if \(\ell _1= \dots =\ell _m\) and 0 otherwise. Thus, we have
Now, we set \(x^\perp :=x_0^\perp /L \) and observe \(1_{L \Lambda }(x^\perp _0, x_j^\parallel )=1_{L \Lambda _{x ^\perp }}(x_j^\parallel )\). Therefore, we have
which is the expression in the claim. Thus, we are left to bound the error term of our approximation. Let us denote by \(U \subset {{\mathbb {R}}}^{3m}\) the set of all tuples \((x_0,x_1, \dots ,x_{m-1})\) where \(1_{L\Lambda } (x_0) 1_{L \Lambda } (x_1) \cdots 1_{L \Lambda } (x_{m-1}) \) is not equal to \(1_{L\Lambda } (x_0) 1_{L \Lambda } (x_0^\perp ,x_1^\parallel ) \cdots 1_{L \Lambda } (x_0^\perp ,x_{m-1}^\parallel )\). Then, using the notation \(x_m :=x_ 0\) we trivially have
We will now enlarge U until we get a set where the integral can easily be calculated. Let \((x_0,x_1,\dots , x_{m-1}) \in U\). Then, there is a \(j \in \{ 1, \dots , m-1\}\) such that \(1_{L \Lambda } (x_j) \ne 1_{L \Lambda } (x_0^\perp , x_j^\parallel )\). Thus, the line between \(x_j\) and \((x_0^\perp ,x_j^\parallel )\) has to intersect the boundary \(L \partial \Lambda \), which implies \({\text {dist}}(x_j, L \partial \Lambda ) \le \Vert x_j^\perp - x_0^\perp \Vert \). By the triangle and mean inequalities, we observe that
For \(j \in \{0, \dots , m-1\}\), let \(U_j \subset {{\mathbb {R}}}^{3m}\) be the set of all \((x_0,x_1, \dots , x_{m-1})\in {{\mathbb {R}}}^{3m}\) satisfying
As \(U \subset \bigcup _{j=1}^{m-1} U_j\), we see that
The cyclic parameter shift \((x_0,x_1, \dots , x_{m-1}) \mapsto (x_1, x_2, \dots , x_0)\) sends \(U_j\) to \(U_{j+1}\) and does not change the integrand. For \(1 \le j \le m\), let \(y_j :=x_j- x_{j-1}\). We will change variables from \((x_0,x_1, \dots , x_{m-1})\) to \((x_0,y_1, \dots , y_{m-1}) =:(x_0, {\textbf{y}})\). Using \(y_m^\perp = - \sum _{j=1}^{m-1} y_j^\perp \), similar to (3.5), we observe that
Thus, under this change of variables the set \(U_0\) is mapped into the set
Let us first estimate the integrand in terms of the \(y_j\)’s. With (2.8), (2.5) and (2.6), we get
We recall that \(\langle x \rangle =\sqrt{1+ x^2}\) is the Japanese bracket.
For \(x_0 \in {{\mathbb {R}}}^3\), let \(\Omega _{x_0} :=\{ {\textbf{y}} ^\perp \in {{\mathbb {R}}}^{2(m-1)} :{\text {dist}}(x_0, L \partial \Lambda ) \le m\Vert {\textbf{y}}^\perp \Vert \}\), and thus \(V=\{(x_0,{\textbf{y}}) \in {{\mathbb {R}}}^{3m} :{\textbf{y}}^\perp \in \Omega _{x_0}\}\). We have
We need the estimate
which is proved in Appendix B. We also have the bound
Thus, we arrive at
Here, we used an \((1,\infty )\) Hölder estimate on the sets \(k \le {\text {dist}} (x_0,L\partial \Lambda )\le k+1\) for the integral over \({{\mathbb {R}}}^3\). We then enlarged these sets to the \((k+1)\)-neighborhood \(B_{k+1}(L \partial \Lambda )\), as their measures can be estimated more easily. Thus, using Lemma A.3 with \(d=2\) and \(r=k+1\), we arrive at
which was our claim. \(\square \)
In the next step, we accomplish the
Proof of Theorem 2.3
As the expression is linear in f, it suffices to consider monomials \(f(t) = t^m\) with integer \(m\ge 1\). In the special case \(m=1\), we just use (2.8), (2.5), and (2.6) to see
As \({\textsf{I}}(1)={\textsf{I}}(id)=0\), this covers the case \(m=1\) and we may from now on assume \(m\ge 2\).
Our first aim is to understand the open sets \(\Lambda _{x^\perp }\). This is essentially a question about the nature of the sets \(\Lambda \). There are some results to choose from, so let us take a look. Due to Lemmas A.6 and A.8, for Lebesgue almost every \(x^\perp \in {{\mathbb {R}}}^2\), the set \(\Lambda _{x^\perp }\) is a finite union of disjoint intervals, \(\partial \left( \Lambda _{x^\perp } \right) =(\partial \Lambda )_{x^\perp }\), and \(\#(\partial (\Lambda _{x^\perp }))\) is twice the number of these intervals. Henceforth, we set \(\partial \Lambda _{x^\perp } :=\partial \left( \Lambda _{x^\perp }\right) \). The (improved) asymptotic expansion goes back to Landau and Widom [19] and is presented in Appendix C, see Corollary C.3. The coefficient \(\textsf{I}(m) = -1/(4\pi ^2)\,\sum _{r=1}^{m-1}r^{-1}\) is mentioned below (2.15).
For fixed \(\Lambda _{x^\perp }\), the error term \(\varepsilon ( \Lambda _{x^\perp }, L) \) remains bounded as \(L \rightarrow \infty \). However, we need to know, whether this error term is integrable over \(x^\perp \). Thus, the dependency on \(\Lambda _{x^\perp }\) is relevant.
To derive the \(o(L^2 \ln (L))\) error term, we subtract the volume term, divide by \(L^2 \ln (L)\) and use dominated convergence in order to exchange the limit \(L \rightarrow \infty \) with the integral over \(x^\perp \). Thus, instead of an estimate for the error term that is of a lower order in L than \(\ln (L)\), we only need an upper bound for the difference to the volume term, which is of order \(\ln (L)\). This upper bound is provided by Lemma 6.1. As any interval in \(L \Lambda _{x^\perp }\) has length at most CL, we arrive at
where the constant C depends on m, \(\mu (\ell )\) and \(\Lambda \), but not on \(x^\perp \). With this estimate, we apply dominated convergence to get
We moved the sum over \(\ell \) to the front, as every summand converges as \(L \rightarrow \infty \). In the second line we used that \(\int _{{{\mathbb {R}}}^2} \text{ d } x^\perp \,|\Lambda _{x^\perp }| = |\Lambda |\). Finally, we inserted (A.44) to obtain the expansion with error term \(o(L^2\ln (L))\) as claimed in the theorem.
For the second part, we need to show that the error term for polynomials can be bounded by \(CL^2\), if \(\Lambda \) is a piecewise \(\textsf{C}^{1,\alpha }\) region for some \(0< \alpha <1\), as defined in Definition 2.1. This time, we use Corollary C.3 to deal with the trace of the one-dimensional operator. For that, we arrange each \(\partial \Lambda _{x^\perp }:=(\partial \Lambda )_{x^\perp }=\{w_{x^\perp 1}, \dots , w_{x^\perp \#( \partial \Lambda _{x^\perp })}\}\subset {{\mathbb {R}}}\) in the order of increasing third components and write
In the last step, we used that the distance between any two points in \(\partial \Lambda \) is bounded from above, as \(\Lambda \) is bounded to conclude that only short distances \(|v- v_i|\) can lead to an error term larger than the \(O(\#( \partial \Lambda _{x^\perp }))\)-term we have in front. A lower bound for the infimum is provided by Lemma A.1. This bound is zero in some cases, which leads to the logarithm being infinite. This just means that our integrand in the integral over \(x^\perp \) attains infinity. The integral can still exist and we will show that it does.
As the terms of order \(L^3\) and \(L^2\ln (L)\) work just like in the previous case, we will only consider the error term. Hence, we need to estimate
In the first step, we applied Lemma A.1 with the vectors \(v_1 :=(x^\perp ,w_{x^\perp i})\) amd \(v_2 :=(x^\perp , v)\) noting that \(\frac{v_1-v_2}{\Vert v_1 -v_2\Vert } = \pm e_3\). We now want to rewrite this integral as an integral over the boundary \(\partial \Lambda \). This is possible by Lemma A.7. Hence, we have (recall that \({\mathcal {H}}^2\) is the canonical surface measure on \(\partial \Lambda \)),
In the second step, we used that \(0 \le |n(w) \cdot e_3 |\le 1\) and that for \(0 \le t \le 1\), we have \( 0\le |t \ln (t) |\le 1/\textrm{e}\). The last step is a rather lengthy, not particularly insightful calculation, which can be found in Lemma A.9.
Once we put the factor \(L^2\) back in front of this, we arrive at the error term \(O(L^2)\) which completes the proof of the second part of this theorem. \(\square \)
4 Entanglement Entropy
Here is the main result of this paper.
Theorem 4.1
Suppose that \(\Lambda \subset {{\mathbb {R}}}^3\) is a piecewise Lipschitz region and let \(\mu > B\). Let \(\nu :=\lceil \frac{1}{2}(\mu /B -1)\rceil \) and let \(h:[0,1] \rightarrow {{\mathbb {R}}}\) be a continuous function, which is \(\beta \)-Hölder continuous at 0 and 1 for some \(1 \ge \beta >0\), and assume that \(h(0)=h(1)=0\). Then, we have the asymptotic expansion
In particular, as the \(\gamma \)-Rényi entropy function \(h_\gamma \) is \(\beta \)-Hölder continuous for any \(\beta < \min (\gamma ,1)\), the \(\gamma \)-Rényi entanglement entropy, \(\mathrm S_\gamma (L\Lambda )\), of the ground state at Fermi energy \(\mu \) localized to \(L\Lambda \), satisfies the asymptotic expansion
as \(L\rightarrow \infty \).
Remarks 4.2
-
(1)
Unlike in Theorem 2.3, we cannot improve the \(o(L^2\ln (L))\) error term in (4.2) if we assume stronger regularity conditions on the boundary of \(\Lambda \). This is a limitation of our method of proof, which relies on the Stone–Weierstrass approximation. Here, we lose control of the error term.
-
(2)
Let us compare (4.2) to the asymptotic expansion of the entanglement entropy of the ground state at Fermi energy \(\mu >0\) in the ideal Fermi gas, as introduced in Remark 2.4(ii). Here, the \(\gamma \)-Rényi entanglement entropy satisfies
$$\begin{aligned} \mathrm S_\gamma (L\Lambda ) = L^2\ln (L) \frac{\mu (1+\gamma )}{48\gamma \pi }\mathcal H^{2}(\partial \Lambda ) + o(L^2\ln (L)) . \end{aligned}$$ -
(3)
To the best of our knowledge, the result (4.2) is new, even in the physics literature. The factor \(\nu B\) satisfies \(\nu B = \mu /2 +(\delta -\frac{1}{2})B\) for some \(\delta \in [0,1)\). We can bound the surface integral in (4.2) by \({\mathcal {H}}^{2}(\partial \Lambda )\) due to \(|n(v)\cdot e_3|\le 1\). Let us set \(\mu _{\text {ref}} :=2\nu B \). This corresponds to the same number \(\nu \) of Landau levels as the original \(\mu \), and seems to be a suitable reference value for comparing the Landau Hamiltonian with the free Hamiltonian. Thus the entanglement entropy associated to the Landau Hamiltonian is always smaller than the one associated to the free Hamiltonian at the reference value \(\mu _{\text {ref}}\).
We use certain estimates on traces. To this end, let us denote by \(s_n({T}), n\in {\mathbb {N}}\), the singular values of the compact operator T on a (separable) Hilbert space, arranged in decreasing order. The standard notation \({\mathfrak {S}}_p, 0<p<\infty \) is used for the class of operators with a finite Schatten–von Neumann quasi-norm:
If \(p\ge 1\), then \(\Vert \cdot \Vert _p\) defines a norm. For \(0< p < 1\) it is a quasi-norm that satisfies the p-triangle inequality
The class \({\mathfrak {S}}_1\) is the standard trace-class. The class \({\mathfrak {S}}_2\) is the ideal of Hilbert–Schmidt operators. The p-Schatten quasi-norm estimate required for this proof is shown in Theorem 5.5.
Proof of Theorem 4.1
The proof goes along the same line of arguments as presented in [20, 21]. We recall that \(\textsf{I}(h_\gamma ) = (1+\gamma )/(24\gamma )\) and thus we are left to show the claim for the function h. Let \(r = \beta /2\) and \(\varepsilon >0\). We choose a smooth cutoff function \(\zeta _\varepsilon \) such that \(0\le \zeta _\varepsilon \le 1\) and such that \(\zeta _\varepsilon \) vanishes on \([\varepsilon ,1-\varepsilon ]\) and equals 1 on \([0,\varepsilon /2] \cup [1-\varepsilon /2,1]\). As h is continuous and \(\beta \)-Hölder continuous at 0 and 1, there is a constant C such that
This implies
As the function \(t \mapsto \frac{(1-\zeta _\varepsilon (t) )h(t) }{t(1-t)}\) is continuous, we can infer from the Stone–Weierstrass approximation theorem that there is a polynomial p and a function \(\delta _\varepsilon :[0,1] \rightarrow {{\mathbb {R}}}\) with \(\Vert \delta _\varepsilon \Vert _{{\textsf{L}}^\infty ([0,1])} \le \varepsilon ^r \) and
Thus, we have
As \(t(1-t) \le t^r (1-t)^r\), we observe
Thus, using Theorem 5.5, (2.7) and (4.3), we arrive at
In (4.10), we used that \(\mathrm D_\mu \) is a projection. Let \(q(t) :=p(t)t(1-t)\). Now, by linearity of \(\textsf{I}\) and the estimate (4.8), we have
Theorem 2.3(i) applied for the polynomial q with \(q(0)=q(1)=0\) yields
Now, combining (4.13), (4.14) and (4.15), we arrive at
As \(\varepsilon >0\) is arbitrary, we have proved the claim. \(\square \)
5 Schatten–von Neumann Quasi-Norm Estimates
By a box in \({\mathbb {R}}^d\), we mean a Cartesian product of d intervals. These intervals do not have to be bounded. We will denote subsets of \({\mathbb {R}}\) by I, of \({\mathbb {R}}^2\) by \(\Upsilon \), and of \({\mathbb {R}}^3\) by \(\Lambda \). We will combine known estimates for the two-dimensional magnetic Hamiltonian from [21] and for the one-dimensional Hamiltonian [20, 34] without a magnetic field.
Let \(\Upsilon ,\Upsilon '\subset {\mathbb {R}}^2\) be Lipschitz regions and let \(I,I' \subset {\mathbb {R}}\) be finite unions of closed intervals. Then, we have
As the singular values of the tensor product of two operators are given by all possible products of pairs of the individual singular values, we have for any \(0<p \le \infty :\)
The following general properties will be useful:
Lemma 5.1
For any self-adjoint bounded operators \(S,T:\textsf{L}^2({\mathbb {R}}^d) \rightarrow \textsf{L}^2({\mathbb {R}}^d)\), any measurable sets \(\Omega _1,\Omega _2\), \(\Omega _1',\Omega _2' \subset {\mathbb {R}}^d\) and any \(0<p \le 1\), we have
-
Symmetry
\( \left\Vert \mathbb {1}_{\Omega _1} T \mathbb {1}_{\Omega _2} \right\Vert _p = \left\Vert \mathbb {1}_{\Omega _2} T \mathbb {1}_{\Omega _1} \right\Vert _p\),
-
Monotonicity I
\( \left\Vert \mathbb {1}_{\Omega _1} T \mathbb {1}_{\Omega _2} \right\Vert _p \le \left\Vert \mathbb {1}_{\Omega _1 \cup \Omega _1'} T \mathbb {1}_{\Omega _2\cup \Omega _2'} \right\Vert _p\),
-
Monotonicity II
If \(0 \le S \le T\), then \(\Vert S \Vert _p \le \Vert T \Vert _p\),
-
Subadditivity
\(\left\Vert \mathbb {1}_{\Omega _1 \cup \Omega _1'} T \mathbb {1}_{\Omega _2} \right\Vert _p^p \le \left\Vert \mathbb {1}_{\Omega _1} T \mathbb {1}_{\Omega _2} \right\Vert _p^p + \left\Vert \mathbb {1}_{\Omega _1'} T \mathbb {1}_{\Omega _2} \right\Vert _p^p\).
A proof of these properties can be found, for example, in [29].
We assume now that the magnetic-field strength has been “scaled out” so that \(B=1\) for the remainder of this section. The effective scale in the planar coordinates is \(L\sqrt{B}\) and in the perpendicular it is \(L\sqrt{\mu }\).
Next, we collect some more specific (quasi-)norm estimates for both the one dimensional free Hamiltonian and the constant magnetic field Hamiltonian in two dimensions.
Proposition 5.2
Let \(0< p \le 1, \ell \in {\mathbb {N}}_0\) and let \(\mu > 0\). Then, there is a constant C such that for any \(x \in {\mathbb {R}}^2, t \in {\mathbb {R}}, h\ge 2, \delta \ge 1 \), any measurable set \(\Upsilon \subset {{\mathbb {R}}}^2 \) such that \([-\delta ,1+\delta ]^2+x \subset \Upsilon \) and any measurable set \(I\subset {{\mathbb {R}}}\) such that \([t,t+h] \subset I\), we have the estimates
Proof
The first two inequalities follow by [21, Lemma 12], monotonicity I in Lemma 5.1, and the unitary translation invariance of \(\textrm{P}_\ell \). The 8 in the denominator was increased to 18 in (5.4) as we switched from circles to squaresFootnote 2. To prove the last inequality, we first use monotonicity I and the translation invariance, then the standard unitary equivalence, see, for example, [19, (7–10)], and finally [34, Corollary 4.7]. Thus,
For the third inequality, we will reduce to the case \(h=2, t=0\) by subadditivity, monotonicity I and translation invariance. Let \(m :=\lceil h/2 \rceil \in {\mathbb {N}}\) be the smallest integer larger or equal to h/2. Thus, as \(h\ge 2\), we have \(m \le h\). We observe that
Using subadditivity once more we now estimate
The second summand was bounded by (5.6), and the last quasi-norm identity is derived by the singular value identity \(s_n(A)^2=s_n(A^*A)\). Define \(Q :=\mathbb {1}_{[0,2]} \mathbb {1}[(-\textrm{i}\nabla ^\parallel )^2 \le \mu ]\). Our claim is \(Q \in {\mathfrak {S}}_p\) for all \(0<p \le 1\). The last estimate shows that \(Q \in {\mathfrak {S}}_p\), if \(Q \in {\mathfrak {S}}_{2p}\) for \(p \le 1\). We now observe
Thus, we have \(Q \in {\mathfrak {S}}_2\) and hence \(Q \in \mathfrak S_{2^{1-n}}\) for any \(n \in {\mathbb {N}}\). Lastly, as \({\mathfrak {S}}_p \subset {\mathfrak {S}}_q\) for \(p<q\), we arrive at \(Q \in {\mathfrak {S}}_p\) for any \(0<p \le \infty \), which finishes the proof. \(\square \)
After all these preparations we finally state the crucial local estimates that are needed in the proof of Theorem 5.5.
Lemma 5.3
Let \(0< p \le 1\). Then, there is a constant C such that for any \(x \in {\mathbb {R}}^2, t \in {{\mathbb {R}}}, h\ge 2\), \(\delta \ge 1\), any measurable \(\Upsilon \subset {{\mathbb {R}}}^2\) such that \([-\delta ,1+\delta ]^2+x \subset \Upsilon \) and any interval \(I\subset {{\mathbb {R}}}\) such that \([t,t+h] \subset I \), we have the estimates
Proof
For the first inequality, we use (5.2) and Proposition 5.2. For the second inequality, we first observe
and then we use the p-triangle inequality, (5.2) and Proposition 5.2. \(\square \)
We now fix a region \(\Lambda \subset {\mathbb {R}}^3\) and define the signed distance function
where \({\text {dist}}\) is the Euclidean distance. The signed distance function is Lipschitz-continuous with Lipschitz constant 1.
In order to utilize Lemma 5.3, we need to essentially cover \(L \Lambda \) with a lot of very long boxes (of dimensions \(1 \times 1 \times O(L)\)). This boils down to choosing appropriate intervals that cover most of \(\Lambda _x\) (as defined in Definition 3.1), for any \(x \in {{\mathbb {R}}}^2\). Let \(G(x,\varepsilon )\) be the number of these intervals. The following lemma explicitly constructs such intervals and lists the properties that \(G(x,\varepsilon )\) and the intervals satisfy, which we need for our estimates. The basic idea is to collect connected components of \(\Lambda _x\), which go sufficiently deep inside \(\Lambda \).
Lemma 5.4
For any \(x \in {{\mathbb {R}}}^2\) and \(\varepsilon >0\), there is a finite (possibly empty) set of intervals \(A(x,\varepsilon )=\{ I_{1,x,\varepsilon }, \dots , I_{G(x, \varepsilon )),x,\varepsilon } \}\), satisfying the following conditions:
-
(1)
We have \({\text {d}}_\Lambda (I_{k,x,\varepsilon }) \subset (-\infty , -\varepsilon )\) and \({\text {dist}}(I_{k,x,\varepsilon }, \partial \Lambda )= \varepsilon \).
-
(2)
For any \(\lambda \in \Lambda _x\), there exists a j with \(1 \le j \le G(x,\varepsilon ) :\lambda \in I_{j,x,\varepsilon }\), or \({\text {d}}_\Lambda ((x,\lambda ))>-2\varepsilon \).
-
(3)
We have \(G(x, \varepsilon )= \# A(x,\varepsilon ) \le {\mathcal {H}}^1\left( {\text {d}}_\Lambda ^{-1}((-2 \varepsilon , -\varepsilon )) \cap (\{x \} \times {{\mathbb {R}}})\right) /\varepsilon \).
The signed distance function \({\text {d}}_\Lambda \), dependent on the piecewise Lipschitz region \(\Lambda \), is defined in (5.21).
We regard the lemma and its proof as the definitions of \(A(x,\varepsilon ),I_{j,x,\varepsilon }\) and \(G(x,\varepsilon )\).
Proof
We consider the set \(A_0(x,\varepsilon )\) of all connected components of \(\left( {\text {d}}_\Lambda ^{-1}((-\infty ,-\varepsilon )) \right) _x \subset {{\mathbb {R}}}\) (with the convention that the empty set has no connected components). The set \(A(x,\varepsilon )\) is defined as the set of all \(I \in A_0(x,\varepsilon )\), such that there is a \(\lambda \in I\) with \({\text {d}}_\Lambda ((x,\lambda )) \le -2 \varepsilon \). The first point is already satisfied for all \( I \in A_0(x,\varepsilon )\) and thus holds for all I in the smaller set \(A(x,\varepsilon )\). For the second claim, we observe that if \(\lambda \in \Lambda _x\) with \({\text {d}}_\Lambda ((x,\lambda )) \le -2 \varepsilon \), then \(\lambda \in \left( {\text {d}}_\Lambda ^{-1}((-\infty ,-\varepsilon )) \right) _x \) and thus there is an \(I \in A_0(x,\varepsilon )\) with \(\lambda \in I\). By definition of \(A(x,\varepsilon )\), this ensures \(I \in A(x,\varepsilon )\).
For the third claim, if \(A(x, \varepsilon ) \ne \emptyset \), let \(I =(\lambda _1, \lambda _4) \in A(x, \varepsilon )\) and define \(\lambda _2 :=\inf \{ \lambda \in I :{\text {d}}_\Lambda ( (x, \lambda ) \le -2 \varepsilon \}, \lambda _3 :=\sup \{ \lambda \in I :{\text {d}}_\Lambda ( (x, \lambda ) \le -2 \varepsilon \}\). Thus, \(\lambda _1< \lambda _2<\lambda _3<\lambda _4\),
and, as \({\text {d}}_\Lambda \) has Lipschitz constant 1, this means that
As \((\lambda _1, \lambda _2) \subset I\), \((\lambda _3, \lambda _4) \subset I\) and different elements of \(A(x,\varepsilon )\) are disjoint (as they are connected components), we can sum the inequality over all elements of \(A(x, \varepsilon )\) and arrive at
which implies the last claim (with an additional factor 1/2). \(\square \)
Theorem 5.5
Let \(\Lambda \) be a piecewise Lipschitz region (see Definition 2.1) and let \(0<p \le 1, \ell \in {\mathbb {N}}, \mu \in {{\mathbb {R}}}^+\). Then, there are constants \(L_0=L_0(\Lambda ,p,\ell , \mu ) > 3\) and \(C=C(\Lambda ,p,\ell , \mu )\) such that for all \(L>L_0\)
Proof
We want to cover most of \(L\Lambda \) with translates of cubes \([0,1]^2\times [0,h]\), where h grows like L and will use Lemma 5.3 on these.Footnote 3 We set \(\delta :=6 p^{-1/2} \sqrt{ \ln (L)}\). Let \(L_0\) be large enough to ensure that \(\delta \ge 1 \) for \(L>L_0\). Hence, these cubes need to keep a distance of at least \(\delta \) from the boundary. Set \(\varepsilon :=\frac{ 2(\delta +1)}{L}\). We also define the shorthand
Let \(h_0\) be the length of the longest straight line contained in \(\Lambda \).
Consider any \(x \in {\mathbb {R}}^2\) with \(G(x, \varepsilon )\ge 1\), as defined in Lemma 5.4. For \(k \in \{1, \dots , G(x,\varepsilon )\}\), we define the boxes
These inclusions hold because \(\sqrt{2}< \sqrt{2} (\delta +1) <L \varepsilon =L {\text {dist}} ( \{ x \} \times I_{k,x,\varepsilon } , \partial \Lambda ) = {\text {dist}} ( \{ L x \} \times (L I_{k,x,\varepsilon }) , L \partial \Lambda )\).
We assume \(L>h_0\), \(Lh_0>1\) and \(L>2\). Now we have by monotonicity I in Lemma 5.1 and by Lemma 5.3
The constant C depends only on p and \(h_0\).
Now we consider some offset parameter \(s \in [0,1)^2\), and we define
The inclusion is based on the fact that for each \(y \in {{\mathbb {R}}}^3\), there is a \(\frac{z+s}{L} \in {\mathbb {R}}^2\) with \(z\in {\mathbb {Z}}^2\) such that \(y \in \left( \frac{z+s}{L} + \frac{1}{L}[0,1]^2 \right) \times {\mathbb {R}}\). If \({\text {d}}_\Lambda (y) \le -3 \varepsilon \), then the point \(\left( \frac{z+s}{L} , y_3 \right) \) is at most \(\frac{\sqrt{2}}{L} < \varepsilon \) away from y and hence at least \(2\varepsilon \) away from the boundary. Therefore, there is a k such that \(y \in \frac{1}{L}Q_{\frac{z+s}{L},k}\).
We further define
so that \(L{\text {d}}_\Lambda ^{-1}\left( \left( - 3\varepsilon ,0 \right) \right) \subset \bigcup _{{u} \in Z_\varepsilon } \left( {u}+ [0,1]^3\right) \). As \(\varepsilon >\frac{\sqrt{3}}{L}\), the length of the diagonal in a cube \(\frac{1}{L} [0,1]^3\), we have (second inclusion)
Hence, the volume of the middle term, which is the cardinality, \(\# Z_\varepsilon \), of \(Z_\varepsilon \), can be bounded by the volume of the right-hand side. For \(\varepsilon <1\), using Lemma A.3, this is bounded by \(L^3C(\Lambda ) \varepsilon \). Hence for \(L > L_0\):
Using the monotonicity I and subadditivity properties in Lemma 5.1 and the covering (5.32), we can finally estimate,
The summands in the first sum can be bounded by \(C\ln (L)\) using (5.31). The second term will be bounded using monotonicity I, (5.32) and (5.34) in the first step and using monotonicity II, subadditivity, (5.18) and (5.35) in the second step. Hence, we have
For any fixed \(L>L_0\) and \(s \in [0,1)^2\), this is finite. Hence, we can integrate this over \(s \in [0,1)^2\) and get a different upper bound. As the volume of \([0,1)^2\) is 1, the left-hand side and the last term do not change, as it is an integral over a constant in both cases.
Now we can use Fubini on the product \({\mathbb {Z}}^2 \times [0,1)^2= {\mathbb {R}}^2\). Hence we have
In the first step, we did a change of variables, in the third step we used Lemma 5.4, in the last but one step Fubini and in in the final step we applied (A.9). \(\square \)
6 The Error Term can be Large and not Smaller than \(o(L^2\ln (L))\)
Without loss of generality, we assume throughout this section that \(\nu =1\) and \(B=1\) because the precise values are not relevant now. The non-asymptotic bound in the following lemma is simple and useful in the proof of the main theorem in this section.
Lemma 6.1
Let \(\Omega \subset {\mathbb {R}}\) be a finite union of intervals of finite lengths \(\ell _1, \dots , \ell _n\) with disjoint closures. Let \(m \in {\mathbb {N}}\) with \(m \ge 2\), \(\mu >0\), and \(\Delta = \textrm{d}^2/\textrm{d}^2 x\) the one-dimensional Laplacian. Then, we have the estimate
where C is an entirely independent constant.
For \(m=2\), this estimate is sharp in the sense that the prefactor \(1/\pi ^2\) equals the coefficient of the leading asymptotic behavior of \(\textrm{tr}\,(\mathbb {1}_{L\Omega } \mathbb {1}(- \Delta \le \mu )\mathbb {1}_{L\Omega })^2\) for large L.
Proof
By scaling we can assume \(\mu =1\) since \(\mathbb {1}_{\Omega } \mathbb {1}(- \Delta \le \mu )\) is unitarily equivalent to \(\mathbb {1}_{\sqrt{\mu }\Omega } \mathbb {1}(- \Delta \le 1)\). In other words, we may set \(\mu =1\) and eventually replace the lengths \(\ell _j\) by \(\sqrt{\mu }\ell _j\).
Then, we use the geometric series \(a^m-a=a(a-1) (a^{m-2} + \cdots + a+1)\) with \(a :=\mathbb {1}_\Omega \mathbb {1}(- \Delta \le 1)\mathbb {1}_\Omega \). As a has on operator norm of at most 1, we can estimate
with the function \(k=k_1\) defined in (2.6).
For a fixed \(x \in \Omega \), we now enlarge the domain of integration in y by allowing \(y \in \Omega \), as long as x and y are in different intervals in \(\Omega \). In a formula, with \(\pi _0(\Omega )\) denoting the connected components (subintervals) of \(\Omega \), the new domain of integration in (6.2) is
As the integrand only depends on \(x-y\), we may translate I to be of the form \((0,\ell _j)\). Hence, with \(n:=\#\pi _0(\Omega )\) the number of connected components of \(\Omega \), we have
The last step relies on an improved result of Landau and Widom with \(L=1\), see Corollary C.3. \(\square \)
In Theorem 2.3, we obtained for a general Lipschitz region \(\Lambda \subset {{\mathbb {R}}}^3\) an error term \(o(L^2 \ln (L))\) and not of the order \(L^2\). Specifically, using \(\textsf{I}(2) = -1/(4\pi ^2)\), we have the asymptotic expansion
This allows us to define the error term \(\varepsilon (L,\Lambda )\) by the identity
In this notation, Theorem 2.3 states that \(\lim _{L \rightarrow \infty }\varepsilon (L,\Lambda )=0\) for a piecewise Lipschitz region \(\Lambda \) and we have \(\sup _{L \ge 2} |\varepsilon (L, \Lambda )|\ln (L) < \infty \), if \(\Lambda \) is a piecewise \(\textsf{C}^{1,\alpha }\) region. The main result of this section, which is the next theorem, shows that the estimate for Lipschitz regions is sharp and the error term can be large and just \(o(L^2\ln (L))\). The negative sign in front of the error term does not necessarily mean that it has a definite sign although in our example it will be. Although our result only deals with the error term for the simplest, non-trivial polynomial, namely \(t\mapsto t(1-t)\), we believe that also for the entropy the error term can be as large and only \(o(L^2\ln (L))\) for a Lipschitz region.
Theorem 6.2
Let \(\varphi :{{\mathbb {R}}}^+ \rightarrow {{\mathbb {R}}}^+\) be a bounded function with \(\lim _{L \rightarrow \infty } \varphi (L)=0\). Then, there is a piecewise Lipschitz region \(\Lambda \) and an \(L_0\) such that for any \(L \ge L_0\), the error term defined in (6.10) satisfies
Remark 6.3
Let \({\mathcal {A}}\) be the subset of the space of all polynomials vanishing at 0 and 1 such that the error term in Theorem 2.3 for \(f \in {\mathcal {A}}\) is of order \(O(L^2)\) for any Lipschitz domain \(\Lambda \). This is clearly a linear subspace and the theorem tells us that \(t \mapsto t(1-t) \not \in {\mathcal {A}}\). Thus, the subspace has at least codimension one, which means that it satisfies (at least ) one linear constraint. We conjecture that this constraint might be \(f \in {\mathcal {A}} \implies \textsf{I}(f)=0\). That is, the error term can only achieve the order \(O(L^2)\), if the leading term of order \(L^2 \ln (L)\) vanishes.
Proof of Theorem 6.2
We begin with a non-negative, summable sequence \((a_i)_{i \in {\mathbb {N}}}\) with \(\sum _{i \in {\mathbb {N}}} a_i=1\), which we will choose later. Let \(g_0 : [0,1] \rightarrow {{\mathbb {R}}}^+\) be the zigzag function defined by \(g_0(0)=1\) and for \(t>0\),
If \(j=1\), then we use the convention that \(\sum _{j<1}a_j :=0\). Clearly, \(g_0\) is Lipschitz continuous with Lipschitz bound 1. We expand \(g_0\) to \([-1,2]\) by setting \(g_0(t)=t+1\) for \(t < 0\) and \(g_0(t)=2-t\) for \(t>1\). This extension is still Lipschitz continuous and satisfies \(g_0(-1)=g_0(2)=0\). Now, we can define the region \(\Lambda \),
This clearly defines a piecewise Lipschitz region. We will now sketch why this is even a strong Lipschitz domain (See [2, Pages 66–67] for the definition.)
For any \(x_0 \in B_{1/ (2\sqrt{2})} (1/2,0,1/2)\), the region \(\Lambda \) is star shaped with respect to \(x_0\). For the definition of a strong Lipschitz domain, we need to choose an open cover of \(\partial \Lambda \) and a projection with a certain direction on every set of the cover. For any orthogonal (rank 2) projection \(\pi :{{\mathbb {R}}}^3 \rightarrow {{\mathbb {R}}}^2\), on the two connected components of the set \(\pi ^{-1} ( \pi (B_{1/ (4\sqrt{2} )} (1/2,0,1/2))) \cap \partial \Lambda \), one can define the chart as the inverse of \(\pi \), which has a Lipschitz constant less than 10. This leads to an open cover of \(\partial \Lambda \) and one can then choose a finite subcover.
The boundary \(\partial \Lambda \) can be covered by the sets \(\partial _1 \Lambda :=\big \{x \in \partial \Lambda :x_1 \in \{0,1\}\big \}\) and \(\partial _2 \Lambda :=\big \{ x \in \partial \Lambda :x_1 \in [0,1]\big \}\). These two boundary sets have a non-empty intersection, but \(\partial _1 \Lambda \cap \partial _2 \Lambda \) is a “one-dimensional” set with two-dimensional Hausdorff measure zero, that is, \(\mathcal H^2(\partial _1 \Lambda \cap \partial _2 \Lambda )=0\).
For almost every \(x \in \partial _1 \Lambda \), the outward normal vector n(x) is given by \(\pm e_1\), while for almost every \(x \in \partial _2 \Lambda \), the outward normal vector is given by \(\frac{1}{\sqrt{2}} (\pm e_2 \pm e_3)\); the vectors \({e_1,e_2,e_3}\) are the usual unit vectors in the positive \(x_1,x_2,x_3\) directions. Hence, we observe
It is important that \({{\mathcal {H}}^2}( \partial \Lambda ) = {\mathcal H^2}( \partial _1\Lambda ) + {{\mathcal {H}}^2}( \partial _2\Lambda )\) is bounded independently of the sequence \((a_i)_{i \in {\mathbb {N}}}\) and that the surface integral in (6.16) is completely independent of the sequence.
The leading asymptotic term for the trace on the left-hand side of (6.10) is provided by Theorem 2.3. Here, \(\textsf{I}(2) = -1/(4\pi ^2)\) and hence
where we used (6.16).
We need an upper bound for the constant \({\mathcal {K}}( \Lambda )\) defined in Lemma A.3, which is independent of the function g. We observe that \(\partial _1 \Lambda \) is the image of the Lipschitz functions \(f_j :[0,1]^2 \rightarrow \partial _1\Lambda ; (x_1,x_2) \mapsto (j,3x_1-1,g(3x_1-1)x_2)\) for \(j=0,1\) and \(\partial _2 \Lambda \) is in the image of the Lipschitz functions \(\tilde{f}_\pm :[0,1]^2 \rightarrow \partial _2 \Lambda ; (x_1,x_2) \mapsto (x_1,3x_2-1,\pm g(3x_2-1))\). Thus, the set \(\{f_0,f_1\tilde{f}_+, \tilde{f}_-\}\) defines a piecewise Lipschitz atlas of \(\partial \Lambda \). Hence, as \(C_{\text {lip}}(f_j)=3 \sqrt{2}\) and \(C_{\text {lip}}(\tilde{f}_\pm )=3\), we observe
Hence, by Lemma 3.2, we have
with \(x^\perp =(x_1,x_2)\). To get to the polynomial \(f(t)=t(1-t)\) we have to subtract this term with \(m=2\) from the term with \(m=1\). Now, we intend to use Lemma 6.1. To do so, we need to describe the lengths of the (sub)intervals of \(\Lambda _{x^\perp }\) depending on \(x^\perp =(x_1,x_2)\).
We can ignore the case \(x_1 \in \{0,1\}\), as this is a null set with respect to the Lebesgue measure on \({{\mathbb {R}}}^2\). If \(x_1 \in (0,1)\) and \(|x_2 |\le 1\) then the set \(\Lambda _{x^\perp }\) is a single interval of length \(\ell _1(x_2)=3-2 |x_2 |\). The interesting case is \(x_1 \in (0,1)\) and \(1< |x_2 |<2\). Here, for any \(i \in {\mathbb {N}}\) with \(a_i>2(|x_2 |-1)\), there is an interval of size \(\ell _i(x_2)= a_i - 2 (|x_2 |-1)\), as illustrated in Equation (1). For any \(|x_2 |>1\), this will only lead to finitely many intervals, as the sequence \((a_i)_{i \in {\mathbb {N}}}\) is a null sequence. Now, we apply (6.20) for \(m=1\) and \(m=2\), and then Lemma 6.1 and see that
In the second step, we also used that the set \(\Lambda _{x^\perp }\) is independent of \(x_1 \in (0,1)\). The third step uses Fubini to exchange the sum and the integral and then transforms the integration variable to \(t :=\frac{1}{2} a_i+1- x_2\). The lower bound 0 in the last integral stems from the condition \(a_i>2(x_2-1)\), respectively, from \(t>0\).
We intend to show that this upper bound is significantly smaller than the known asymptotics. The difference between the asymptotics and this upper bound can then be used as a lower bound for the error term. This is why it is very important that the coefficient in front of the upper bound is equal to the asymptotic coefficient and is thus the reason why we can only do this here for the polynomial \(f(t)=t(1-t)\).
We now allow our constants to depend on \(\mu \) (for general \(\nu \ge 1\), they depend on all values of \(\mu (\ell )\)) and use the trivial inequality \(\ln (1+ab) \le \ln (1+a) + \ln (1+b)\) for \(a,b\ge 0\) to arrive at
or equivalently,
In the first step, we used \(1 \le 3- x_2 \le 3\), and in the fourth step, we used \(\ln (1+L)\le \ln (L)+1\) for \(L \ge 1\) and \(\ln (1+\frac{1}{2} a_i L) \le \frac{1}{2} a_i L\).
Now we rewrite (6.10) and use (6.16) and (6.31) to obtain
The fourth step uses \(\sum _i a_i=1\) and the fifth step relies on \(L \ge 2, a_i \le 1\) to get \(\ln ( \frac{1}{L}+\frac{1}{2} a_i ) \le 0 \). In the last step, C changed. Now, we just need to find a good sequence \((a_i)_{i \in {\mathbb {N}}}\). To show our claim, it suffices to find a sequence \((a_i)_{i \in {\mathbb {N}}}\) such that
since then the quotient \(\varphi (L)/\varepsilon (L,\Lambda ) \le 2 \pi ^3\varphi (L)/\varepsilon _0(L)\rightarrow 0\) is less than 1 for large L, that is, \(\varepsilon (L,\Lambda )\ge \varphi (L)\ge 0\) for \(L\ge L_0\), where \(L_0\) is chosen below.
The construction of the sequence \(a_i\) relies on Lemma D.1, and we apply this Lemma with f as \(\varphi \). With the resulting function \(\textrm{Env}(\varphi )\) we define the sequence of real numbers \(a_i :=\textrm{Env}(\varphi )(i-1)-\textrm{Env}(\varphi )(i)\) for \(i \in {\mathbb {N}}\). As \(\lim _{L \rightarrow \infty } \textrm{Env}(\varphi )(L)=0\), we have \(\sum _{i \ge L} a_i=\textrm{Env}(\varphi )(L)\) and in particular \(\sum _{i \in {\mathbb {N}}}a_i=\textrm{Env}(\varphi )(0)=1\). As \(\textrm{Env}(\varphi )\) is non-increasing and convex, the \(a_i\) are non-negative and non-increasing. As the sequence defined this way is non-increasing and \(\sum _{i\in {\mathbb {N}}} a_i =1\), we have \(a_i \le \frac{1}{i} \sum _{j \le i}a_j \le \frac{1}{i}\). Hence, we know that \(i \ge L\) implies \(a_i \le \frac{1}{L}\). Thus, we have the estimate
Furthermore, as \(\textrm{Env}(\varphi )(L) \ge C/\sqrt{\ln (2+L)}\), for L large enough, we have
Hence, we conclude that
One choice of \(L_0\) could be that \(4\varphi (L) \le \textrm{Env}(\varphi )(L)\) for \(L>L_0\) is satisfied. Thus, by this and (6.37), there is an \(L_0 >0\) such that for any \(L>L_0\), we have
This finishes the proof. \(\square \)
Notes
A function f is bi-Lipschitz if there is a constant \(C_{\text {lip}} \in {{\mathbb {R}}}^+\) such that \(C_{\text {lip}}^{-1}\Vert x-y \Vert \le \Vert f(x)-f(y) \Vert \le C_{\text {lip}} \Vert x-y \Vert \). Such a function f is (obviously) invertible on its image and satisfies \(C_{\text {lip}}^{-1}\Vert x-y \Vert \le \Vert f^{-1}(x)-f^{-1}(y) \Vert \le C_{\text {lip}} \Vert x-y \Vert \).
The choice of the value \(18=3^2\times 2\) is convenient for this paper.
In \({{\mathbb {R}}}^2\) resp. \({{\mathbb {R}}}^3\).
See, e.g., [9, Theorem 3.2].
The values \(x_0\) and \(x_m\) only matter through \(x_0-x_m=0\). Thus, we can set both to 0.
As of August 25, 2022.
If \(ab\le 1\), then \(\ln (1+ab)\le ab\le 1\) and (C.25) holds. If \(ab\ge 1\), then we distinguish between the case that both \(a\ge 1\) and \(b\ge 1\) and the case where one of them is smaller than 1. In the first case (C.25) is equivalent to \(\ln (1+ab)-\ln (ab) = \ln (1+1/(ab))\le 1\) which holds because \(\ln (1+1/(ab))\le 1/(ab)\le 1\). In the remaining case, we may assume \(a\le 1\) and \(b\ge 1\) (but still \(ab\ge 1\)). Then, \(ab\le b/a\) and \(\ln (1+ab) = \int _1^{ab} \textrm{d}x/x \le \int _1^{b/a} \textrm{d}x/x =\ln (b/a)\).
References
Abdul-Rahman, H., Fischbacher, C., Stolz, G.: Entanglement bounds in the XXZ quantum spin chain. Ann. Henri Poincaré 21, 2327–2366 (2020)
Adams, R.A.: Sobolev Spaces. Academic Press, New York (1975)
Amico, L., Fazio, R., Osterloh, A., Vedral, V.: Entanglement in many-body systems. Rev. Mod. Phys. 80, 517–576 (2008)
Beaud, V., Warzel, S.: Bounds on the entanglement entropy of droplet states in the XXZ spin chain. J. Math. Phys. 59, 012109 (2018). https://doi.org/10.1063/1.5007035
Charles, L., Estienne, B.: Entanglement entropy and Berezin–Toeplitz operators. Commun. Math. Phys. 376, 521–554 (2020)
Eisert, J., Cramer, M., Plenio, M.B.: Area laws for the entanglement entropy—a review. Rev. Mod. Phys. 82, 277 (2010)
Elgart, A., Klein, A., Stolz, G.: Many-body localization in the droplet spectrum of the random XXZ quantum spin chain. J. Funct. Anal. 275, 211–258 (2018)
Elgart, A., Pastur, L., Shcherbina, M.: Large block properties of the entanglement entropy of free disordered Fermions. J. Stat. Phys. 166, 1092–1127 (2017)
Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. Revised Edition (1st edn.), Chapman and Hall/CRC (2015)
Fock, V.: Bemerkung zur Quantelung des harmonischen Oszillators im Magnetfeld. Z. Phys. 47, 446–448 (1928)
Fischbacher, C., Schulte, R.: Lower bound on the entanglement entropy of the XXZ spin ring. arXiv:2007.00735
Fischbacher, C., Ogunkoya, O.: Entanglement entropy bounds in the higher spin XXZ chain. J. Math. Phys. 62, 101901 (2021)
Gioev, D., Klich, I.: Entanglement entropy of fermions in any dimension and the Widom conjecture. Phys. Rev. Lett. 96, 100503 (2006)
Guillemin, V., Pollack, A.: Differential Topology. Prentice-Hall (1974)
Horodecki, R., Horodecki, P., Horodecki, M., Horodecki, K.: Quantum entanglement. Rev. Mod. Phys. 81, 865–942 (2009)
Jin, B.-Q., Korepin, V.E.: Quantum spin chain, Toeplitz determinants and the Fisher–Hartwig conjecture. J. Stat. Phys. 116, 79 (2004)
Klich, I.: Lower entropy bounds and particle number fluctuations in a Fermi sea. J. Phys. A Math. Gen. 39, L85 (2006)
Landau, L.: Diamagnetismus der Metalle. Z. Phys. 64, 629–637 (1930)
Landau, H.J., Widom, H.: Eigenvalue distribution of time and frequency limiting. J. Math. Anal. Appl. 77, 469–481 (1980)
Leschke, H., Sobolev, A.V., Spitzer, W.: Scaling of Rényi entanglement entropies of the free Fermi-gas ground state: a rigorous proof. Phys. Rev. Lett. 112, 160403 (2014)
Leschke, H., Sobolev, A.V., Spitzer, W.: Asymptotic growth of the local ground-state entropy of the ideal Fermi gas in a constant magnetic field. Commun. Math. Phys. 381, 673–705 (2021)
Leschke, H., Sobolev, A.V., Spitzer, W.: Trace formulas for Wiener–Hopf operators with applications to entropies of free fermionic equilibrium states. J. Funct. Anal. 273, 1049–1094 (2017)
Leschke, H., Sobolev, A.V., Spitzer, W.: Rényi entropies of the free Fermi gas in multi-dimensional space at high temperature. Oper. Theory Adv. Appl. 289, 477–508 (2022)
Müller, P., Pastur, L., Schulte, R.: How much delocalisation is needed for an enhanced area law of the entanglement entropy? Commun. Math. Phys. 376, 649–679 (2020)
Müller, P., Schulte, R.: Stability of the enhanced area law of the entanglement entropy. Ann. Henri Poincaré 21, 3639–3658 (2020)
Müller, P., Schulte, R.: Stability of a Szegő-type asymptotics. J. Math. Phys. 64, 022101 (2023)
NIST Digital Library of Mathematical Functions. In: Olver, F.W.J., Olde Daalhuis, A.B., Lozier, D.W., Schneider, B.I., Boisvert, R.F., Clark, C.W., Miller, B.R., Saunders, B.V., Cohl, H.S., McClain, M.A. (eds.). http://dlmf.nist.gov/. Release 1.1.5 of 2022-03-15
Pastur, L., Slavin, V.: Area law scaling for the entropy of disordered quasifree fermions. Phys. Rev. Lett. 113, 150404 (2014)
Pfeiffer, P.: On the stability of the area law for the entanglement entropy of the Landau Hamiltonian. arXiv:2102.07287
Pfirsch, B., Sobolev, A.V.: Formulas of Szegő type for the periodic Schrödinger operator. Commun. Math. Phys. 358, 675–704 (2018)
Rodríguez, I.D., Sierra, G.: Entanglement entropy of integer Quantum Hall states. Phys. Rev. B 80, 153303 (2009)
Sobolev, A.V.: Pseudo-differential operators with discontinuous symbols: Widom’s Conjecture. Mem. AMS 222, 1043 (2013)
Sobolev, A.V.: Wiener–Hopf operators in higher dimensions: the Widom conjecture for piece-wise smooth domains. Integr. Equ. Oper. Theory 81, 435–449 (2015)
Sobolev, A.V.: On the Schatten-von Neumann properties of some pseudo-differential operators. J. Funct. Anal. 266, 5886–5911 (2014)
Sobolev, A.V.: Quasi-classical asymptotics for functions of Wiener–Hopf operators: smooth vs non-smooth symbols. Geom. Funct. Anal. 27, 676–725 (2017)
Widom, H.: On a class of integral operators with discontinuous symbol. Toeplitz centennial (Tel Aviv, 1981), pp. 477–500, Operator Theory: Adv. Appl., 4, Birkhäuser, Basel-Boston, Mass (1982)
Widom, H.: On a class of integral operators on a half-space with discontinuous symbol. J. Funct. Anal. 88(1), 166–193 (1990)
Acknowledgements
We are grateful to the referees for a careful reading of the manuscript.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Alain Joye.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A. Some Geometric Results
Here, we assemble a few geometric statements that we used.
Lemma A.1
Let \(\Lambda \subset {{\mathbb {R}}}^{d+1}\) be a piecewise \(\textsf{C}^{1,\alpha }\) region for some \(0< \alpha <1\). Let \((\Psi _{\textrm{pC},i})_{i \in I}\) be a piecewise \(\textsf{C}^{1,\alpha }\) atlas of \(\partial \Lambda \) and \(\Gamma \) as defined in Definition 2.1. Then, there is a constant C depending only on \(\Lambda \) such that for all unequal \(v_1\) and \(v_2\) in \(\partial \Lambda \), we have
Remark A.2
The normal vector \(n(v_1)\) is well defined if \(v_1 \not \in \Gamma \). In the case \(v_1 \in \Gamma \), the minimum on the right-hand side is meant to be \(0={\text {dist}}(v_1, \Gamma )\), which turns it into a trivial statement.
Proof
We begin with the case \(v_1 \in \Gamma \) or \(v_2 \in \Gamma \); \(v_1\in \Gamma \) is explained in the above remark. If \(v_2\in \Gamma \), then we trivially have \(\Vert v_1 - v_2 \Vert \ge {\text {dist}}(v_1, \Gamma )\) and thus the claim holds for any \(C \le 1\).
Let us now consider the case that there is an \(i\in I\) such that both \(v_1\) and \(v_2\) are in \(\Psi _{\text {pC},i}((0,1)^d)\). As \(\Psi :=\Psi _{\text {pC},i}\) is injective, there are unique \(x_k\in (0,1)^d\) such that \(\Psi (x_k)=v_k\in {{\mathbb {R}}}^{d+1}\) for \(k=1,2\). We observe that \(n(v_1) \cdot D\Psi (x_1)=0 \in {{\mathbb {R}}}^d\), as the image of the matrix \(D\Psi (x_1)\) is the tangent space to \(\partial \Lambda \) at \(v_1\) and hence is orthogonal to the outward normal vector \(n(v_1)\). Thus, using (2.13), we see
As \(\Psi \) is bi-Lipschitz, we know that \(\Vert v_1- v_2 \Vert = \Vert \Psi (x_1)-\Psi (x_2) \Vert \ge C \Vert x_1 -x_2 \Vert \). Using this and dividing both sides by \(\Vert v_1-v_2\Vert \), we arrive at
We are now in the remaining case that \(v_1\) and \(v_2\) lie in different sets \(\Psi _{\text {pC},i}((0,1)^d)\) since \(\partial \Lambda = \Gamma \cup \bigcup _{i\in I} \Psi _{\text {pC},i}((0,1)^d)\).
Let \((\Psi _{\text {gL},j})_{j\in J}\) be a global Lipschitz atlas of \(\partial \Lambda \). As the boundary \(\partial \Lambda = \bigcup _{j\in J} \Psi _{\text {gL},j}((0,1)^d)\) is a cover by (relatively) open sets and \(\partial \Lambda \) is a compact metric space, by Lebesgue’s number lemma, there is an \(\varepsilon >0\) such that for all \(v \in \partial \Lambda \) there is an \(j\in J\) with \(B_{\varepsilon }(v) \cap \partial \Lambda \subset \Psi _{\text {gL},j}((0,1)^d)\), where \(B_{\varepsilon }(v)\subset {{\mathbb {R}}}^{d+1}\) is the open ball of radius \(\varepsilon \) at v.
If \(\Vert v_1- v_2 \Vert \ge \varepsilon \), we can choose \(C= \varepsilon \) to get the statement, as the first expression inside the minimum is at most 1.
Hence, we are left with the case \(\Vert v_1- v_2 \Vert < \varepsilon \). Now, we get an \(j \in J\) such that \(v_1,v_2 \in \Psi _{\text {gL},j}((0,1)^d)\). Again, we define \(y_k\) by \(\Psi _{\text {gL},j}(y_k)=v_k\) for \(k=1,2\). The image \(\gamma \) of the linear path from \(y_1\) to \(y_2\) is at most \(C \Vert y_1-y_2\Vert \le C \Vert v_1 -v_2 \Vert \) long. As \(v_1\) and \(v_2\) are in the images of two different \(\Psi _{\text {pC},i}\)’s, the path \(\gamma \) has to intersect some edge \(\Psi _{\text {pC},i}(\partial (0,1)^d)\) which implies \(\gamma \cap \Gamma \ne \emptyset \). Hence, we have
The last inequality follows since \(\Psi _{\text {gL},j}\) is bi-Lipschitz. This finishes the proof. \(\square \)
Lemma A.3
For \(d\ge 1\), let \(f :[0,1]^d \rightarrow {{\mathbb {R}}}^{d+1}\) be a Lipschitz continuous function with Lipschitz constant \(C_{\textrm{lip}}(f)\) and let \(\Lambda \subset {{\mathbb {R}}}^{d+1}\) be a piecewise Lipschitz region. Then, for any \(r>0\), the \((d+1)\)-dimensional Lebesgue volume of the r-neighborhood (see (2.1)) of the set \(f([0,1]^d)\) in \({{\mathbb {R}}}^{d+1}\) satisfies
and the set \(\partial \Lambda \) satisfies the bounds
where \({\mathcal {K}}(\Lambda )\) is described as follows: Let \({\mathcal {A}}\) be the set of all piecewise Lipschitz atlases of \(\partial \Lambda \), as defined in Definition 2.1. Then, we define
Proof
We consider the set
The maximum distance a point in \([0,1]^d\) can have from \(A_r\) is less than \(\frac{r}{C_{\text {lip}}(f)}\). For the cardinality \(\#A_r\) of \(A_r\), we observe
For any \(x \in [0,1]^d\), there is a \(z \in A_r\) such that \(\Vert x-z\Vert \le r /C_{\text {lip}}(f)\) and thus \(\Vert f(x)- f(z) \Vert \le r\). This implies \(B_r(f(A_r)) \supset f([0,1]^d)\), which leads to \(B_{2r}(f(A_r)) \supset B_r(f([0,1]^d))\). Hence, we get
This finishes the proof of the first statement. The second statement is trivially implied by the first one. Furthermore, as \(B_r(f(A_r)) \supset f([0,1]^d)\) due to the definition of the Hausdorff measure (see, e.g., [9, Definition 2.1]) we observe
The final statement is a corollary of this inequality. We want to note that \({\mathcal {K}}(\Lambda )<\infty \) for any piecewise Lipschitz region \(\Lambda \), as we require in this paper our atlases to be a finite collection of charts. \(\square \)
Lemma A.4
Let \(\Lambda \subset {{\mathbb {R}}}^3\) be a piecewise Lipschitz region with piecewise Lipschitz atlas \((\Psi _{\textrm{pL},i})_{i \in I}\). Let \(v_0 \in \partial \Lambda \) satisfy that there are \(i_0 \in I\) and \(x_0 \in (0,1)^2\) such that \(\Psi _{\textrm{pL},i_0}(x_0)=v_0\) and the Jacobi matrix \(D \Psi _{\textrm{pL},i_0}(x_0)\) exists. Then, the signed distance function \({\text {d}}_\Lambda \) is differentiable at \(v_0\), the outward unit normal vector \(n(v_0)\) is well-defined, orthogonal to the image of \(D \Psi _{\textrm{pL},i_0}(x_0)\), and \(D{\text {d}}_\Lambda (v_0)=n(v_0)\).
To prove this statement, we need the following result from intersection theory.
Lemma A.5
Let \(R>0\), \(f_1 :[-1,1] \rightarrow \overline{ B}_R^{(3)} (0), f_2 :\overline{ B}_R^{(2)} (0) \rightarrow \overline{ B}_R^{(3)} (0)\) be continuous functions such that \(f_1( \pm 1 ) = \pm R e_3\) and \(f_2\) restricted to the boundary is the equatorial embedding, that is, \(f_2(x)=(x,0)\) for \(\Vert x \Vert =R\). Then, the images of \(f_1\) and \(f_2\) intersect.
Proof
Without loss of generality, we assume \(R=1\). Assume \(f_1\) and \(f_2\) were two such functions such that their images do not intersect. Let \(\eta _1 :{{\mathbb {R}}}^1 \rightarrow {{\mathbb {R}}}^3, t \mapsto (0,0,t)\) and \(\eta _2 :{{\mathbb {R}}}^2 \rightarrow {{\mathbb {R}}}^3, x \mapsto (x,0)\) be the natural orthogonal inclusions. The assumptions on \(f_j\) can now be stated as \(f_j(x_j)= \eta _j(x_j)\) for \(x_j \in {{\mathbb {R}}}^j\) with \(\Vert x_j \Vert =1\) for \(j\in \{1,2\}\). We extend the maps \(f_j\) to the \({{\mathbb {R}}}^j\) by setting
Trivially, these extensions are still continuous and their images still do not intersect. As the images do not intersect and only get close to each other in the compact set \(\overline{B}_1^{(3)}(0)\), they have a positive distance. We can now mollify \(f_j\) by convolution with an appropriately chosen, compactly supported smooth function to get \({\hat{f}}_j\) such that the images of \({\hat{f}}_1\) and \({\hat{f}}_2\) still have positive distance and \({\hat{f}}_j (x_j)=\eta _j(x_j)\) for any \(x_j \in {{\mathbb {R}}}^j\) with \(\Vert x_j \Vert \ge 2\).
For \(d=1,2,3\), consider the sphere \({\mathbb {S}}^d= {{\mathbb {R}}}^d \cup \{\infty \}\). With the charts \(\text {id} :{{\mathbb {R}}}^d \rightarrow {\mathbb {S}}^d, x \mapsto x\) and \(\iota _d :{{\mathbb {R}}}^d \rightarrow {\mathbb {S}}^d, x \mapsto x / \Vert x \Vert ^2, 0 \mapsto \infty \), it becomes a differentiable manifold. We now extend \({\hat{f}}_j\) to a function from \({\mathbb {S}}^j\) to \({\mathbb {S}}^3\) by setting \({\hat{f}}_j(\infty ):=\infty \) for \(j\in \{1,2\}\). The point \(\infty \) is now an intersection point of \({\hat{f}}_1\) and \({\hat{f}}_2\). We want to show that the extended functions are still smooth and that they intersect transversely at \(\infty \), see for instance [14, Page 113]. For \(j\in \{1,2\}\) and \(x_j \in {{\mathbb {R}}}^j\) with \(\Vert x_j \Vert < \frac{1}{2}\), we observe
Thus, in the charts \(\iota _j ,\iota _3\) the maps \({\hat{f}}_j\) are linear and orthogonal at 0 (which corresponds to \(\infty \in {\mathbb {S}}^j\)). The maps \({\hat{f}}_1,{\hat{f}}_2\) are therefore smooth and intersect transversely at \(\infty \). In conclusion, we have just constructed two smooth maps \({\hat{f}}_j :{\mathbb {S}}^j \rightarrow \mathbb S^3\), which intersect transversely and have a unique intersection point. Thus, their oriented intersection number is equal to the local intersection number at this intersection point, which is \(+1\) or \(-1\) (in fact, it is \(+1\)). However, both maps are contractible (homotopic to a constant map) and thus, as oriented intersection numbers are homotopy invariant (see [14, Page 115]), they should have intersection number 0. This is a contradiction. Hence, the assumption that \(f_1\) and \(f_2\) do not intersect was wrong. \(\square \)
Proof of Lemma A.4
Let \(C_{\text {lip}}\) be a bi-Lipschitz constant of \(\Psi _{\text {pL},i_0}\), as in footnote 1. Then, for any \(x \in {{\mathbb {R}}}^2\) with \(\Vert x \Vert =1\), we observe \(C_{\text {lip}}^{-1} \le \Vert D\Psi _{\text {pL},i_0}(x_0) x \Vert \le C_{\text {lip}}\). This means that the Jacobi matrix is invertible. Thus, using affine linear transformations on \({{\mathbb {R}}}^2\) and on \({{\mathbb {R}}}^3\), we can transform the function \(\Psi _{\text {pL},i_0}\) into a function \(\Psi \) such that \(x_0,v_0\) are mapped to 0Footnote 4 and the Jacobi matrix turns into the standard inclusion \(J :{{\mathbb {R}}}^2 \rightarrow {{\mathbb {R}}}^3, x \mapsto (x,0)\). The function \(\Psi \) is now defined on some closed parallelogram P containing 0 in its interior \(P^{\text {int}}\). Let \(C_{\text {lip}}\ge 2\) be a bi-Lipschitz constant for \(\Psi \). Let \(0<\varepsilon <1/2\). Then, there is an \(r>0\) such that
-
For any \(x \in B_{2C_{\text {lip}}r}^{(2)}(0)\), we have \(\Vert \Psi (x)-(x,0) \Vert \le \varepsilon \Vert x \Vert \), as \(D \Psi (0)=J\);
-
\(B_{3r}^{(3)}(0)\cap \partial \Lambda \subset \Psi (P^{\text {int}})\), as \(\Psi ^{\text {int}}\) is relatively open in \(\partial \Lambda \);
-
The set has exactly two connected components, as \(\overline{\Lambda }\) and are topological manifolds with common boundary \(\partial \Lambda \).
As \(\Psi \) is bi-Lipschitz, we observe
We define
The sets \(U_\pm \cap B_r^{(3)}(0)\) are open, convex and do not intersect \(\partial \Lambda \) due to (A.17). We will now use Lemma A.5 to show that, up to a binary choice, we may assume \(U_- \cap B_r^{(3)}(0) \subset \Lambda \) and . As has exactly two connected components, it is sufficient to prove that any (continuous) path \(p :[-1/2 , 1/2 ] \rightarrow B_r^{(3)}(0)\) with \(p(\pm 1/2) \in U_{\pm }\) intersects \(\partial \Lambda \).
We first use the convexity to extend p by an (affine) linear path at both ends to get a path \(f_1 :[-1,1] \rightarrow \overline{B}_{ 5 r}^{(3)}(0)\) with \(f_1(\pm 1)= \pm 5 r e_3\). Then we define \(f_2 :\overline{B}_{ 5 r}^{(2)}(0) \rightarrow {{\mathbb {R}}}^3\) by
We see that \(f_2 \) is Lipschitz continuous. The middle case is just a convex combination between the two other cases. Let \(x \in {{\mathbb {R}}}^2\) with \(\Vert x \Vert \le 3 r\). We observe
This implies
that is, \(f_2\left( \overline{B}_{3r}^{(2)}(0)\right) \subset U_0\) and thus, by the definition of \(f_2(x)\) for \(\Vert x\Vert \ge 3r\), \(f_2 \left( \overline{B}_{5r}^{(2)}(0) \right) \subset U_0\). By the triangle inequality and with \(\varepsilon < \frac{1}{2} \) we obtain
These inequalities yield \( f_2^{-1} \left( B_{r}^{(3)} (0) \right) \subset B_{2r}^{(2)}(0)\) and \(f_2 \left( \overline{B}_{3r}^{(2)} (0)\right) \subset B_{5r}^{(3)}(0)\). The latter inclusion together with the definition of \(f_2\) outside \(B_{3r}^{(2)}(0)\) implies \(f_2\left( B_{5r}^{(2)}(0) \right) \subset B_{5r}^{(3)}(0)\). Thus, \(f_1\) and \(f_2\) satisfy the assumptions of Lemma A.5 (with \(R :=5 r\)) and consequently, they have an intersection point \(s \in {{\mathbb {R}}}^3\). We have \(s \in f_2\left( \overline{ B}_{5r}^{(2)}(0)\right) \subset U_0\) and \(f_1^{-1}(U_0) \subset (-\frac{1}{2}, \frac{1}{2})\), which means s is in the image of the original path p. Thus, \(s \in B_r^{(3)}(0)\), which implies that \(s \in f_2\left( B_{2r}^{(2)}(0)\right) = \Psi \left( B_{2r}^{(2)}(0)\right) \subset \partial \Lambda \). Therefore, the path p intersects \(\partial \Lambda \), which was our claim.
As a result, we know that the sets \(U_\pm \cap B_r^{(3)}(0)\) lie on opposite sides of \(\partial \Lambda \). Without loss of generality, we assume \(U_- \cap B_r^{(3)}(0) \subset \Lambda \) and . In terms of the signed distance function \({\text {d}}_\Lambda \), this means that \(\pm {\text {d}}_\Lambda (v) >0\) for \(v \in U_\pm \cap B_r^{(3)}(0)\).
We are left to estimate \({\text {dist}}(v, \partial \Lambda )\) for \(v \in B_r^{(3)}(0)\). We start with the case \(v \in U_0 \cap B_r^{(3)}(0)\). For that, we consider the map
We see that \(\Phi \left( B_r^{(2)}(0) \times [- \varepsilon , \varepsilon ]\right) = U_0 \cap B_r^{(3)}(0)\) and \(\Vert \Phi (y,t) \Vert = \Vert y \Vert \). Furthermore, for a fixed y, the map \(t \mapsto \Phi (y,t)\) defined on \([-2\varepsilon ,2\varepsilon ]\) is a path between \(U_+\) and \(U_-\) inside \(B_r^{(3)}(0)\) and must thus intersect \(\partial \Lambda \). This path has a length of \(2 \Vert y \Vert \sin ^{-1}(2 \varepsilon ) \le 2 \pi \varepsilon \Vert y \Vert \). Hence, as each point \(v \in U_0 \cap B_r^{(3)}(0)\) is on such a path for a y with \(\Vert y \Vert = \Vert v \Vert \), we get \(|{\text {d}}_\Lambda (v) |\le 2 \pi \varepsilon \Vert v \Vert \). Therefore, for \(v \in U_0 \cap B_r^{(3)}(0)\), we get
For \(v \in U_{\pm } \cap B_r^{(3)}(0)\), we know that \(\pm {\text {d}}_\Lambda (v) >0\) and only need upper and lower bounds for the distance to \(\partial \Lambda \). For the lower bound, as \(\partial \Lambda \cap B_{2r}^{(3)}(0) \subset U_0\), we have
For the upper bound, we just use
As the signs align, we finally get
Thus, (A.24) holds for all \(v \in B_{r}^{(3)}(0)\), which, by definition, says that \({\text {d}}_\Lambda \) is differentiable at 0 and its differential is \(e_3\). We also see that \(e_3\) is orthogonal to the image of J and points toward , which means that it is the outward normal vector to \(\partial \Lambda \) at 0. This finishes the proof. \(\square \)
Lemma A.6
Let \(\Lambda \subset {{\mathbb {R}}}^3\) be a piecewise Lipschitz region. Then, the outward normal vector n(v) exists for \({\mathcal {H}}^2\) almost every \(v \in \partial \Lambda \) and the set
is a (two-dimensional) Lebesgue null set, where \(\Lambda _{x^\perp }\) and \(\left( \partial \Lambda \right) _{x^\perp }\) are defined in Definition 3.1.
Proof
We observe
The first inclusion is trivial. The second inclusion can be seen as follows. Let \(t\in \partial \left( \Lambda _{x^\perp }\right) \). Then for all \(r>0\), \(B_r^{(1)}(t)\cap \Lambda _{x^\perp }\not =\emptyset \) and . Therefore, \(B_r^{(3)}(x^\perp ,t)\cap \Lambda \not =\emptyset \) and . Therefore, \((x^\perp ,t)\in \partial \Lambda \) and \(t\in \left( \partial \Lambda \right) _{x^\perp }\).
Let \(\pi :{{\mathbb {R}}}^3 \rightarrow {{\mathbb {R}}}^2\) be the projection with \(\pi (e_3)=0\) and let \(\left( \Psi _{\text {pL},i}\right) _{i \in I}\) be a piecewise Lipschitz atlas of \(\partial \Lambda \). For \(i \in I\), we define the sets
which are Lebesgue null sets due to Rademacher’s theorem.Footnote 5 Thus, the set \(\bigcup _{i\in I}\Psi _{\text {pL},i}({\mathcal {N}}_i)\) is an \(\mathcal H^2\) null set, see [9, Theorem 2.8(i)]. Combining this with Lemma A.4, we now know that the outward normal vector n(v) is well-defined for \({\mathcal {H}}^2\) every \(v \in \partial \Lambda \). As \(\pi \circ \Psi _{\text {pL},i} :[0,1]^2 \rightarrow {{\mathbb {R}}}^2\) is Lipschitz, this implies that \((\pi \circ \Psi _{\text {pL},i}) ({\mathcal {N}}_i)\) is a Lebesgue null set [9, Lemma 3.2(iii)]. Furthermore, we define the sets
By [9, Theorem 3.8], we know that \((\pi \circ \Psi _{\text {pL},i}) ({\mathcal {M}}_i)\) is a Lebesgue null set. Let
and
Let \(v \in \partial \Lambda {\setminus } {\mathcal {M}}\). Hence, there is an \(i \in I\) and a \(y \in \Omega _i\), such that \(v = \Psi _{\text {pL},i}(y)\). Thus, \(D\Psi _{\text {pL},i}(y)\) exists, has full rank and does not have \(e_3\) in its image. By Lemma A.4, we know that n(v) exists and that \(n(v) \cdot e_3 \ne 0\). Thus, the function \(p :{{\mathbb {R}}}\rightarrow {{\mathbb {R}}}\) given by \(p(t)={\text {d}}_\Lambda (v+te_3)\) with \({\text {d}}_\Lambda \) being the signed distance function to the boundary \(\partial \Lambda \) has non-vanishing differential at 0 and satisfies \(p(0)=0\). Hence, p changes sign at 0, which means that \(v^\parallel \in \partial \overline{\left( \Lambda _{v^\perp }\right) }\). Conversely, this means that for \(v \in \partial \Lambda \), the property \(v^\parallel \not \in \partial \overline{\left( \Lambda _{v^\perp }\right) }\) implies that \(v \in \mathcal M\). Thus, we have for the set \({\mathcal {N}}\) defined in (A.28),
Finally, we observe
which shows that \({\mathcal {N}}\) is a Lebesgue null set.\(\square \)
Lemma A.7
Let \(\Lambda \subset {{\mathbb {R}}}^3\) be a piecewise Lipschitz region, \(\pi :{{\mathbb {R}}}^3 \rightarrow {{\mathbb {R}}}^2\) be the canonical projection and \(f :\partial \Lambda \rightarrow {{\mathbb {R}}}^+\) be measurable. Let \(n :\partial \Lambda \rightarrow {{\mathbb {R}}}^3\) be the outward normal vector field, which is defined almost everywhere (see Lemma A.6). Then, we have
Proof
The proof is based on a quite general form of changing variables. We use the following area-formula (see [9, Theorem 3.9]) with one slight modification. To this end, let \(n,m \in {\mathbb {N}}\) with \(n \le m\), \(U \subset {{\mathbb {R}}}^n\) be open, \(\Phi :U \rightarrow {{\mathbb {R}}}^m\) be Lipschitz continuous and let \(g :U \rightarrow {{\mathbb {R}}}^+\) be measurable. Then, we have the identity
where \(|D\Phi (y) |^2= \det (D\Phi (y)^* D\Phi (y))\). In [9], this is stated for \(g \in {\textsf{L}}^1(U)\). However, their proof also applies to positive, measurable functions g as an identity in \([0,\infty ]\).
We cannot apply this directly to \(\pi \), as it decreases the dimension and \(\pi \) does not have an inverse. Thus, we have to introduce a new map.
Let \((\Psi _{\text {pL},i})_{i\in I}\) be a piecewise Lipschitz atlas of \(\partial \Lambda \) so that \(\partial \Lambda = \bigcup _{i\in I}\Psi _{\text {pL},i}([0,1]^2)\). We may assume that \({\text {supp}}(f) \subset \Psi _{\text {pL},i} ((0,1)^2) \) for some \(i \in I\). For the remainder of this proof, we write \(\Psi \) for \(\Psi _{\text {pL},i}\).
Now, we can apply (A.37) with \(\Phi :=\pi \circ \Psi \) and \(g :=f\circ \Psi \). Thus, we see
We used that \(\Psi \) is bijective. So, we already have the right-hand side of the claim. We will apply again (A.37) with the functions \(\Phi :=\Psi \) and g given by
Thus, using that \(\Psi \) is bijective and that the measure \(\mathcal H^2\) on \(\partial \Lambda \) is the 2-dimensional Hausdorff measure, we have
To conclude the proof, we only need to show that the quotient of the functional determinants is given by \(|n(v) \cdot e_3 |\) for almost every \(v \in \partial \Lambda \). Let \(v \in \partial \Lambda \) be such that \(B :=D\Psi (\Psi ^{-1}(v)) \in {{\mathbb {R}}}^{3 \times 2}\) is well defined. We identify the two column vectors of the \(3 \times 2\) matrix B as \(w_1\) and \(w_2\). The image of B is the tangent space to \(\partial \Lambda \) at v. As \(\Psi \) is bi-Lipschitz continuous, the matrix B has full rank. The normal vector n(v) is now orthogonal to the linear independent vectors \(w_1,w_2\). Thus, \(n(v) |w_1 \times w_2 |= \pm w_1 \times w_2\).
As \(\pi \) is linear, we have \(D(\pi \circ \Psi )(\Psi ^{-1}(v))= \pi B\). For the determinant of this \(2 \times 2\) matrix, we get \(\det (\pi B)=( w_ 1\times w_2) \cdot e_3\). For the denominator, we observe
In conclusion, we have
In combination with (A.39) and (A.41), we have proved the statement. \(\square \)
Corollary A.8
Let \(\Lambda \subset {{\mathbb {R}}}^3\) be a piecewise Lipschitz region. Then, for Lebesgue almost every \(x^\perp \in {{\mathbb {R}}}^2\), the set \(\Lambda _{x^\perp }\) is a finite (possibly empty) union of intervals with disjoint closures.
Proof
As \({\mathcal {H}}^2( \partial \Lambda )\) is finite, see Lemma A.3, we have by Lemma A.7 (with \(f=1\))
This implies that the set \(\partial (\Lambda _{x^\perp })\subset {{\mathbb {R}}}\) is finite for almost every \(x^\perp \). Hence, \(\Lambda _{x^\perp }\) is almost everywhere a finite union of intervals. If, for some \(x_0^\perp \in {{\mathbb {R}}}^2\), two different connected components of \(\Lambda _{x_0^\perp }\) share a boundary point, t, then \(t \in \partial ( \Lambda _{x_0^\perp }) {\setminus } \partial \overline{\left( \Lambda _{x_0^\perp }\right) }\). Looking at Lemma A.6, we realize that this means \(x_0^\perp \in {\mathcal {N}}\). Thus, we have proved the claim. \(\square \)
Lemma A.9
Let \(\Lambda \subset {{\mathbb {R}}}^3\) be a piecewise \(\textsf{C}^{1,\alpha }\) region with \(\Gamma \) as in Definition 2.1. Then, there is a constant \(C< \infty \) such that
Proof
We start with
For the first step, we just bound the integrand by a step function from above. As \(\Lambda \) is bounded, the associated set is empty for \(k>k_{\text {max}}\) with some finite \(k_{\text {max}}\). We are left to estimate the volume of these sets. Specifically, we will show that there is an \(r_0>0\) and a \(C < \infty \) such that for any \(r< r_0\), we have
We recall that \(B_r(\Gamma )\subset {{\mathbb {R}}}^3\) is the r-neighborhood of \(\Gamma \). By assumption, there is a global Lipschitz atlas \((\Psi _{\text {gL},j})_{j \in J}\) of \(\partial \Lambda \), as in Definition 2.1. For each \(i \in I\), the set \(U_j :=\Psi _{\text {gL},j} ((0,1)^d)\subset \partial \Lambda \) is a (relatively) open subset of the compact metric space \(\partial \Lambda \) and we have \(\partial \Lambda \subset \bigcup _{j \in J} U_j \). Thus, by Lebesgue’s number lemma, there is a constant \(r_0>0\) such that any \(v\in \partial \Lambda \) there is a \(j \in J\) such that \(B_{2r_0}(v) \cap \partial \Lambda \subset U_j\).
Now, we need to understand the set \(\Gamma \). We recall its definition
Let \(C_0\) be a Lipschitz constant for all \(\Psi _{\text {pC},i}\)’s which exists, as I is finite. As \(\partial [0,1]^2\) is just the boundary of the unit square, there is a surjective (piecewise linear) function \(\vartheta :[0,1] \rightarrow \partial [0,1]^2\) with Lipschitz constant 4. Let \(N\in {\mathbb {N}}\) with \(N>4C_0/r_0\) and \(f_k :[0,1] \rightarrow [0,1]\) be the functions satisfying \(f_k(t)=\frac{k-1+t}{N}\). Now, for any \(1 \le k \le N\) and \(i \in I\), we define \(g_{ik} :[0,1] \rightarrow \Gamma \) by \(g_{ik} :=\Psi _{\text {pC},i} \circ \vartheta \circ f_k\) and observe
Furthermore, \(\Gamma = \bigcup _{i\in I} \bigcup _{k=1}^N g_{ik}([0,1])\). By (A.50), we know \(g_{ik}([0,1]) \subset B_{r_0} (g_{ik}(0))\) and thus
for \(r \le r_0\). Hence, there is an \(j=j(i,k) \in J\), such that \( B_r(g_{ik}([0,1]) \cap \partial \Lambda \subset U_{j(i,k)} \). For any \(r \le r_0\), we can estimate
As \(\Psi _{\text {gL},j(i,k)}\) is bi-Lipschitz, there is a constant C such that
and
We now apply Lemma A.3 with \(f=\Psi _{\text {gL},j(i,k)}^{-1} \circ g_{ik}\) and \(d=1\) to obtain
as \(r < r_0\).
In conclusion, as I is finite, we have
For \(r_0<r<2^{k_{\text {max}}}\), we trivially arrive at the same estimate as long as \(C \ge {\mathcal {H}}^2( \partial \Lambda ) r_0^{-1}\), that is, \( {\mathcal {H}}^2(B_r(\Gamma )\cap \partial \Lambda )\le C r\) also for “large” r.
Now, we are able to finish (A.47) and obtain for some (finite) constant C
which was the claim. \(\square \)
Appendix B. Proof of (3.15)
We observe
where we switched back to the integration variables \(x_1, \dots , x_m\) and setFootnote 6\(x_0 :=x_m :=0\). As we can see, the last expression is the \((m-1)\)-fold convolution of \(\langle \, \cdot \, \rangle ^{-1}\) with itself evaluated at 0. This is a job for the Fourier transform. We use the convention
Thus, we have
The Fourier transform of \(\langle \, \cdot \, \rangle ^{-1}\) can be expressed in terms of the modified Bessel function of the second kind \(K_0\), see [27, (10.32.6)],Footnote 7
We observe that
We need the (known) estimate,
where \(\gamma _E\) is Euler’s constant (see, e.g., [27, (5.2.3)]). Using this inequality, the series representations [27, (10.31.2),(10.25.2)], the harmonic series \(H_n :=\sum _{k=1}^n k^{-1}\le n!\), the identity \(\Gamma (n+1)=n!\) (where \(\Gamma \) is the Gamma function, see, e.g., [27, (5.2.1)(5.4.1)]) and the geometric series, we get for any \(t \in (0,1)\)
Using the last two inequalities, we can infer
Thus, as \(K_0\) is decreasing on \({{\mathbb {R}}}^+\) (See [27, \(\S \)10.37]), we have \(2K_0(t)<1\) for \(t>1\). For \(t \in (0,1)\), we estimate using \(\ln (t/2)+\gamma _E<0\) and \(0<\gamma _E<1\),
The last inequality relies on a numerical computation to verify that \(-\gamma _E + \frac{1}{3} (1/(2\textrm{e})+ 1+ \ln (2) -\gamma _E) )\) is negative. Altogether, this yields
for \(0<2 \pi \xi <1\). Thus, we are able to estimate
The final estimate relies on \(m\ge 2\) and \(\pi > 3\), while the last identity is based on (B.5) and
Combining (B.1), (B.3) and (B.18), we arrive at
which was the claim.
Appendix C. Asymptotic Expansion with Order One Error Term
Our final result, Corollary C.3, in this section deals with the asymptotic expansion for a finite union of bounded intervals. That is, we assume that we have \(k\in {\mathbb {N}}\) open and bounded intervals \(I_1,\ldots , I_k\), whose closures are disjoint. More precisely, there exist \(d_i>0\) for \(1 \le i <k\) with \(\sup I_i +d_i = \inf I_{i+1}\). Let \(\ell _j:=|I_j|\) be the length of \(I_j\) and let \(\Omega :=\bigcup _{j=1}^k I_j\). The symbol \(\ell \) for the length of intervals in this section has, of course, nothing to do with the index of a Landau level.
The proof of Corollary C.3 is based upon two lemmata. The first lemma is per se not an asymptotic result but reduces the analysis to a single interval including an error term. The second lemma deals with the asymptotic expansion for a single interval, including an order one error term, and improves a seminal result by Landau and Widom in [19]. This is achieved by improving a certain estimate in their proof which allows for an order one error term instead of \(o(\ln (L))\). Later, Widom [36] extended their result and proved that the error term is indeed of order one. This was used by Sobolev in [32, Chapter 8] to obtain concrete error terms. Our error term is somewhat different and fits our purposes. It is important to notice that there is still an undetermined error term of order one which is however independent of the scaling and the lengths of the intervals and depends only on the energy.
The first lemma is the following.
Lemma C.1
Let \(\mu >0\) and \(m\in {\mathbb {N}}\). Then, under the above assumptions on \(\Omega \) we have
where C is a constant depending on m and \(\mu \), but crucially not on k or the intervals themselves.
Proof
Let \(Q :=\mathbb {1}[(-\textrm{i}\nabla ^\parallel )^2 \le \mu ]\). It is convenient to make a slight generalization by allowing \(I_k\) to be any measurable set such that \(d_{k-1} :=\inf (I_k)- \sup (I_{k-1} )>0\). We note that \(\ell _k\) is undefined, but it is also not present in our claim. We proceed by induction with respect to the number of intervals, k. Once we have proved the statement for \(k=2\), the statement follows for any k, as we can choose \(I_k' :=I_k \cup I_{k+1}\).
Hence, we just have to deal with the case \(k=2\). We observe \(1_\Omega = 1_{I_1}+ 1_{I_2}\). We multiply out the first term and, using \(I_1 \cap I_2 = \emptyset \), we have
The two summands \(j=(1,1, \dots ,1)\) and \(j=(2,2, \dots ,2)\) are the ones we subtract in the statement of the lemma. Hence, we have to estimate all other summands. In the case \(j_0 \ne j_m\), we use \(\textrm{tr}\,AB= \textrm{tr}\,BA\) with \(A=\mathbb {1}_{I_{j_0}}Q\) and \(B= \prod _{i=1}^m Q \mathbb {1}_{I_{j_i}}\) to conclude that the trace vanishes. We are left to estimate the terms where \(j_0=j_m\) and there is an \(i \in \{1,2, \dots ,m-1\}\) with \(j_i \ne j_0\). In this case, we consider \(i_-\) and \(i_+\) as the smallest and largest such i (which can be the same). Now, we write
where \(A_{i_-,i_+}\) is the identity, if \(i_-=i_+\) and a product of some operators Q, \(\mathbb {1}_{I_1}\), and \(\mathbb {1}_{I_2}\) otherwise. As all of the operators are projections, their operator norm can be bounded by 1. As we are interested in the trace, we will bound the trace norm. To do so, it suffices to bound two operators in the Hilbert–Schmidt norm and all others in the operator norm. The operators we will bound in Hilbert–Schmidt norm are \(\mathbb {1}_{I_{j_0}} Q \mathbb {1}_{I_{j_{i_-}}}\) and \( \mathbb {1}_{I_{j_{i_+}}}Q \mathbb {1}_{I_{j_0}}\). These operators are adjoint and hence have the same Hilbert–Schmidt norm. As \(j_{i_-} \ne j_0 \ne j_{i_+}\), we know \(\{j_0,j_{i_-}\}=\{j_0,j_{i_+}\}=\{1,2\}\). Thus, we are left to estimate \(\Vert \mathbb {1}_{I_1} Q \mathbb {1}_{I_2} \Vert _2^2\). Since the operator Q has integral kernel \(Q(x,y)= k_\mu (x-y) = \frac{\sin (\sqrt{\mu }(x-y))}{\pi (x-y)}, x,y\in {{\mathbb {R}}}\), the square of the Hilbert–Schmidt norm can be easily calculated as the square of the integral of this kernel for \(x \in I_1\) and \(y \in I_2\). By translation invariance we may assume that \(I_1=(0,\ell _1)\). By the definition of \(d_1\), we know \(I_2 \subset (\ell _1+d_1, \infty )\). Hence, using the estimate \(|k_\mu (z)|\le C/(1+|z|)\) for some constant C we get
The number of such error terms is \(2^m-1\). Thus, the error bound in m is quite bad, but we only need it to be good in k. The proof is now finished. \(\square \)
Here is our second lemma on the mentioned improved asymptotic expansion for a single interval of Landau and Widom. This agrees with the improvement of Widom in [36]. As the paper of Landau and Widom [19] is freely accessible, but the later paper by Widom [36] is not,Footnote 8 we provide this different proof for the reader’s convenience. We do not claim any originality.
Lemma C.2
Let \(\Omega \subset {\mathbb {R}}\) be an open and bounded interval of length \(\ell >0\) and let \(\mu >0\). Then, for any \(m\in {\mathbb {N}}\) and \(L >0\), we have with \(\textsf{I}(m)\) explained after (2.15),
where the order one error term is independent of L and \(\ell \) but depends on \(\mu \).
Proof
The case \(m=1\) is trivial, as the integral kernel is constant on the diagonal and only the volume term \(\frac{\sqrt{\mu }}{\pi }L\ell \) appears. Thus, by linearity, it suffices to show the statement for a basis of the polynomials vanishing at 0 and 1.
As \(\mu \) is fixed, the result depends only on \(L\ell \), which can be small or large. If \(L\ell \le 1\) then the trace on the left-hand side of (C.7) is bounded uniformly for these \(L,\ell \) by continuity as a function of \(L\ell \in [0,1]\). The same is true for the first two terms on the right-hand side of (C.7) and hence the equality holds true with an O(1) error term. In the following, we will assume that \(L\ell >1\).
From now on we use the same notation as in [19], where c takes the role of L. The last equation in the proof of their Theorem 1, where they still carry the order one error term is [19, (18)]. Afterward they allow for a larger \(o(\ln (c))\) error term and here we take a different route.
They consider the polynomials \((t(1-t))^n\) and \(t(t(1-t))^n\) for \(n \in {\mathbb {N}}\), which span all polynomials that vanish at 0 and 1. We proceed with these polynomials instead of \(t^m\) as in the statement of our lemma. Their equation [19, (18)] states
where \(A_c= P(0,c) Q(0,1)P(0,c)\), which is unitarily equivalent to the operator \(\mathbb {1}_{[0,c]} \mathbb {1}\big (-\Delta \le \frac{1}{4} \big ) \mathbb {1}_{[0,c]}\) in our notation, and \(K_c=P(1,c) Q(-\infty ,0) P(-\infty , 0) Q( -\infty ,0) P(1,c)\), as stated below [19, (17)]. They state below [19, (18)] that the integral kernel of the operator \(K_c\) on \({\textsf{L}}^2([1,c])\) is given for \(1 \le x,y \le c\) by
Let K be the operator on \({\textsf{L}}^2({\mathbb {R}}^+)\) with integral kernel f(x, y) for \(0< x,y < \infty \). Thus, \(K_c=P(1,c) K P(1,c)\) and \(K= Q(-\infty ,0) P(-\infty , 0) Q( -\infty ,0)\). Hence, we can conclude
with the convention \(x_{n+1}=x_1\). We denote the integrand \(f_n(x) :=\prod _{i=1}^n f(x_i, x_{i+1})\). It satisfies for \(\lambda >0\) the homogeneity property \(f_n(\lambda x)= \lambda ^{-n} f_n(x)\), which indicates that we should use spherical coordinates to calculate the integral. The problem is, however, that the integration domain does not look particularly nice in spherical coordinates. Thus, we would like to change the integration domain without changing the integral too much.
The first thing to observe is that as \(\ln \) is increasing, \(f_n(x) \ge 0\) holds for any \(n \in {\mathbb {N}}, x \in ({{\mathbb {R}}}^+)^n\). For any (Borel) measurable \(X \subset ({{\mathbb {R}}}^+)^n\), we define
As the integrand is non-negative, \(\iota \) is a measure. We also observe that \(\iota \) is invariant under the cyclic shift \((x_1, x_2, \dots , x_n) \mapsto (x_2, x_3, \dots , x_n,x_1)\). Assuming \(n >1\), for \(i=1, \dots , n\), we consider the set
We observe \(\iota (U_i)=\iota (U_1)\) by the cyclic shift property. We see
Since \(P(0,1)K P(1,\infty )= P(0,1) Q(-\infty ,0)P(-\infty , 0) Q(-\infty ,0) P(1,\infty )\) is the operator R [19, (9)] with appropriately chosen intervals J, M, K, N, L we see that this is trace class by [19, Lemma, (L2)]. By the homogeneity of \(f_n\), we even have \(\iota ( c U_1)= c^{n-n} \iota (U_1)=\iota (U_1)\).
Next, we introduce the set
which looks very nice in spherical coordinates. For \(n=1\), we just have \(V=[1,c]\). For \(n>1\), we observe the chain
The first inclusion is trivial. We call \(x_1, \dots , x_n\) the coordinates of x. If x has both a coordinate above \(\lambda \) and one below \(\lambda \), then it has to be in the set \(\bigcup _{i=1} ^n \lambda U_i\). Any \(x \in V\) has at least one coordinate above 1 and a coordinate below c. Thus, if \(x \not \in [1,c]^n\), it has to have a coordinate above and below 1 or a coordinate above and below c, which proves the second inclusion. These inclusions and the subadditivity and monotonicity of \(\iota \) imply that there is a constant \(C_n\) (depending on n but not on c) such that
This holds with O(1) replaced by 0 for \(n=1\). Finally, we introduce \(W :=\big \{ x \in ({{\mathbb {R}}}^+)^n : \Vert x \Vert = \sqrt{n}\big \}\) with Hausdorff measure \({\mathcal {H}}^{n-1}\) and observe \(V= [1,c] W = \big \{\lambda x : 1\le \lambda \le c, x\in W\big \}\). Now, we are just left to calculate
where \(\tilde{C}(n)\) is the result of the surface integral. We did a change to spherical coordinates in the second step. As the integrand is positive, \(\tilde{C}(n)\in (0,\infty ]\) is well-defined. By (C.15), we conclude (for fixed n and as \(c\rightarrow \infty \))
From [19, (19)], we know that \(\tilde{C}(n)= \frac{1}{4\pi ^2} \int _0^1 \textrm{d}t\, (t(1-t))^{n-1}\). In conjunction with (C.8), we get the improved error term O(1) with the same leading term for any polynomial, which vanishes at 0 and 1.
To get the claim of our lemma, we just have to replace c by \(2\sqrt{\mu }L\ell \) and then use \(\ln (2\sqrt{\mu }L\ell ) = \ln (2)+ \frac{1}{2} \ln (\mu ) +\ln (L\ell )=\ln (1+L\ell )+ O(1)\), which relies on \(L\ell \ge 1\). \(\square \)
Now we are in position to present and prove the main result in this section. The dependency of our error term on \(\Omega \) is not just O(1) as in [36] but explicit in terms of the number, lengths, and distances of the constituent intervals of \(\Omega \). Sobolev in [32, Chapter 8] has a similar error term, which, however, does not seem to suffice for our purposes.
Corollary C.3
We assume the same conditions on the set \(\Omega \) as in Lemma C.1, \(\mu >0\) and \(m\in {\mathbb {N}}\). Then, with \(\textsf{I}(m)\) explained after (2.15), we have for any \(L \ge 1\),
Proof
For the case of a single interval, we use Lemma C.2
As \(L \ge 1\), we have
Next, we observeFootnote 9 that for any \(a>0\) and \(b>0\), we have
Now, we only need to rewrite the error term from Lemma C.1 in the form we claim in this corollary. Thus, we estimate
Once we sum the error term estimate in (C.26) for \(i=1, \dots , k-1\) and the one in (C.24) for \(i=1, \dots , k\), we arrive at the claimed error estimate in (C.21). We also see that the sum of the main terms in (C.23) for \(i=1,\dots , k\) agrees with the main term in (C.21). This finishes the proof. \(\square \)
Appendix D. A Technical Lemma on Decaying Functions
This section contains a technical lemma that was useful to construct the sequence \((a_i)_{i\in {\mathbb {N}}}\) and the region \(\Lambda \) in the proof of Theorem 6.2.
Lemma D.1
Let \(f :{{\mathbb {R}}}^+ \rightarrow {{\mathbb {R}}}^+\) be bounded and satisfy \(\lim _{L \rightarrow \infty } f(L)=0\). Then, there is a convex, non-increasing function \(\textrm{Env}(f) :{{\mathbb {R}}}^+ \rightarrow {{\mathbb {R}}}^+\) satisfying \(\textrm{Env}(f)(0)=1, \lim _{L \rightarrow \infty } \textrm{Env}(f)(L)= \lim _{L \rightarrow \infty } f(L)/\textrm{Env}(f)(L) =0\) and \(\textrm{Env}(f)(L) \ge C/\sqrt{\ln (2+L)}\) for some \(C>0\).
Proof
The conditions on \(\textrm{Env}(f)\) only get worse if we increase f. Hence, we can replace f by
This is non-increasing. To achieve the \(f(L)/\textrm{Env}(f)(L)\) condition we consider \(\sqrt{{\hat{f}}}\). However, we still need to make sure that \(\textrm{Env}(f)\) is convex. For this reason, we need to consider the lower convex envelope . It is given by the supremum over all convex functions below \(\sqrt{{\hat{f}}}\). Another way to think of it is that the area above the graph of is the convex hull of the area above \(\sqrt{{\hat{f}}}\). Finally, we define
where N is a normalization constant to be chosen below. As the lower convex envelope and \(\frac{1}{\sqrt{ \ln (2+t)}}\) are convex, so is \(\textrm{Env}(f)\). As the lower convex envelope lies below the function, we have \(\textrm{Env}(f)(t) \le N\Big (\sqrt{{\hat{f}} (t/2)} + \frac{1}{\sqrt{ \ln (2+t)}}\Big ) \rightarrow 0\) as \(t \rightarrow \infty \). As \(\textrm{Env}(f)\) is convex and \(\lim _{L \rightarrow \infty }\textrm{Env}(f)(L)=0\), \(\textrm{Env}(f)\) is non-increasing. The condition \(\textrm{Env}(f)(L) \ge C/\sqrt{\ln (2+L)}\) is trivially satisfied and implies \(\textrm{Env}(f)(0)>0\) and hence allows us to choose N such that \(\textrm{Env}(f)(0)=1\). We are only left with the condition \(\lim _{L \rightarrow \infty } f(L)/\textrm{Env}(f)(L) =0\). To show this, it is sufficient to prove
By the definition of the convex envelope, for any \(t \ge 0\), there are \(0 \le t_1 \le t < t_2\) such that
If \(t_2 \le 2t\), as \({\hat{f}}\) is non-increasing, we get \(\sqrt{{\hat{f}}}\left( \frac{t_j}{2} \right) \ge \sqrt{{\hat{f}}}(t) \) for \(j=1,2\) and thus are finished. If \(t_2 > 2t\), then, as \(t_1 \ge 0\), we have \(\frac{t_2-t}{t_2-t_1} > \frac{1}{2} \). Thus, it suffices to bound the first summand from below, which we already did. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pfeiffer, P., Spitzer, W. Entanglement Entropy of Ground States of the Three-Dimensional Ideal Fermi Gas in a Magnetic Field. Ann. Henri Poincaré (2023). https://doi.org/10.1007/s00023-023-01381-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00023-023-01381-3