Proof of the Fractional Sum Definition of the Zeta Function

This post contains a proof of the contents of my video, Extending the Zeta Function with Fractional Sums. If you aren’t coming from that video, I highly recommend watching it first.

My new video builds on a previous video in which I arrive at a definition for fractional sums:

k=1zf(k)=limn(k=1n1(f(k)f(z+k))+k=1m(zk)Δk1f(n))(1)\sum_{k=1}^z f(k) = \lim_{n \to \infty} \left( \sum_{k=1}^{n-1} (f(k) - f(z + k)) + \sum_{k=1}^m \binom zk \Delta^{k-1} f(n) \right) \tag{1}

for any natural number mm. In my new video I use these fractional sums to derive a formula for the Riemann zeta function:

ζ(s)=1s1ddzk=1zk1sz=0Re(s)>1m,(2)\zeta(s) = \frac1{s-1} \frac d{dz}\left. \sum_{k=1}^z k^{1-s} \right|_{z=0} \qquad \text{Re}(s) > 1 - m, \tag{2}

for whatever mm we choose in the (1)(1). However, I did not prove that the fractional sum in this formula converges, that a necessary interchange of limits is valid, or that this new definition of ζ\zeta is holomorphic for all ss in its domain.

This post contains a proof of these facts.

Locally Uniform Convergence is Enough

In my video, we kicked things off by asking what the fractional sum of zzwz \mapsto z^w looks like for any fixed wCw \in \mathbb{C}. We used the principal branch of this function such that 1w=11^w = 1, with a branch cut on (,0](-\infty, 0]. It then follows from (1)(1) that the fractional sum

zk=1zkwz \mapsto \sum_{k=1}^z k^w

has a branch cut on (,1](-\infty, -1].

We then observed that this fractional sum appeared to be holomorphic in C(,1]\mathbb{C} \setminus (-\infty, 1] as long as the real part of ww is less than our choice of mm in (1)(1). For the rest of this post we will assume a fixed mm, and we will examine the fractional sum as a function of both zz and ww.

Definitions: The domain Ω\Omega is given by

Ω=(C(,1])×{wC:Re(w)<m}.\Omega = (\mathbb{C} \setminus (-\infty, 1]) \times \{w \in \mathbb{C} : \text{Re}(w) < m\}.

For each nNn \in \mathbb{N}, define Fn:ΩCF_n: \Omega \to \mathbb{C} by

Fn(z,w)=k=1n1(kw(z+k)w)+k=1m(zk)Δnk1(nw).F_n(z, w) = \sum_{k=1}^{n-1} (k^w - (z + k)^w) + \sum_{k=1}^m \binom zk \Delta_n^{k-1} (n^w).

Define F:ΩCF: \Omega \to \mathbb{C} as the limit

F(z,w)=limnFn(z,w)F(z, w) = \lim_{n \to \infty} F_n(z, w)

if the sequence FnF_n converges.

This function FF is the fractional sum of zwz^w.

To show that (2)(2) is the analytic continuation of the zeta function, we must prove that when ww is fixed, zF(z,w)z \mapsto F(z, w) is holomorphic so that we may take its derivative at z=0z=0. We also need that FnzFz\frac{\partial F_n}{\partial z} \to \frac{\partial F}{\partial z} as nn \to \infty because if we can swap the derivative and the limit, we can get

Fz(z,1s)z=0=limn(k=1n11ks+k=1m(1)kΔnk1(n1s)k(1s)).\left.\frac{\partial F}{\partial z}(z, 1-s)\right|_{z=0} = \lim_{n \to \infty} \left(\sum_{k=1}^{n-1} \frac1{k^s} + \sum_{k=1}^m \frac{(-1)^k \Delta^{k-1}_n (n^{1-s})}{k(1 - s)} \right).

The second summation vanishes in the limit when Re(s)>1\text{Re}(s) > 1, so this reduces to usual the infinite series definition of ζ(s)\zeta(s). Therefore, if wFz(0,w)w \mapsto \frac{\partial F}{\partial z}(0,w) is holomorphic, we have the analytic continuation of ζ\zeta.

Fortunately, “holomorphic functions are really nice”. We can simply use a standard complex analysis result which I’ll refer to the uniform convergence theorem:

The Uniform Convergence Theorem: If a sequence of holomorphic functions fnf_n converges to a function ff locally uniformly (or equivalently, uniformly on compact sets), then ff is holomorphic, and fnf_n' converges locally uniformly to ff'.

So all we need to do is show that FnF_n converges locally uniformly, and we’ll have all the conditions we need.

The only catch (for me anyway) is that this uniform convergence theorem is only for functions of a single variable. It is not immediately clear that locally uniform convergence allows us to conclude that Fz\frac{\partial F}{\partial z} is holomorphic in ww for fixed zz. I am sure that this is a standard result in multi-variable complex analysis, but for me, it’s a “missing link” in the proof. Fortunately, for our specific sequence FnF_n, we can prove this quickly with Cauchy’s integral formula.

Proof of the missing link

Proof of Locally Uniform Convergence

We start by defining the following domains:

Definition: Given any RNR \in \mathbb{N}, let ε=1/R\varepsilon = 1/R and define

UR={zC:z<R}(,1],VR={wC:w<R,  Re(w)<mε}.\begin{align*} U_R &= \{z \in \mathbb{C} : |z| < R\} \setminus (-\infty, -1],\\ V_R &= \{w \in \mathbb{C} : |w| < R,\ \ \text{Re}(w) < m - \varepsilon\}. \end{align*}

For any (z,w)Ω(z, w) \in \Omega, we can choose RR to be large enough that (z,w)UR×VR(z, w) \in U_R \times V_R. Therefore it suffices to prove that fnf_n converges uniformly in UR×VRU_R \times V_R for arbitrarily large RR. For the rest of the proof, we will assume an arbitrary fixed R>mR > m, and thus a fixed ε=1/R\varepsilon = 1/R. We will use CC to denote a positive constant which may change from line to line, but depends only on mm and RR.

We will start by demonstrating that any degree mm polynomial is bounded on URU_R by its values at the integers 00 through mm.

Lemma: Let p(z)p(z) be a degree mm polynomial. Then there exists a constant CC such that for any zURz \in U_R,

p(z)CmaxkZ0kmp(k).|p(z)| \leq C \cdot \max_{\substack{k \in \mathbb{Z} \\ 0\leq k\leq m}} |p(k)|.

Proof: Consider the Lagrange polynomial representation of p(z)p(z) determined by its values at 0,1,,m0, 1, \dots, m:

p(z)=k=0mp(k)0jmjkzjkj.p(z) = \sum_{k=0}^m p(k) \prod_{\substack{0 \leq j \leq m \\ j \neq k}} \frac{z - j}{k - j}.

Since zjz+jR+m|z - j| \leq |z| + |j| \leq R + m and since kj1|k - j| \geq 1, we have

p(z)k=0mp(k)(R+m)m(m+1)(R+m)mmaxkZ0kmp(k).\begin{aligned} |p(z)| &\leq \sum_{k=0}^m |p(k)| (R + m)^m \\ &\leq (m+1)(R+m)^m \cdot \max_{\substack{k \in \mathbb{Z} \\ 0\leq k\leq m}} |p(k)|. \end{aligned}

Thus the claim is established with C=(m+1)(R+m)mC = (m+1)(R+m)^m. \square

The specific polynomials we will work with are the degree mm Gregory-Newton and Taylor polynomials of (z+n)w(z+n)^w centered at z=0z=0. Let us introduce shorthands for them.

Definition: Let nNn \in \mathbb{N}. We define

Gn(z,w)=k=0m(zk)Δnk(nw),Tn(z,w)=k=0mzkk! ⁣dkdζk(ζ+n)wζ=0.\begin{aligned} G_n(z,w) &= \sum_{k=0}^m \binom zk \Delta_n^k(n^w),\\ T_n(z,w) &= \sum_{k=0}^m \frac{z^k}{k!}\!\left.\frac{d^k}{d\zeta^k}(\zeta+n)^w\right|_{\zeta=0}. \end{aligned}

We can now begin our proof that FnF_n converges uniformly on UR×VRU_R \times V_R.

Theorem: FnF_n converges uniformly on UR×VRU_R \times V_R.

Proof: Observe that if N>RN > R, then

FN(z,w)=FR+1(z,w)+n=R+1NΔnFn(z,w),(3)F_N(z,w) = F_{R+1}(z,w) + \sum_{n=R+1}^N \Delta_n F_n(z,w), \tag{3}

so it suffices to establish uniform convergence of the series

n=R+1ΔnFn(z,w).\sum_{n=R+1}^\infty \Delta_n F_n(z,w).

Let (z,w)UR×VR(z, w) \in U_R \times V_R. By inserting the definition of FnF_n and grouping terms, we notice that

ΔnFn(z,w)=(nw(z+n)w)+Δnk=1m(zk)Δnk1(nw)=nw+k=1m(zk)Δnk(nw)(z+n)w=k=0m(zk)Δnk(nw)(z+n)w=Gn(z,w)(z+n)w.\begin{aligned} \Delta_n F_n(z,w) &= (n^w - (z+n)^w) + \Delta_n\sum_{k=1}^m \binom zk \Delta_n^{k-1} (n^w)\\ &= n^w + \sum_{k=1}^m \binom zk \Delta_n^k (n^w) - (z+n)^w\\ &= \sum_{k=0}^m \binom zk \Delta_n^k (n^w) - (z+n)^w\\ &= G_n(z,w) - (z+n)^w. \end{aligned}

Therefore

ΔnFn(z,w)=Gn(z,w)(z+n)wGn(z,w)Tn(z,w)+Tn(z,w)(z+n)w.\begin{align} |\Delta_n F_n(z,w)| &= |G_n(z,w) - (z+n)^w| \notag\\ &\leq |G_n(z,w) - T_n(z,w)| + |T_n(z,w) - (z+n)^w|. \tag{4} \end{align}

Since URU_R contains the line segment from 00 to zz, Taylor’s theorem with remainder gives

Tn(z,w)(z+n)wzm+1(m+1)!supζURdm+1dζm+1(ζ+n)w<CsupζURdm+1dζm+1(ζ+n)w.\begin{aligned} |T_n(z,w) - (z+n)^w| &\leq \frac{|z|^{m+1}}{(m+1)!} \cdot \sup_{\zeta \in U_R}\left|\frac{d^{m+1}}{d\zeta^{m+1}} (\zeta+n)^w\right|\\ &< C \cdot \sup_{\zeta \in U_R}\left|\frac{d^{m+1}}{d\zeta^{m+1}} (\zeta+n)^w\right|. \end{aligned}

By differentiating explicitly and using that wVRw \in V_R, hence w<R|w| < R, we obtain

dm+1dζm+1(ζ+n)w=ww1wm(ζ+n)wm1Cζ+nRe(w)m1.\begin{aligned} \left|\frac{d^{m+1}}{d\zeta^{m+1}} (\zeta+n)^w\right| &= |w||w-1|\cdots|w-m|\left| (\zeta+n)^{w-m-1} \right|\\ &\leq C \cdot |\zeta + n|^{\text{Re}(w) - m - 1}. \end{aligned}

Since Re(w)<mε\text{Re}(w) < m - \varepsilon, we see that Re(w)m1<(1+ε)\text{Re}(w) - m - 1 < -(1 + \varepsilon). Also, since ζ<Rn1|\zeta| < R \leq n - 1, we can see that ζ+nnζ>1|\zeta + n| \geq n - |\zeta| > 1. Therefore

ζ+nRe(w)m1(nζ)(1+ε)<(nR)(1+ε).\begin{aligned} |\zeta + n|^{\text{Re}(w) - m - 1} &\leq (n - |\zeta|)^{-(1 + \varepsilon)}\\ & < (n - R)^{-(1 + \varepsilon)}. \end{aligned}

This means that

Tn(z,w)(z+n)w<C(nR)(1+ε)(5)|T_n(z,w) - (z+n)^w| < C (n - R)^{-(1 + \varepsilon)} \tag{5}

for some constant CC. This is a uniform bound on the second part of (4)(4). Let’s turn our attention to the remaining part.

For fixed ww, the expression Gn(z,w)Tn(z,w)G_n(z,w) - T_n(z,w) is a polynomial of degree mm in zz, so our lemma guarantees that

Gn(z,w)Tn(z,w)CmaxkZ0kmGn(k,w)Tn(k,w).|G_n(z,w) - T_n(z,w)| \leq C \cdot \max_{\substack{k \in \mathbb{Z} \\ 0\leq k\leq m}} |G_n(k,w) - T_n(k,w)|.

Because the Gregory-Newton polynomial coincides with the function it interpolates at the integers 00 through mm, we have

Gn(k,w)Tn(k,w)=(k+n)wTn(k,w).|G_n(k,w) - T_n(k,w)| = |(k+n)^w - T_n(k,w)|.

Because we chose RR to be greater than mm, we know that kURk \in U_R, and therefore we may apply (5)(5) with kk in place of zz to see that

Gn(k,w)Tn(k,w)C(nR)(1+ε)|G_n(k,w) - T_n(k,w)| \leq C (n - R)^{-(1 + \varepsilon)}

for some constant CC.

We have shown that both parts of the right-hand side of (4)(4) are bounded by C(nR)(1+ε)C(n-R)^{-(1+\varepsilon)}. Therefore, since

n=R+1(nR)(1+ε)=n=11n1+ε<,\sum_{n=R+1}^\infty (n-R)^{-(1+\varepsilon)} = \sum_{n=1}^\infty \frac1{n^{1+\varepsilon}} < \infty,

we know by the Weierstrass M-test that

n=R+1ΔnFn(z,w)\sum_{n=R+1}^\infty \Delta_n F_n(z, w)

converges uniformly. Thus (3)(3) shows that FNF_N converges uniformly on UR×VRU_R \times V_R, so we are done.
\square