Let F be some (potentially nonlinear functional) F:L2([0,1])→R
taking functions ρ∈L2([0,1]) Suppose we have some ε∈R and δx∈L2([0,1]). Then we can create the (linear?!) functional D:L2([0,1])→R defined by Dρ(δx)=ε→0limεF(ρ+εδx)−F(ρ). Why is this linear?
Assume the necessary quantities exist to define the following (this appears to be equivalent to our functional F being Gateaux Differentiable at ρ0).
0=Dρ(δx1+δx2)−Dρ(δx1)−Dρ(δx2)=ε→0limεF(ρ+ε(δx1+δx2))−F(ρ+εδx1)−F(ρ+εδx2)
This gives us a feel for the regularity that we need F to have. One weird thing about the Gateaux derivative is that
it is so general that it can fail to even be linear. Thus, the better requirement is to assume Fréchet differentiability of F at ρ0.
In its most general formulation, this posits that F is a functional on a Locally Convex Metrizable Topological Vector Space, which is complete (e.g. w.r.t. the metric).
A few notes on this: the spaces Ck(M) works for smooth manifolds under the countable family of
seminorms given by ∥f∥k,n=sup{∣fk∣∣x∈[−n,n]} and holds even for k=∞.
For the purposes of most of the systems that I want to work with, we can assume that F
takes the form of ∫F(ρ(x))dμ(x) for some
F∈C1(R).
Let us check that this is Fréchet differentiable.
That is for h∈L2([0,1])0=ε→0lim∥εh∥∣∣F(ρ+εh)−F(ρ)−Fρ′(εh)∣∣
We can make a pretty good guess that
Fρ′(h)=∫F′(ρ(x))h(x)dx
(interesting note: the fact that F′ is only integrated against means that I think we need F to be merely weakly differentiable,
but this should be verified). We then find
ε→0lim∥εh∥∣∣F(ρ+εh)−F(ρ)−Fρ′(εh)∣∣=ε→0lim∥εh∥∣∣∫F(ρ(x)+εh(x))−F(ρ(x))−εF′(ρ(x))h(x)dx∣∣
A precise form of Taylor's theorem states that for any fixed x, we can find some hρ(x)(y) such that limy→ρ(x)hρ(x)(y)=0 and
F(ρ(x)+εh(x))=F(ρ(x))+εh(x)F′(ρ(x))+∫0εh(x)F′′(ρ(x)+t)tdt
Assume that ∣F′′∣ can be essentially bounded by some C, then we have
∣R(x)∣=∣∣∫0εh(x)F′′(ρ(x)+t)tdx′∣∣dx≤∫0εh(x)∣F′′(ρ(x)+t)t∣dx′≤∫0εh(x)∣Ct∣dx=2C(εh(x))2
This meabs
This demonstrates that, indeed,
Fρ′(h)=∫F′(ρ(x))h(x)dx
However!! In this process we've determined that, in fact,
Dρ(h)=Fρ′(h)=∫F′(ρ(x))h(x)dx.
Now! Here comes the fun part. This linear functional Dρ(h) can be represented by
integration against the measure F′(ρ(x))dμ. This functional can be used to calculate the directional
functional derivative in some "direction" h(x). However, it is extremely common (especially in numerical applications) to
wish to access a quantity δρδF which can be evaluated pointwise.
Here's the really clever thing, and the point of why I started writing this little article in the first place:
for any Fréchet-differentiable functional G, we can do this process to find some measure ν
against which an h can be integrated to calculate a directional functional derivative. If we want to find δρδG
then we merely need to calculate the Radon-Nikodym derivative dμdν (we omit for the moment the considerations
that determine whether ν≪μ).
Let us examine this concretely for our example F. The Radon-Nikodym derivative is some Lebesgue integrable function
dμdν which satisfies ν(A)=∫Adμdνdμ for all measurable sets A. Well, in the case
of our F we are calculating the R-N derivative with respect to the underlying measure μ corresponding to L2([0,1])
and so we get δρδF(ρ0;x)=dμdνρ0(x)=F′(ρ0(x)).