Monday, 14 January 2008

pr.probability - Is there an introduction to probability theory from a structuralist/categorical perspective?

One can argue that an object of the right category of spaces in measure theory is not a set equipped
with a sigmasigma-algebra of measurable sets, but rather a set SS equipped with a sigmasigma-algebra MM of measurable
sets and a sigmasigma-ideal NN of MM consisting of sets of measure 00.
The reason for this is that you can hardly state any theorem of measure theory or probability
theory without referring to sets of measure 00.
However, objects of this category contain less data than the usual measured spaces,
because they are not equipped with a measure.
Therefore I prefer to call them measurable spaces.
A morphism of measurable spaces (S,M,N)to(T,P,Q)(S,M,N)to(T,P,Q) is a map StoTStoT such that
the preimage of every element of PP is a union of an element of MM and a subset of an element of NN
and the preimage of every element of QQ is a subset of an element of NN.



Irving Segal proved that for a measurable space the following properties are equivalent:



  1. The Boolean algebra M/NM/N of equivalence classes of measurable sets is complete;

  2. The space of equivalence classes of all bounded (or unbounded) real-valued functions on SS is Dedekind-complete;

  3. Radon-Nikodym theorem is true for (S,M,N)(S,M,N);

  4. Riesz theorem is true for (S,M,N)(S,M,N);

  5. Equivalence classes of bounded functions on SS form a von Neumann algebra (aka WW-algebra).

  6. (S,M,N)(S,M,N) is a coproduct (disjoint union) of points and real lines.

A measurable space that satisfies these conditions is called localizable.



This theorem tells us that if we want to prove anything nontrivial about measurable
spaces, we better restrict ourselves to localizable measurable spaces.
We also have a nice illustration of the claim I made in the first paragraph:
None of these statements would be true without identifying objects that differ
on a set of measure 00.
For example, take a non-measurable set GG
and a family of one-element subsets of GG indexed by themselves.
This family of measurable sets does not have a supremum in the Boolean algebra
of measurable sets, thus disproving a naïve version of (1).



Another argument for restricting to localizable spaces is the following version of
Gelfand-Neumark theorem:




The category of localizable measurable spaces is equivalent to the category of commutative von Neumann algebras (aka WW-algebras) and their morphisms (normal unital homomorphisms of -algebras).




I actually prefer to define the category of localizable measurable spaces
as the opposite category of the category of commutative WW-algebras.
The reason for this is that the classical definition of measurable space
exhibits immediate connections only to descriptive set theory (and with additional
effort to Boolean algebras), which are mostly
irrelevant for the central core of mathematics,
whereas the description in terms of operator algebras immediately connects
measure theory to other areas of the central core (noncommutative geometry,
algebraic geometry, complex geometry, differential geometry etc.).
Also it is easier to use in practice. Let me illustrate this statement
with just one example: When we try to define measurable bundles of Hilbert spaces
on a localizable measurable space set-theoretically, we run into all sorts of problems
if the fibers can be non-separable, and I do not know how to fix this problem in the set-theoretic framework.
On the other hand, in the algebraic framework we can simply say that a bundle
of Hilbert spaces is a Hilbert module over the corresponding WW-algebra.



Categorical properties of WW-algebras (hence of localizable measurable spaces)
were investigated by Guichardet.
Electronic version of this paper is available here.
Let me mention some of his results.
The category of localizable measurable spaces admits equalizers and coequalizers,
arbitrary coproducts and hence arbitrary colimits.
It also admits products, although they are quite different from what one might think.
For example, the product of two real lines is not BbbR2BbbR2 with the two obvious projections.
The product contains BbbR2BbbR2, but it also has a lot of other stuff, for example, the diagonal of BbbR2BbbR2,
which is needed to satisfy the universal property for the two identity maps on BbbRBbbR.
The more intuitive product of measurable spaces (BbbRtimesBbbR=BbbR2BbbRtimesBbbR=BbbR2) corresponds to the spatial
tensor product of von Neumann algebras and forms a part of a symmetric monoidal
structure on the category of measurable spaces.
See Guichardet's paper for other categorical properties (monoidal structures
on measurable spaces, flatness, existence of filtered limits, etc.).



Finally let me mention pushforward and pullback properties of measures on measurable spaces.
I will talk about more general case of LpLp spaces instead of just measures (i.e., L1L1 spaces).
For the sake of convenience let Lp(M):=L1/p(M)Lp(M):=L1/p(M), where MM is a measurable space.
Here pp can be an arbitrary complex number with a nonnegative real part.
Note that you don't need a measure on MM to define Lp(M)Lp(M).
In particular, L0L0 is the space of all bounded functions (i.e., the WW-algebra itself),
L1L1 is the space of finite complex-valued measures (the dual of
L0L0 in the sigmasigma-weak topology), and L1/2L1/2 is the Hilbert space of half-densities.
I will also talk about extended positive part E+LpE+Lp of LpLp for real pp.
In particular, E+L1E+L1 is the space of all (not necessarily finite) positive measures.



Pushforward for LpLp spaces: Suppose we have a morphism of measurable spaces MtoNMtoN.
If p=1p=1, then we have a canonical map L1(M)toL1(N)L1(M)toL1(N), which just the dual of L0(N)toL0(M)L0(N)toL0(M)
in the sigmasigma-weak topology. Geometrically, this is the fiberwise integration map.
If pneq1pneq1, then we only have a pushforward map of the extended positive parts:
E+Lp(M)toE+Lp(N)E+Lp(M)toE+Lp(N), which is non-additive unless p=1p=1.
Geometrically, this is the fiberwise LpLp norm.
Thus L1L1 is a functor from the category of measurable spaces to the category of Banach spaces
and E+LpE+Lp is a functor to the category of "positive homogeneous pp-cones".
The pushforward map preserves the trace on L1L1 and hence
sends a probability measure to a probability measure.



To define pullback of LpLp spaces (in particular, L1L1 spaces) one needs to pass to a different
category of measurable spaces.
In algebraic language, if we have two WW-algebras AA and BB, then
a morphism from AA to BB is a usual morphism of WW-algebras f:AtoBf:AtoB
together with an operator valued weight T:E+(B)E+(A)T:E+(B)E+(A) associated to ff.
Here E+(A)E+(A) denotes the extended positive part of A (think of positive functions on mathrmSpecAmathrmSpecA
that can take infinite values).
Geometrically, this is a morphism mathrmSpecf:mathrmSpecBtomathrmSpecAmathrmSpecf:mathrmSpecBtomathrmSpecA between the corresponding
measurable spaces and a choice of measure on each fiber of mathrmSpecfmathrmSpecf.
Now we have a canonical additive map E+Lp(mathrmSpecA)toE+Lp(mathrmSpecB)E+Lp(mathrmSpecA)toE+Lp(mathrmSpecB),
which makes E+LpE+Lp into a contravariant functor from the category of measurable spaces
equipped with a fiberwise measure to the category of “positive homogeneous additive cones”.



If we want to have a pullback of LpLp spaces themselves and not just their extended positive parts,
we need to replace operator valued weights in the above definition
by finite complex-valued operator valued weights T:BtoAT:BtoA (think of fiberwise complex-valued measure).
Then LpLp becomes a functor from the category of measurable spaces to the category
of Banach spaces (if the real part of pp is at most 11) or quasi-Banach spaces (if the real part of pp is greater than 11).
Here pp is an arbitrary complex number with a nonnegative real part.
Notice that for p=0p=0 we get the original map f:AtoBf:AtoB and in this (and only this) case we don't need TT.



Finally, if we restrict ourselves to an even smaller subcategory of measurable
spaces equipped with a finite operator valued weight T such that T(1)=1T(1)=1
(i.e., T is a conditional expectation; think of fiberwise probability measure), then the pullback map preserves
the trace on L1L1 and in this case the pullback of a probability measure is a probability measure.



There is also a smooth analog of the theory described above: The category of measurable spaces
and their morphisms is replaced by the category of smooth manifolds and submersions,
LpLp spaces are replaced by bundles of pp-densities, operator valued weights are
replaced by sections of the bundle of relative 11-densities,
integration map on 11-densities is defined via Poincaré duality (to avoid circular dependence
on measure theory) etc.
The forgetful functor that sends a smooth manifold to its underlying measurable space
commutes with everything and preserves everything.

No comments:

Post a Comment