References

Texbooks


Mon Aug 30
A classical reference on the concentration of well-behaved functions of independent random variables is References for improved Chernoff bounds for Bernoulli random variables: Improvement of Hoeffding's inequality by Berend and Kantorivich: The original proof of Hoeffding inequality is here. Compare to the modern, slick proof in Lemma 2.2 of [BLM].

Wed Sep 1
For an example of the improvement afforded by Bernstein versus Hoeffding, see Theorem 7.1 of available here. By the way, this is an excellent book.

Wed Sep 8
The book Concentration Inequalities for Sums and Martingales by B. Bercu, B. Delyon and E. Rio (2015) contains a many sharp calculations and bounds.
References on the JL Lemma:

Mon Sep 13, 15 and 20
The book Metric Characterization of Random Variables and Random Processes (2000) V. V. Buldygin and Yu. V. Kozachenko, provides a great deal of information about sub-Gaussian and sub-Exponential variables (as well as random processes) and Orlicz norms.

To see the equivalent characterizations of sub-Gaussian and sub-exponential random variables, see Theorems 2.5.2 and 2.7.1 in Vershynin's book. In particular, Them 2.8.1. therein is yet another version of Bernstein inequality using Orlicz norm.

The Bernstein-Orlicz norm was introduced in the paper The Bernstein-Orlicz norm and deviation inequalities, by S. van de Geer and J. Lederer. See also the follow up paper by J. Wellner, The Bennett-Orlicz norm.

The properties of sub-Weibull random variables are described in the papers:

Mon Sep 29
The use of Donsker-Varadhan variational formula for the KL divergence in adaptive data analysis was introduced in the paper: Follow-up contributions mentioned in class: These arguments were further developed to yield tighter generalization bounds based on chaining mutual information here: To see the effect of data adaptivity on something as simple as the sample mean, check out