Locally maximizing the Rényi entropies

As I was rewriting my website, I found some visualizations I had stored on my old website to show a collaborator, and I figured it was worth writing a little to have a more proper place to put them; hence this post 😊.

Probability distributions on three letters consist of just three non-negative numbers which add up to 1, which we can view as a vector in $\mathbb{R}^3$. The set of all such distributions forms a simplex, which looks like a 2D triangle lying in $\mathbb{R}^3$:

The simplex of probability distributions on three letters from two perspectives.

We can parametrize such distributions by just their $x$- and $y$-coordinates, since their $z$-coordinate is given by $z=1-x-y$. This allows us to plot, in $\mathbb{R}^3$, functions $f$ defined on the set of probability distributions on three letters: for each valid choice of $(x,y)$ coordinates, we plot the number $f(x,y,1-x-y)$ at the point $(x,y)$.

One particular function of probability distributions that I’m interested in is the $\alpha$-Rényi entropy. For probability distributions on three letters, it’s given by $H_\alpha(\vec p) = \frac{1}{1-\alpha}\log(x^\alpha + y^\alpha + z^\alpha)$, where $\vec p = (x,y,z)$ and $\alpha \in (0,1)\cup(1,\infty)$ is a parameter.

From this function, we can define another function of probability distributions, $\Delta_\varepsilon(\vec p) = \max_{\vec q \in B_\varepsilon(\vec p)} H_\alpha(\vec q) - H_\alpha(\vec p)$, where $B_\varepsilon(\vec p)$ is called the $\varepsilon$-ball around $\vec p$, and consists of all probability distributions which are $\varepsilon$-close to $\vec p$ in total variation distance. In other words, $\Delta_\varepsilon(\vec p)$ measures how much the entropy can increase when $\vec p$ is perturbed by at most $\varepsilon$. For example, if $\vec r = (0.21, 0.24, 0.55)$, then $B_\varepsilon(\vec r)$ is given by the filled purple hexagon in Figure 2:


$B_\varepsilon(\vec r)$ is the purple hexagon.
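To make these definitions concrete, here is a minimal Python sketch of the three ingredients so far: the Rényi entropy, the total variation distance, and a grid of points on the simplex parametrized by $(x,y)$. The function names are my own for illustration, and I'm assuming the natural logarithm and the convention that total variation distance is half the $\ell_1$ distance; neither is pinned down above.

```python
import math

def renyi_entropy(p, alpha):
    """H_alpha(p) = log(sum_i p_i^alpha) / (1 - alpha).

    Assumption: natural logarithm (the base is not specified in the post).
    """
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must lie in (0, 1) or (1, infinity)")
    return math.log(sum(t ** alpha for t in p if t > 0)) / (1 - alpha)

def tv_distance(p, q):
    """Total variation distance, taken here as half the l1 distance."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

def simplex_grid(n):
    """Yield distributions (x, y, 1 - x - y) with x and y multiples of 1/n."""
    for i in range(n + 1):
        for j in range(n + 1 - i):
            yield (i / n, j / n, (n - i - j) / n)

r = (0.21, 0.24, 0.55)
q = (0.26, 0.24, 0.50)
print(renyi_entropy(r, 2.0))  # entropy of the example distribution
print(tv_distance(r, q))      # q lies in B_eps(r) whenever eps is at least this
```

As a sanity check, the uniform distribution $(1/3, 1/3, 1/3)$ has $H_\alpha = \log 3$ for every valid $\alpha$, since $\frac{1}{1-\alpha}\log(3 \cdot 3^{-\alpha}) = \log 3$.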

It turns out that this maximum is achieved at a unique point (see arxiv/1706.02212); for the example above, it’s shown here:


The maximizer of $H_\alpha$ over the ball $B_\varepsilon(\vec r)$ is the unlabelled black point at the bottom of the hexagon.
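If you'd like to check this numerically without the results of the paper, a crude brute-force search is enough on three letters. The sketch below is not the method from arxiv/1706.02212: it simply scans a grid of distributions, keeps those inside the ball, and records the one with the largest entropy. The name `delta_eps_grid`, the grid size `n`, and the choice $\varepsilon = 0.1$ for this example are my own assumptions, and since only grid points are searched, the result is a lower bound on $\Delta_\varepsilon$.

```python
import math

def renyi_entropy(p, alpha):
    # Assumption: natural logarithm.
    return math.log(sum(t ** alpha for t in p if t > 0)) / (1 - alpha)

def tv_distance(p, q):
    # Total variation distance as half the l1 distance.
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))

def delta_eps_grid(p, eps, alpha, n=200):
    """Approximate Delta_eps(p) from below by searching a grid of spacing 1/n.

    Returns (entropy gain, best grid point found). This is a brute-force
    illustration, not the closed-form maximizer.
    """
    h_p = renyi_entropy(p, alpha)
    best_q, best_h = p, h_p
    for i in range(n + 1):
        for j in range(n + 1 - i):
            q = (i / n, j / n, (n - i - j) / n)
            # Small tolerance so boundary points survive floating-point error.
            if tv_distance(p, q) <= eps + 1e-12:
                h_q = renyi_entropy(q, alpha)
                if h_q > best_h:
                    best_q, best_h = q, h_q
    return best_h - h_p, best_q

r = (0.21, 0.24, 0.55)
gain, q_star = delta_eps_grid(r, eps=0.1, alpha=2.0)
print(gain, q_star)
```

Refining the grid (larger `n`) tightens the lower bound, at a cost that grows quadratically in `n`.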

We can also write down an explicit form for the maximizer. This allows us to plot the value of $\Delta_\varepsilon$ as it varies over the set of probability distributions, for a given $\varepsilon$ and $\alpha$. The quantity $\Delta_\varepsilon$ is useful for proving continuity bounds (see arxiv/1707.04249). I’ve included some plots of it below.

$\Delta_\varepsilon$ with $\varepsilon = 0.1$, for the $\alpha$-Rényi entropy with $\alpha = 0.5$.

$\Delta_\varepsilon$ with $\varepsilon = 0.1$, for the $\alpha$-Rényi entropy with $\alpha = 1.5$.

$\Delta_\varepsilon$ with $\varepsilon = 0.1$, for the $\alpha$-Rényi entropy with $\alpha = 2.0$.

$\Delta_\varepsilon$ with $\varepsilon = 0.1$, for the $\alpha$-Rényi entropy with $\alpha = 3.0$.


August 25, 2018