As I was rewriting my website, I found some visualizations I had stored on my old website to show a collaborator, and I figured it was worth writing a little to have a more proper place to put them; hence this post 😊.
Probability distributions on three letters consist just of three non-negative numbers which add up to 1, which we can see as a vector in . The set of all such distributions form a simplex, which looks like a 2D triangle laying in :
The simplex of probability distributions on three letters from two perspectives.
We can parametrize such distributions just by their - and -coordinates, since their -coordinate is given by . This allows us to plot functions that vary over the set of probability distributions on three letters in : for each valid choice of coordinates, we plot the number at the point . One particular function of probability distributions that I’m interested in is the -Rényi entropy. When considering probability distributions on three letters, it’s given by where , and is a parameter. From this function, we can define another function of probability measures, where is called the -ball around , and consists of all probability measures which are -close to in total variation distance. For example if , then is given by the filled purple hexagon in Figure 2:
is the purple hexagon.
It turns out that this maximum is achieved at one unique pointSee arxiv/1706.02212
; for the case before, it’s shown here:
The maximizer of over the ball is the unlabelled black point at the bottom of the hexagon.
and we can write down a form for the maximizer. This allows us to plot the value of as it varies over the set of probability distributions, for a given and . The quantity is useful for proving continuity boundsSee arxiv/1707.04249
. I’ve included some of these plots of it below.
with , for the -Rényi entropy with .
with , for the -Rényi entropy with .
with , for the -Rényi entropy with .
with , for the -Rényi entropy with .