Rényi Entropy and Free Energy

The Rényi entropy is a generalization of the usual concept of entropy which depends on a parameter q. In fact, Rényi entropy is closely related to free energy. Suppose we start with a system in thermal equilibrium and then suddenly divide the temperature by q. Then the maximum amount of work the system can perform as it moves to equilibrium at the new temperature divided by the change in temperature equals the system’s Rényi entropy in its original state. This result applies to both classical and quantum systems. Mathematically, we can express this result as follows: the Rényi entropy of a system in thermal equilibrium is without the ‘q−1-derivative’ of its free energy with respect to the temperature. This shows that Rényi entropy is a q-deformation of the usual concept of entropy.


Introduction
In 1960, Rényi [1] defined a generalization of Shannon entropy which depends on a parameter. If p is a probability distribution on a finite set, its Rényi entropy of order q is defined to be S q = 1 1 − q ln ∑ i p q i (1) where 0 < q < ∞. Of course, we need q = 1 to avoid dividing by zero, but L'Hôpital's rule shows that the Rényi entropy approaches the Shannon entropy as q approaches one: Thus, it is customary to define S 1 to be the Shannon entropy.
While Shannon entropy has a deep relation to thermodynamics, Rényi entropy has not been completely integrated into this subject, or at least not in a well-recognized way. While many researchers have tried to modify statistical mechanics by changing the usual formula for entropy, so far, the most convincing uses of Rényi entropy in physics seem to involve the limiting cases S 0 = lim q→0 S q and S ∞ = lim q→+∞ S q . These are known as the 'max-entropy' and 'min-entropy', respectively, since S q is a decreasing function of q. They show up in studies on the work value of information [2] and the thermodynamic meaning of negative entropy [3]. For other interpretations of Rényi entropy, see the works of Harremöes [4], König et al. [5], and Uffink [6].
In fact, it is not necessary to modify the statistical mechanics to find a natural role for Rényi entropy in physics. Rényi entropy is closely related to the familiar concept of free energy,with the parameter q appearing as a ratio of the temperatures.
The trick is to think of the probability distribution as a Gibbs state, or the state of thermal equilibrium for some Hamiltonian at some chosen temperature, say T 0 . Suppose that all the probabilities p i are nonzero. Then, when working in units where Boltzmann's constant equals one, we can write p i = e −E i /T 0 for some nonnegative real numbers E i . If we think of these numbers as the energies of microstates of some physical system, the Gibbs state of this system at temperature T is the probability distribution where Z is the partition function: Since Z(T 0 ) = 1, the Gibbs state reduces to our original probability distribution p at this temperature. Starting from these assumptions, the free energy is related to the Rényi entropy as follows: The proof is an easy calculation: This works for T = T 0 , but we can use L'Hôpital's rule to show that in the limit T → T 0 , both sides converge to the Shannon entropy S 1 of the original probability distribution p.
After the author noticed this result in the special case T 0 = 1 [7], Stacey commented that this case was already mentioned in Beck and Schlögl's 1995 text on the thermodynamics of chaotic systems [8]. However, most people using Rényi entropy were unaware of its connection to free energy, perhaps because they work on statistical inference rather than physics [9]. Thus, the author put a version of this note on the arXiv in 2011 [10]. It has subsequently been cited 77 times, which suggests that it was indeed useful. We have therefore decided to publish it.
Shortly after the first draft of this note was released, Polettini gave a nice physical intepretation of Rényi entropy [11]. Downes then made a further generalization [12]. We explain those ideas here. The above argument concerns a system with Gibbs state p i = exp(−E i /T 0 ) at a chosen temperature T 0 . Such a system automatically has zero free energy at this chosen temperature. Downes generalized the relation between Rényi entropy and free energy to systems whose free energy is not constrained this way. Polettini's physical interpretation of Rényi entropy can be extended to these more general systems, and we describe this interpretation in what follows. We also explain how the Rényi entropy is a 'q-deformation' of the ordinary notion of entropy. This complements the work of Abe on another generalization of entropy: the Tsallis entropy [13].
In what follows, we work in a quantum rather than classical context, using a density matrix instead of a probability distribution. However, we can diagonalize any density matrix, and then its diagonal entries define a probability distribution. Thus, all our results apply to classical as well as quantum systems. The quantum generalization of Shannon entropy is, of course, well-known: it is the von Neumann entropy. The quantum generalization of Rényi entropy is also already known [14].

Rényi Entropy as a q-Derivative of Free Energy
Let H be a self-adjoint complex matrix. Thinking of H as the Hamiltonian of a quantum system, and no longer assuming that Boltzmann's constant k equals 1, we may define the Gibbs state of this system at temperature T to be the density matrix where the partition function Z(T) = tr(e −E/kT ) (4) ensures that tr(ρ T ) = 1. The Helmholtz free energy at temperature T is defined by On the other hand, for any density matrix ρ, the quantum generalization of Rényi entropy is defined by since this formula reduces to the usual definition of Rényi entropy, Equation (1), when the probabilities p i are the eigenvalues of ρ and we set k = 1. This formula makes sense when 0 < q < ∞ and q = 1, but we can define the quantum Rényi entropy as a limit in the special cases q = 0, 1, +∞. For q = 1, this gives the usual von Neumann entropy: Returning to our system with Gibbs state ρ T at temperature T, let us write S q (T) for S q (ρ T ). By computing this Rényi entropy at some temperature T 0 , we find If we define a new temperature T with we obtain or, in short, the following: This equation is the clearest way to express the relation between Rényi entropy and free energy. In the special case where the free energy vanishes at temperature T 0 , it reduces to Equation (2). In the limit T → T 0 , it reduces to Of course, it is already well-known that the von Neumann entropy is the derivative of −F with respect to the temperature. What we see now is that the Rényi entropy is the difference quotient approximating this derivative. Instead of the slope of the tangent line, it is the slope of the secant line.
In fact, we can say a bit more: the Rényi entropy is the the 'q −1 -derivative' of the negative free energy. For q = 1, the q-derivative of a function f is defined by This reduces to the ordinary derivative in the limit q → 1. The q −1 -derivative is defined the same way but with q −1 replacing q. Equation (9) can be rewritten more tersely using this concept: Here we have made a change of variables, writing T for the variable called T 0 in Equation (9). The concept of the q-derivative shows up in mathematics whenever we 'q-deform' familiar structures, obtaining new ones such as quantum groups. For an introduction, see the text by Cheung and Kac [15]. In some cases, q-deformation should be thought of as quantization, with q playing the role of exp(h). That is definitely not the case here: the parameter q in our formulas is unrelated to Planck's constanth. Indeed, Equation (11) holds in classical as well as quantum mechanics.
What, then, is the thermodynamic meaning of Rényi entropy? This was pointed out by Polettini [11]. Start with a physical system in thermal equilibrium at some temperature. Then 'quench' it, suddenly dividing the temperature by q. The maximum amount of work the system can perform as it moves to thermal equilibrium at the new temperature divided by the change in temperature equals the system's Rényi entropy of the order q in its original state. Note that this formulation even accounts for the minus sign in Equation (9), because it speaks of the work the system performs rather than the work performed with it.
Funding: This research received no external funding.