1. Introduction
Tezos is an open-source, peer-to-peer blockchain platform founded in 2018. Since its inception, Tezos has attracted much attention [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10] for a variety of reasons, the main one of which is its consensus protocol. In this paper, we consider the protocol version called Emmy*, recently replaced by Tenderbake. Emmy* is based on Delegated Proof-of-Stake (PoS) [
2], in which the stake is represented by the number of so-called
rolls, where a roll corresponds to
units of the Tezos currency, named
Users owning/managing rolls are called
delegates, who can be selected by the platform either to
bake (propose) or to
endorse (validate) new blocks or to do both. Baking/endorsing is rewarding for a user but can only be accomplished by depositing, for a certain number of validated blocks, a sum of
for security. In the case of double baking/endorsing, the security deposit will be slashed by the platform.
Those users who own less than , or do not want to run a costly full node to bake/endorse blocks, may delegate their units to a user operating a full node, who could then temporarily increase her/his number of rolls. Upon successful baking/endorsing, the rewards will be shared between the delegating and delegated nodes, typically according to a proportion based on their number of rolls. For this reason, delegation operates in a way that is akin, though not identical, to mining pools in Bitcoin. Indeed, the main difference is the following. While in Bitcoin, when miners join their computational power into mining pools, the total hashing power is basically the sum of the individual hashing powers, in Tezos with Emmy*, the success probability with delegation will typically be larger than the sum of the individual success probabilities without delegation. That is, since the probability of being selected for baking/endorsing blocks in Tezos is based on the number of rolls, joining individual stakes through delegation will typically induce a super-additivity property of the success probability. In fact, taking as given the total number of rolls for a user, the selection (success) probability is a step function of the stake, namely, a function that increases only for multiples of .
This paper aims to investigate some main economic issues related to Tezos under the Emmy* version of the protocol. To our knowledge, this is the first such contribution. In particular, we are interested in discussing how the optimal number of rolls is determined by the nodes, how baking/endorsing rewards can be shared with delegation, and whether the possibility of delegation would induce the emergence of a market for such services. The analysis will proceed gradually, with the paper being structured as follows. In
Section 2, we discuss single-block baking priorities, while in
Section 3, we discuss endorsement priorities, both without delegation. In
Section 4 and
Section 5, we define a user’s expected revenue for a single block and a cycle of blocks, respectively. In
Section 6, we introduce a user’s preferences to investigate her/his optimal stake, that is, her/his optimal number of rolls. Delegation is analysed in
Section 7, where the
super-additivity property of the selection probability of being a baker/endorser is identified.
Section 8 presents some economic fundamentals of the market for delegation services, while
Section 9 concludes the paper.
2. Baking Priorities for a Single Block with No Delegation
In this section, we consider the fundamental elements of how priorities for baking blocks are established without delegation.
Suppose is the number of rolls in the community at some date, with one roll corresponding to Tz, and is the number of rolls owned by a generic user. We assume that rolls represent the user’s stake.
Rolls are randomly drawn according to a sampling-with-replacement scheme to establish the priority, , with which users owning the rolls bake (propose) the next block. Therefore, the same roll can be drawn more than once. Priority is the highest priority, is the second highest, and so on. The user owning the roll with priority has the right to bake the next block. If, for some reason, she/he declines, then the right to bake shifts to the roll with the second highest priority and so on. The probability that a generic user will be selected in a single draw is , so is the probability of not being selected in a single draw. Therefore, the user with the highest number of rolls, i.e., the largest stake, has the highest probability of being selected, which, of course, is the main incentive introduced by Tezos to induce users’ stakes.
The reward depends on the priority of the baker (roll) and on the number of endorsements () received by the block. Double baking, as well as double endorsement, is punished by the platform.
The reward for baking
is defined as follows:
where
and
. Thus, the reward for baking a block is a linear function of the number of endorsements
received by the block, whose slope varies according to the roll’s priority. From (1), it follows that the only difference is between the highest priority and any other priority. Since the number of selected endorsers is
, the maximum reward for baking a block is
Considering a
sampling-with-replacement scheme for the selection of baking priorities, the general expression for a user’s probability of baking the next block
is given by
where
is the probability that a user with higher priority will not bake, for whatever reason, the next block. Equation (2) assumes that users may not bake a block independently of each other. Admittedly, this may be a simplification, as well as assuming that
is the same for each user. However, as the first step in the analysis, we find this to be acceptable. It is worth noticing that as
tends to
,
tends to
. That is, when delegates tend to bake blocks, then basically, the only possibility for a user to bake a block is to have the highest priority,
. Likewise, as
tends to
,
tends to
. That is, if users with higher priorities never bake a block, the probability that a generic user will bake the next block is maximised, and
as
becomes very large.
From an operational perspective, the value of can be estimated from the data by observing the frequency with which users do not bake.
Some comments on the above expression are in order. First, observe that if, assuming users’ symmetry,
is the probability with which
any user declines to bake the next block, then (2) should become
which is corrected by a multiplyng factor
, since the user behind Equation (2) may not bake with some probability. However, without much loss of generality, we decided to simplify the expression by assuming that
is the probability with which each user,
except for the generic one, refuses to bake.
Additionally, notice that the only exogenous quantity in (2) is , and it may be interesting to investigate how should vary with to keep unaltered.
Clearly, for
,
is increasing in
, that is,
Moreover, for , and for . However, how would change with is less immediate, since it depends on the behaviour of as varies. Indeed, when changes, there could be two main scenarios. In the first scenario, the total number of rolls remains unaltered, which implies that, as changes, some other user’s number of rolls also has to change. Alternatively, in the second scenario, varies by the same amount of .
- (a)
is given and independent of
In this case,
independently of the value of
.
- (b)
is a function of and
When
is positively related to
, then the first derivative of
with respect to
is given by
which is again positive. Hence, unsurprisingly, also in this case
would maximise the probability of baking the next block, regardless of the value of
Therefore, if is the total differential of , we have . Posing , it follows that, consistent with intuition, it is ; that is, as increases, decreases for to remain constant.
3. Endorsement Priorities for a Single Block without Delegation
In addition to eligibility for baking blocks, a user in Tezos with Emmy* can be eligible for endorsing a block, again, as long as she/he has at least one roll. The reward
for a single-block endorsement is given by
where
, and
is the number of endorsing slots of the user.
As in the case of baking, endorsers are drawn according to a sampling-with-replacement scheme, so, in a single draw, the probability that a user will be selected is again . Therefore, since there are slots available to assign an endorsement, the probability that a user will obtain slots is binomial and given by with .
Interestingly, notice that with priority , it is ; that is, the reward for a single endorsement is larger than the reward received for baking a block. Since and should be interpreted as economic incentives for baking and endorsing blocks, respectively, the above inequality means that Tezos decided to provide a stronger incentive for endorsing than for baking.
Finally, it is worth observing that although and share similarities, being per endorsement rewards, they also exhibit some differences, since refers to endorsements received by a baker for proposing a block, while refers to endorsements assigned by a user to a baked block. Hence, while is an acknowledgement a user receives when acting as a baker, is an acknowledgement that a user assigns to a baker.
In the Emmy* version of Tezos, a user can not only be a baker, as well as an endorser, but also endorse a block that she/he her-/himself baked. Indeed, with endorsement slots, the potential conflict of interest is assumed to be diluted by the large number of endorsing slots, and the related probability is negligible. Indeed, as we shall see, bakers and endorsers may or may not overlap; likewise, a user may or may not appear as an endorser more than once for the same block.
Baking and Endorsing Joint Probability for a Single Block
The selection procedure for baking is independent of the selection procedure for endorsing. Hence, given the previous considerations, the joint probability of being selected with priority
for baking a block and of being assigned
slots for endorsements is
4. Expected Revenue for a Single Block
The analysis conducted so far allows us to define and compute the user’s expected revenue
related to the baking priority as well as to the number of endorsements. Before doing so notice that, from (2), it follows that the probability of the highest level of priority for baking in a single draw,
, is
, while the probability of any other priority,
, is
Therefore, a user’s expected revenue related to a single block, obtained from both baking and endorsing activities, is given by the following expression:
which is equal to
where
is the probability that the next block will receive
endorsements.
The closed form of (7) requires the explicit expression of , which, in turn, is likely to depend upon the baking priority. From an operational point of view, can be estimated from the data by replacing with the observed frequencies for each number of endorsements, . Moreover, notice that is a user’s expected reward when acting as an endorser under the simplifying assumption that the user will endorse the block with all of his/her slots. Indeed, more realistically, the model should contemplate the possibility that a block will not be endorsed because of unacceptable transactions, etc. However, as a first approximation, we deem our assumption to be acceptable, and, in any case, the value obtained can always be considered as an upper bound to the user’s revenues when all of his/her endorsement slots have been used.
Finally, we also observe that, from an analytical perspective, should be written more appropriately as , where, for a given we assume that increases with . Moreover, we should also assume that for a given , the probability decreases with . That is, the larger the block priority, that is the smaller the , the larger the probability of obtaining a high number of endorsements.
Defining the expected number of endorsements received by a baked block as
,
can finally be written as
Consistent with intuition, as gets close to , Equation (8) tends to take its largest possible value, namely, , while as becomes close to .
6. Optimal Stake and Number of Rolls in a Cycle
Based on the previous analysis, in this section, we start modelling how a user may optimally determine her/his stake and number of rolls for baking and endorsing. To proceed gradually, we shall assume that every node is a full node, deferring the discussion on delegation until later.
To investigate the issue, we first introduce what we believe to be a fundamental economic trade-off characterizing users of Proof-of-Stake (PoS)-based blockchains. On the one hand, the stake increases the probability of being selected as a baker/endorser and thus obtaining future rewards in units. On the other hand, staking prevents the immediate usage of one’s currency units and the possibility of implementing monetary transactions, which may be beneficial for the user. For this reason, staking may produce a disutility to the user.
To simplify the exposition, with no major loss of generality, unless otherwise indicated in what follows, we assume the stake to coincide with the number of rolls, which, moreover, will be treated as a continuous variable. Additionally, to simplify, we still omit a budget constraint for the users.
An initial, very simple step to embody such trade-off into a preference specification for generic user
, with
, where
is the number of platform users, is to introduce the user’s utility function
, which we define below in Equation (9), where
is the number of rolls chosen by agent
. Hence
, and we assume that user
, for all
at the beginning of each cycle, solves the following problem:
where
is user
’s discount factor related to one cycle of blocks, which quantifies her/his intertemporal preferences. Moreover,
is the per
single-cycle user’s cost due to security deposits, expressed as a function of
, and
is the cost of running a full node as a baker/endorser, which we assume to be constant and independent of
A few comments are in order.
The first term of (9) formalises the present value, before a cycle starts, of the utility-enhancing side in the above trade-off. The prospect of obtaining a reward in a cycle indeed represents an attractive incentive for a user to set up a stake and become a baker/endorser.
The second term, , is a simple representation of the compounded disutility due to keeping rolls as a stake, which is hence unused for five cycles, as specified in Emmy*. Note that such disutility increases with the time duration of the security deposit, and this is why the exponent of the discount factor is negative. In particular, we interpret as the single-cycle cost for the user induced by a stake of rolls.
Since
and
are the security deposits for each block baked and each endorsement assigned to a block, respectively, the simplest expression for
can be given by:
Therefore,
where
is a constant, and
, defined as in (2), is a function of
. Indeed, to emphasise this, we write
. It follows that (9) can now be rewritten as
Based on (12), we can proceed to user determination of the optimal in a game-theoretic framework. The reason that a setup based on strategic interaction can be suitable for modelling the optimal stake decision is due to the presence of Proof-of-Stake (PoS). Indeed, as we previously discussed, the probability of being drawn for baking/endorsing blocks typically depends on the ratio between one’s rolls and the total number of rolls . Hence, the optimal number of rolls chosen by a user may also depend on the number of rolls chosen by the other users.
The Staking Game
In this section, we consider the strategic interaction among agents and derive user
best reply correspondence
, where
is the profile of rolls chosen by
opponents. Following the above analysis and assuming, for simplicity,
, Equation (12) becomes
For any and , the following benchmark result holds:
Proposition 1. If all users have preferences as in (13), then there is a unique Nash Equilibrium of the game in strictly dominating strategies, given by Proof. It is easy to verify since
is a negatively sloped function with respect to
. Indeed, recalling that
and
, differentiating (13) with respect to
leads to
and because
, the result follows. □
Therefore, if all users’ preferences are represented by (12), with , the chain will not even start since none of them will stake any rolls. The main reason for this is due to the specification of preferences in (13), which are linear in as well as to the definition of the cost.
However, it is easy to verify that if the cost is defined by , with , then the Nash Equilibrium number rolls will, in general, differ from zero.
The above finding is interesting, as a reference, and suggests that not all utility functions can be linear in
for the chain to develop. A simple way to introduce alternative preferences is to consider the following modification of (12):
that is, with convex (quadratic) costs, where we still consider
,
and
.
Since user
best reply maximises (14), the first step towards finding it is to differentiate (14) with respect to
to obtain
Based on the above considerations, the first-order condition associated with Equation (15) becomes
Therefore, assuming the optimal stake solves (16), we obtain the following result:
Proposition 2. Suppose userhas preferences as in (14). Then, her/his best replysatisfies Proof. Immediate. Rearranging (16) leads to (17). □
Some comments are in order. First, notice that (17) is a proper probability expression since . Moreover, it is interesting to observe that the ratio in (17) increases in both and . That is, the more patient the user, the higher the expected number of endorsements for baked blocks, and the larger the proportion . Furthermore, it is easy to determine that the maximum value that Equation (17) can take when and is around , which means that if all users of the platform have preferences as in (14), then the money supply will be widely spread across them.
Furthermore, observe that the only element in (17) that differs across users is . Therefore, users with a larger discount factor (more patient) will have more rolls than users with a lower (less patient).
Finally, it is important to point out that (17) does not determine a unique value for
but rather the proportion
. This means that if all users have preferences as described by (14), the level of
will not be uniquely determined, and moreover, summing both sides of (17) with respect to
, the following equalities will be satisfied.
That is, in our simplified model, a user’s proportion of rolls is fully determined by the proportion of individuals’ discount rates.
7. Delegated Proof-of-Stake for a Cycle of Blocks
As previously said, one of the distinguishing features of Tezos is that, rather than running a full node to bake/endorse blocks, generic user may decide to delegate all or some of her/his units to another baking node, say , who would also bake/endorse on behalf. Delegation typically has the following main features:
- (i)
The costs of running a full node are paid by the delegated node only, if the delegating node is not a full node.
- (ii)
The probability of baking/endorsing exhibits a super-additivity property. That is, with delegation, the joint selection probability is at least as large as the sum of the selection probabilities without delegation.
- (iii)
The rewards obtained by the delegated node are shared with the delegating user proportionally to the number of their rolls.
Based on the above three points, it is natural to ask whether there is a benefit for a user to run a full node. Indeed, the delegating user pays no operating cost for running a full node and, moreover, may enjoy the advantage of obtaining some reward, even if it does not even have a single roll to stake individually. Intuition certainly suggests that there must be benefits in operating a full node because, alternatively, a scenario where nobody wants to run a full node may be envisaged, and the platform would not even operate.
To gain some insights on the above point, below, we discuss some simple cases. Consider the following example with two users and , whose money holdings and stakes are equal to , and , , respectively. Moreover, assume that runs a full node, while does not.
- (1)
Suppose
and
. Therefore, neither user
nor user
can bake a single roll since none of them individually has at least
. However, if
delegates
of at least
, then, jointly, the two nodes can reach at least
and potentially bake/endorse blocks, which separately they could not. Given this initial
symmetric situation, where neither of them can bake individually, the observed block/endorsing joint reward
, which is a random variable, is likely to be shared
exclusively according to the monetary sum at stake. Indeed, this may prevent one user from free riding on the other user, trying to convince him/her to stake as much money as possible. If
is the stake of user
and
is the stake of user
, then the share
of the reward, jointly obtained by the two users, will go to user
, while the rest goes to user
.
Hence, for this particular example, assuming
, the joint success probability is given by
where
stands for the integer number of
, while the sum of the individual success probabilities with no delegation would be
. Though very simple, the example immediately shows how delegation may induce
super-additivity on such probabilities. That is, the joint success probability
is larger than the sum of the individual success probabilities, which is
. Finally, notice that a similar argument could hold even if
were a full node; this is also true for the following considerations.
- (2)
Suppose now that
and
. In this case, the situation is slightly different, as compared to the previous point, since user
can stake one roll for baking/endorsing blocks, while user
alone still cannot. If
delegates at least
to
, then
can participate in the baking activity, which otherwise would be impossible. User
, even without the support of user
, in this case, can be selected with a success probability given by
However, if user
delegates at least
, then the success probability will increase to
Since the increase in the success probability will affect the reward, in this case, an agreement on how to share the joint revenues may be a more involved decision. A reasonable way to take into account that, with delegation, the expected revenue will increase could be the following. If , then the amount could go completely to user , since even with no delegation, she/he could have been selected to bake/endorse blocks.
Then, the difference between the observed joint reward with two rolls
and the expected revenue with one roll
can be divided according to the following criterion. The share
may be assigned to user
, while the rest is assigned to user
. That is, revenues will be distributed according to the additional stake contribution of the users with respect to what they could obtain individually. Clearly, the application of this criterion presupposes an agreement on the value of
. In our model, considering (8) and assuming, for simplicity,
, we obtain
which is easily computable by observing
and estimating
from the data.
Finally, if , then we can imagine more than one possibility. Either the entire observed revenue goes to user , or, alternatively for example, is shared according to the above formula used to distribute , or some other arrangement.
Therefore, if
, the expected revenue of user
with delegation
will be given by
that is, a convex combination, an average, of
and
, with weights given by
and
, respectively. In Paragraph (7.1), we discuss a similarity of (20) with some notable axiomatic bargaining solutions. Expression (20) is certainly larger than
, the expected revenue with no delegation. However, though an unlikely event, the possibility of
cannot be excluded if, with just one roll, user
was particularly lucky in being selected for several blocks and relatively unlucky with two rolls.
Finally, obviously, with delegation, user would also, in general, be better off than with no delegation.
- (c)
Suppose now that
and
; that is, both users can be selected separately for baking with two rolls and one roll, respectively. Moreover, in this case, if user
delegates user
with
, then two users, jointly, will obtain four rolls. Again, also in this case, delegation induces super-additivity in the success probabilities
and additional gains for both agents. However, as in the previous point, it is important to discuss how the revenues can be shared in this case. Following a reasoning analogous to the previous point, we can imagine that if
then they may agree on granting a reward of
to user
and a reward equal to
to user
. Additionally, user
will obtain the share
of the reward
, while the remaining share of it will go to user
.
However, (21) may not necessarily be satisfied, and the following can take place:
Hence, if (22) is the case, what could be a criterion for sharing User could claim that since the observed reward is not enough to cover the expected reward that she/he could obtain with no delegation, then she/he should receive the entire reward . However, user may claim that with no delegation, on average, she/he might have obtained and thus disagree with user claim.
In such circumstances, the observed
could perhaps be divided between the two users, assigning the share
to user
and the remaining share to user
. Alternatively, user
could obtain the share as in (22) or, perhaps, simply the share
Analogous reasoning could apply for
In the above example, we assume that user , the one running a full node, receives no specific benefit for running such a node. That is, rewards are shared with reference only to some relative stake criterion, with no concern for the fact that without user , user may not enjoy additional benefits.
One possibility to account for this may be to introduce a multiplying factor,
, as an additional weight to the stake of user
. For example, (22) could now become
and analogously, we would obtain, for example,
An additional possibility may be to introduce a weight, in this case,
, such as in (26) below:
A simple way to fix those two coefficients could be to define them, for instance, as
which would also be a possibility for the coefficients to account for the relative stakes.
Nash and Kalai–Smorodinsky Non-Cooperative Bargaining Solutions
Another approach to consider when discussing how users may share the rewards obtained with delegation is given by two main axiomatic bargaining models in economics. These are the Nash solution and the Kalai–Smorodinsky (KS) solution, where the
status quo is given by the users’ expected reward obtained without delegation. To see what such solutions suggest, we start by taking the first example of the previous paragraph. In this case, the status quo is zero for both users, since neither of them can bake without delegation. Hence, if
is the revenue obtained with delegation, which implies just one roll for the two of them, the Nash Bargaining solution can be obtained by solving the following problem:
and similarly for
, where
with
are the shares of the observed revenue
going to users
and
. It immediately follows that (27) is solved by
, which would also coincide with the KS solution. Indeed, differentiating
]
with respect to
leads to
, which, equalised to
, provides the result. This solution, however, by its very definition, does not take into consideration the different stakes between the two users.
Now, take the second example, assuming
. The status quo for user
in this case is
, while for user
, it is again zero. Following a similar procedure to that above, the Nash Bargaining solution can solve the problem below:
where
, from which we find that the solutions are given by
which, also in this case, coincide with the KS solutions. As mentioned above, it is easy to observe the similarity between (28) and (20), except that the former assigns a uniform weight to
and
, while the latter assigns a weight proportional to the users’ stakes.
To summarise, in the above discussion we considered just some possibilities to determine revenue shares and possible weights to account for the asymmetry in running a full node, but other criteria could certainly be considered.
8. Delegation Service Market
As a follow-up to the previous section, we now briefly discuss what a market for delegation services may look like. In what follows, we only provide a sketch of it. Indeed, such a competitive market may emerge quite naturally when non-full nodes are asking for delegation services, which a plurality of full nodes is willing to offer them to the other nodes. Although, in principle, they could receive delegated funds from any node, it is likely that most delegations would come from non-full nodes.
The very existence of a market and its configuration is crucially related to why and how delegating nodes may choose from among available full nodes. This is what we discuss below.
There may be several features making a full node attractive for delegation and consequently successful in the market. However, we believe that the following two features would play a major role in succeeding as delegation service providers.
The first is, broadly speaking, their trustworthiness, associated by any node to generic full node We assume it to be sufficiently well represented as a numerical indicator by some increasing function associated by generic node to full node , which depends on the total amount of delegated funds received by node . Of course, this may raise a few questions including, for example, how to define for the first cycle the node to which it is providing delegation services. One solution could be to assume that the initial reputation is an average of the existing nodes’ reputations or, alternatively, some other combination of them. A good reputation is certainly an attractive element, as much as a bad reputation may be a repulsive one, for those nodes that have decided to delegate their funds.
The second, as in the previous section, is the type of share agreements (fees) that full node
is paying to the delegating nodes in the case of positive rewards. More explicitly, suppose
is the funds of node
delegated to node
; therefore,
. In general, we can assume that
, where
is the share of node
revenue
, paid as a fee by node
to node
, as a function
of the delegated funds. How the sharing rules are chosen by full nodes requires some detailed analysis based on mechanism design, which will not be discussed here. Additionally, we also assume that
. In principle, the
function could take a variety of different forms, which would then represent an element of competition across alternative full nodes to attract delegated funds. A possible form of the share function
could be the following:
which is additive, with a fixed and a variable component
. The simplest example of (29) would be the constant (flat) function
for all
. A slightly more detailed version of (29) could be linear, that is,
and with
.
However, once
is announced by the node, there is no guarantee that
for all
. Therefore, in general, an operational version of (29) may be as follows:
where
is the maximum revenue share paid by the delegated node to the delegating ones. For example, suppose
. Finally,
and
therefore, if
then
Notice that for and , definition (30) becomes akin to the sharing rules discussed in the previous section.
At a high level, each delegating node must make two major decisions: which node(s) to delegate its funds to and the delegated amount of
Tz. Therefore, if
denotes node
benefit function provided by the level of trustworthiness, and the proposed sharing rule of node
, and
denotes the cost for node
of delegating
to node
, then node
would identify the optimal pair
by solving the problem:
For example, suppose
,
and
, with
, so that (31) becomes
Defining
and assuming, again, that
is the amount of money in
wallet, by differentiating (32) with respect to
, we obtain
and thus, the best
is given by
Equation (33) provides the best choice if node delegates node . Hence, to find the overall optimal , node will have to choose the full node that provides the highest utility.
Therefore, to summarise, depending upon the sharing rules proposed by full nodes, their reputation and the delegating nodes’ preferences, full nodes will be chosen, and the market for their services will emerge in the network. Finally, notice that since the optimal choice of depends on , that is, on the amount of funds delegated to node by all the nodes other than , the value of the selected may represent a Pure Strategy Nash Equilibrium of the model.