Systematic Analysis of the Non-extensive Statistical Approach in High Energy Particle Collisions - Experiment vs. Theory

The analysis of high-energy particle collisions is an excellent testbed for the non-extensive statistical approach. In these reactions we are far from the thermodynamical limit. In small colliding systems, such as electron-positron or nuclear collisions, the number of particles is several orders of magnitude smaller than the Avogadro number; therefore, finite-size and fluctuation effects strongly influence the final-state one-particle energy distributions. Due to the simple characterization, the description of the identified hadron spectra with the Boltzmann-Gibbs thermodynamical approach is insufficient. These spectra can be described very well with Tsallis-Pareto distributions instead, derived from non-extensive thermodynamics. Using the $q$-entropy formula, we interpret the microscopic physics in terms of the Tsallis $q$ and $T$ parameters. In this paper we give a view on these parameters, analyzing identified hadron spectra from recent years in a wide center-of-mass energy range. We demonstrate that the fitted Tsallis-parameters show dependency on the center-of-mass energy and particle species (mass). Our findings are described well by a QCD (Quantum Chromodynamics) inspired parton evolution ansatz. Based on this comprehensive study, apart from the evolution, both mesonic and baryonic components found to be non-extensive ($q>1$), besides the mass ordered hierarchy observed in the parameter $T$. We also study and compare in details the theory-obtained parameters for the case of PYTHIA8 Monte Carlo Generator, perturbative QCD and quark coalescence models.


Introduction
In Nature we often meet phenomena with a large number of variables where the few-body approach breaks down. In these cases the standard procedure is to apply tools from statistical physics and inspect thermodynamical quantities of the system, instead of treating all degrees of freedom one-by-one. A certain generalization of the standard Boltzmann-Gibbs entropy is promoted by Constantino Tsallis, introducing the q-entropy formula, central to non-extensive statistical theory [1,2]. Despite its unconventional form, in the last two decades the Tsallis-entropy was found to be a very general and descriptive notion. Numerous physical observations were successfully explained using non-extensive statistical physics [3][4][5][6][7]. From our perspective the field of high-energy physics is especially important, since that community uses efficiently these tools to describe the results of high-energy particle and nuclear collisions. It is an experimental finding that the distributions derived from Tsallis-entropy fit the spectrum of high-energy particles, produced by many systems starting from electron-positron collisions up to the cosmic rays. In this paper we focus on identified hadron spectra, measured in proton-proton collisions. We put emphasis on the investigation of the center-of-mass energy ( √ s) dependence of the Tsallis parameters q and T, assuming a Quantum Chromodynamics (QCD) inspired logarithmic scaling of these parameters. We use units in whichh = c = k B = 1.
The outline of the paper is the following: in the next section we enlist our motivation from high energy physics and the goals of our analysis. In Section 3 we briefly introduce the mathematical apparatus we used during our investigation. In Section 4 we show experimental results, while in Section 5 we compare them to state-of-the-art theoretical models. Finally, in Section 6 we summarize our work and give a discussion, including our future plans.

Connection with High Energy Physics
One of the main goals in high-energy heavy-ion physics is to understand the properties of the so-called Quark Gluon Plasma (QGP), a particular form of the strongly interacting matter which existed shortly after the Big Bang. With today's high-energy particle accelerators we are able to reach the energy range where this superdense matter of the early Universe can be formed for a short, O(fm/c)∼ 10 −23 s time. The properties of the QGP can be studied in ultra-relativistic heavy-ion collisions indirectly. Due to the nature of the strong interaction there is no way for direct observation, only signatures stemming from the final state allow us to draw conclusions. On the other hand, the reactions occur during a very short time and our information about their nature is very limited. This is a strong restraint in our possibilities, especially we cannot treat properly the description at the microscopical level. Nevertheless it is essential to understand the processes in proton-proton collisions, the baseline for heavy-ion measurements.
To date we still do not have a well established, detailed, and throughout probed theory of the hadronization, the process where the color degrees of freedom confine into hadrons. This is related to the Yang-Mills Mass Gap, one of the so-called "Millennium problems" of the Clay Mathematical Institute [8]. Recent hadronization models are phenomenological, and it is quite typical that their parameters lack of any clear physical meaning.
Recently, complex detector systems, like ALICE at the Large Hadron Collider (CERN LHC) or STAR and PHENIX at the Relativistic Heavy Ion Collider (BNL RHIC) are able to measure with high accuracy the final state particles. The hadron spectra, measured in high-energy collisions, are one of the most fundamental characteristics of these events and involve both microscopic and collective effects in high-energy collisions. Identifying their most crucial problems is a key task for understanding hadronization. A remarkable phenomenon is that these properties occur not only in heavy-ion collisions, but even for small colliding systems like proton-proton or electron-positron collisions [9][10][11][12][13].
The aim of our study is to find the common source of these similarities and to recognize the driving mechanisms behind the observations. For this aim, we built a consistent non-extensive approach, in which fit parameters carry important physical information about the observed system of high-energy and strongly interacting particles including reactions and collective effects among them.

Non-Extensive Statistics in High-Energy Physics
High-energy physics, and in particular high-energy heavy ion physics is an interdisciplinary topic. It uses the theory of relativistic quantum fields, statistical physics, thermo-and hydrodynamics, and even the theory of curved space-times. Earlier studies show that non-extensive statistical physics provides a useful tool to describe particle-particle collisions, where "particle" now stands either for electron/positron, or for proton or a heavy nucleus. The non-extensivity in high-energy physics manifests itself both in the non-exponential energy-and non-Poissonian multiplicity distributions.
The hadron spectra can be characterized with Tsallis-Pareto-like distributions, both at low-, and high transverse momenta very well [13][14][15][16][17][18][19][20][21][22][23][24]. The origin of these distributions lies in the assumption that subsystems are not independent of each other. It makes them a good candidate to investigate the QGP in heavy ion collisions or the baseline in smaller colliding systems. Recently a number of systematic analyses have been made in order to find the best form for these distributions [9][10][11][12][25][26][27][28], but in this present study we compare theoretical and experimental results in a wide energy range comprehensively.

The Description of the Inclusive Hadron Production
The Quantum Chromodynamics is the fundamental theory of the strong interaction. Due to the energy-scale dependent behavior of the strong coupling, the perturbative QCD (pQCD) based parton model-initiated by Bjorken and Feynman-works extremely well at high energies [29]. In the framework of the pQCD-based parton model, all hadrons are made up from partons (bare, nearly massless quarks and gluons), therefore the inner structure of the initial colliding and the finally produced hadrons are described by the parton distribution functions (PDF) and by the fragmentation functions (FF), respectively. These non-perturbative distribution functions are defined in the momentum space and can be parametrized by a polynomial ansatz. The PDF, gives the distribution of parton a inside the hadron h at the energy scale, Q 2 , while x a = p a /p h is the momentum fraction carried by that parton. On the other hand, the confinement of the parton c into the final state hadron h with the momentum fraction z c = p h /p c can be described at scale,Q 2 with the help of fragmentation functions D h c (z c ,Q 2 ). (2) In this framework the inclusive cross-section of a given hadron h produced in proton-proton collisions can be calculated by the following convolution: Here the parton distribution function of a proton is denoted by f a/p (x a , Q 2 ) and dσ dt is the differential cross-section of the ab → cd partonic process, the variablet is related to the 4-momentum exchange of the particles.
The hadronization is described within the parton model by the above phenomenological fragmentation functions, for which several forms of parametrization exist in the literature. These parametrizations are usually fitted to lepton scattering data, therefore they describe existing experimental results in a broad-range in the parameter space. In Section 5, after investigating the energy dependence, we show the latest results of a new fragmentation function parametrization based on non-extensive phenomena.

Hadronization Using Non-Extensive Statistics
As we have already mentioned, the transverse energy distribution of the measured hadrons-the particle yield measured in the y ∈ [−0.5, +0.5] midrapidity region-is an important quantity accessible to measurement. In practice, the low-energy regime is described by exponential-like functions, as a thermalized system, while the high p T regime behaves like a power-law, p −n T . The Tsallis-Pareto-like distributions handle these two regimes simultaneously.
The technical apparatus in the high-energy physics shows a great advancement, nowadays the statistics of these spectra is larger than ever. It is no surprise that the Tsallis-Pareto distributions are widely used by the high-energy community to describe hadron spectra. The STAR and PHENIX collaborations at RHIC BNL (USA) and the European CERN's ALICE, ATLAS, and CMS collaborations at the LHC are using the following form to characterize the particle yield [30][31][32][33][34][35]: where n and C are fit parameters and m T = p 2 T + m 2 is the transverse mass, including the rest mass m of the given identified hadron species. We note that this formula is based on the QCD-Hagedorn formula [36][37][38][39][40]. This and other variations of the distribution are exhaustively tested e.g., in [9][10][11][12][13][14][15]25,26]. Below we theoretize over the origin of such Tsallis-type formulas. Contrary to the fixed fit parameters of the Tsallis-Pareto distributions as in Equation (4), we assume that the identified hadron spectra are characterized with a scaling Tsallis-distribution, where an energy scaling of the Tsallis-parameters is also present. In the following we refer to these as Tsallis-like distributions.
In extensive systems the entropy is finite in the thermodynamical limit, lim N→∞ S N N < ∞. This is the case with the Boltzmann-Gibbs-Shanon entropy formula, S = − ∑ i P i ln P i , where P i is the probability of being in state i. In strongly correlated systems, it turns out that the total entropy of the system is not the sum of the entropy of the subsystems: For our generalization, we use the well established terminology of the thermodynamics, since we expect to include the classical Boltzmann-Gibbs case too. Let us consider a monotonic, transformed entropy, L(S), which satisfies additivity , Note, on general terms L(S) is the logarithm of the formal group of phase space factors Ω(S) = e S . Due to this assumption, applied recursively in ensembles, we arrive at the following general class of entropies [41]: Because L(S) is by definition a monotonic function, the most likely state of a heat reservoir and its subsystem at maximum entropy is equivalently at where E 1 is the energy of the subsystem, E 2 = E − E 1 is the energy of the reservoir. While keeping E = const in the entropy maximum Equation (8) we obtain It makes the use of the usual definition of thermodynamical temperature expedient, Assuming now E 1 E in high-energy collisions, we consider: E → √ s and E 1 → (m T − m) ≈ p T of the particle, Inserting this into Equation (10), we arrive at the formula, By looking for an universal termostat, lending to β 1 an absolute temperature interpretation, we assume that the energy of the subsystem is independent from the energy of the reservoir, i.e., we require the term linear in E 1 to vanish. After ordering we obtain: This equality among general functions, L(S) and S(E) is possible only if both are equal with a constant, where C is the heat capacity of the reservoir. The solution of this differential equation has all desired features: Replacing it into the Equation (7), is the (now additive) Tsallis entropy, while turns out to be the Rényi entropy. This argumentation can be used also for microcanonical systems, with S 1 = − ln P 1 and P 1 being the distribution of the subsystem's states. Using the previously-defined generalized entropy, L(S), one arrives at the following energy distribution, which maximizes the q-entropy: It is a Tsallis-Pareto distribution with the individual energy, E i and Z is calculated form ∑ p i = 1.
In high-energy collisions we also have to deal with fluctuations event by event. Following the calculations in references [41,42], one may assume that the multiplicity of the created hadrons follow a negative-binomial distribution for bosons, a binomial one for fermions. Due to such general reservoir fluctuations, the q non-extensivity parameter receives a correction [41,42]: As the average number of created particles can vary in a wide range depending on the studied system-typically O(10 2 ) in proton-proton collisions, while O(10 3 − 10 5 ) in nucleus-nucleus collisions. We expect this fluctuation effect to overcome the finite heat capacity condition, therefore one observes q > 1. It is also straightforward to see that enlarging the system results in C → ∞, and if fluctuations become sufficiently suppressed, we get back the Boltzmann-Gibbs case with exponential distribution. On the other hand, the assumption q → 1 leads to a Gaussian distribution for the β values, which can be an approximation, but never the complete truth. This parameter is used to call the entropic index or the non-extensivity parameter, present as a measure of the deviation from the Boltzmann-Gibbs case, q = 1. We note, in high-energy nuclear collisions this value is in the range 1.0 < q < 1.5, which suggests fluctuations override the system size effects, related to the heat capacity of the reservoir.
The distributions Equations (4) and (18) behave similarly, they both can be regarded as Tsallis-Pareto-type distributions. The authors in [25,26] investigated how the different types fit the experimental data. Many further useful readings regarding the thermodynamically consistent non-extensive approach can be found in the literature. The first possibility is presented in [22][23][24], representing the case where the power is proportional to 1 q−1 . An another kind of approach where the power is q 1−q , as discussed in references [43][44][45][46]. For our analysis the chosen form is the following [47]: As it was shown in [42,48] the parameters q and T for an ideal case are connected to the mean multiplicity and its variance: They also may depend on each other. Since in case of q → 1 one has T → T BG , the parameter q is a measure of non-extensitivity (i.e., non-Gaussivity in β fluctuations, non-Poissonity in the multiplicity distribution P(N)). T is like the kinetic temperature.
Based on Equation (21), for fixed ∆N 2 / N 2 = σ 2 one obtains: On the other hand for an NBD (Negative Binomial Distribution) with fixed N /k = f one gets Our aim in the followings is to explore the center-of-mass energy evolution of the parameters q and T, especially keeping in our mind their physical meaning. Based on the definition of the PDF and FF of the pQCD-based parton model, we expect a logarithmic scaling. Since this was observed even in fits of electron-positron data [9], where PDFs do not appear, we connect the non-extensive features with the hadronization (fragmentation) processes only. The argumentation behind this will be explained in the next subsection.

Motivation for Qcd-Like Energy Scaling of the Parameters
Partons, the elementary momentum carriers in the strongly interacting matter, are tagged with a quantum number named color, which property is not observable directly. The quarks, antiquarks and gluons together confine into color singlet hadrons during the hadronization process. Hadron formation can happen at any energy scale, Q 2 . The dependency on it can be factorized into the running nature of the strong interaction's coupling constant, α(Q 2 ). Since any observable quantity should be independent of the arbitrary fixing of the energy scale appearing in perturbative QCD calculations, the mathematical description ought to be (energy-)scale independent at any fixed order. To satisfy this request is not an easy task, due to the non-perturbative nature of non-Abelian fields at low-energy.
As we presented the hadron production within the pQCD-based parton model in Section 3.1, the convolution in Equation (3) includes the scale parameter (Q 2 ) in its kernel. However, the cross-section-being an observable quantity-should be independent of this inner parameter. Technically, this is achieved with the following method: while calculating the partonic (color) cross sections at a given order, the scale can be factorized out and merged into the non-perturbative phenomenological functions. These are the parton distribution and fragmentation functions, and they must satisfy a proper scale evolution equation for avoiding scale-dependent hadronic yields.
To obtain the scale invariance, the following formula should be fulfilled at any fixed order for any generic form of such phenomenological functions [49][50][51]: where R(z, Q 2 ) can be either the PDF or FF, typically at the momentum fraction of the mother and daughter particles, z. In general, this evolution equation determines the possible form of quantity R, which naively depends on the current energy scale at a given fixed order. Solving this Callan-Symanzik equation one can obtain the energy scale dependence of the running coupling at a given order in any theory [52].
In the perturbative QCD based parton model, the parametrized parton distribution and fragmentation ansatz functions are typically given in a power-law form [53]. One can get then the proper scale evolution by solving the Doksitzer-Gribov-Lipatov-Altarelly-Parisi (DGLAP) equations [49][50][51]. Due to its polynomial power-law form, the predictive power of the calculations gets weaker at low-z. We expect that a Tsallis-like distribution with the appropriate parameter evolution can resolve this problem, providing a better description. This motivates us to fit q and T parameters as a function of the center-of-mass energy Q 2 ∼ √ s in a similar fashion, as it was done in reference [54]. Here our aim is to test the validity of this approach via investigating the energy-evolution dependence of the parameters.

The Improved Quark-Coalescence Model
Another description of the hadron formation is based on the constituent quark scaling. In the quark coalescence model the usual underlying assumption is that the hadronization takes place in a thermal system, where all the participating partons emerge at the same temperature [55,56]. This idea was developed for the description of hadron production in high-energy heavy-ion collisions, where the bulk of the hadron yield closely follows the exponential shape. In larger colliding systems, like in central collisions of large nuclei, this idea worked well, especially for the low transverse momentum regime, p T < 3-5 GeV/c, with a single temperature parameter.
In the original approach the energy distribution of the partons follow the Boltzmann-Gibbs statistics. Then, one approximates the formation rate as the multiplication of k such Boltzmann-Gibbs distributions: In the present non-extensive framework we still assume that the partons are part of a simple ensemble, but we replace the Boltzmann-Gibbs exponentials by Tsallis-Pareto distributions. Now the rate is the following: In order to test this model, we check the fitted q parameters of the identified hadrons. If the equal-energy quark-coalescence theory is correct for both the quark-antiquark containing mesons and triple (anti)quark (anti)baryons, then one observes: The meson-baryon ratio for the Tsallis parameters should be around 3/2: This idea formulated by Equation (28) can be tested as fitting the hadron spectra and investigating the ratio of the parameter (q i − 1) for different identified hadron species. As the constituent quark scaling is getting stronger at larger energies, we expect to reach this theoretical value only asymptotically.

Fitted Parameters
For our analysis we use identified hadron spectra datasets measured in proton-proton collisions in recent years [34,35,[57][58][59][60][61][62][63][64][65]. The numerical fits of the various datasets were made utilizing the CERN Root analysis software (https://root.cern.ch/, version 6.06/00). Although we conducted a comprehensive study, it is worth to note that it is not necessarily meaningful to compare the fit-parameter values for all existing data, because kinematical ranges may vary and the multiplicity-classes are also not defined evenly. This especially holds for kaons and protons at high p T ; this generates some uncertainty in those fits. To circumvent these difficulties, we fixed a recipe for the fit procedure making it consistently insensitive to the fit program(s), the size of the fit parameter space and input parameters.
In order to counterbalance the effect of varying ranges we perform the fit procedure in multipl steps: 1. fit of the high-p T part by fixing T and changing q; 2. fit of the low-p T part by fixing q and changing T; 3. fit of the whole p T range with both parameters, starting from the above obtained q and T.
We define "low-p T " as p T < 4 GeV/c and the "high-p T " part as p T ≥ 2 GeV/c, respectively. The overlap is intended. In the function defined in Equation (20) the parameter T sets the characteristic p T scale. In fact, for q −→ 1 one obtains f 1 (m T ) = Ae −(m T −m)/T . The parameter q on the other hand is linked to the 'power-law like' tail at high p T . The procedure was evaluated by comparing the The investigated spectra and the fitted Tsallis-Pareto functions are shown in Figure 1. Upper panels of the plots present the fits of experimental data measured in proton-proton collisions at √ s = 62.4 GeV, 200 GeV, 500 GeV, 900 GeV, 2.76 TeV, and 7 TeV center-of-mass energies. We considered various neutral, charged and charge-averaged hadron species, π ± , π 0 , K ± , p, andp. Identified hadron m T spectra are scaled by constant factors (2 n ) for better visibility, as indicated in the panels.
In the lower panels "Data/Fit" plots are presented for each case. One can observe how well the distribution (20) describes the yields in the whole 62.4 GeV ≤ √ s ≤ 7 TeV center-of-mass energy range in the m T 20 GeV region. Within the mid m T -regime the overlap with data is excellent, while at the highest m T values or for heavier hadrons the deviation is somewhat larger. =13000 GeV s ALICE, To determine the center-of-mass energy dependence, we review the √ s evolution of the fitted q i and T i parameters for each hadron species, i ∈ π ± , π 0 , K ± , p, andp . According to the formula (20), the parameters of the Tsallis-Pareto distribution are plotted in Figure 2 as a function of √ s. Based on the motivation presented in Section 3.3, we assumed an energy-evolution for each hadron type i as follows [66]: In these formulae, the mass m i of the identified hadron i is used to set the physically relevant energy scale. In Equation (30) the parameter T 1,i is fixed at T 1,i = 50 MeV, as suggested by reference [14]. Hence, the function in Equation (30) is parametrized by T 2,i .
In Figure 2a, the fitted q i values are plotted for each √ s, for given identified hadron spectra summarized in Table 1. One can see in the graphs, that the q i ( √ s) values are close to each other and all curves slightly increase with √ s, following nicely the formula (29). For pions and kaons the increase is very similar and their evolutions are alike. However, the precise increase seems to be larger for the kaons.
We note, that charged and neutral pion results should be consistent. Taking π 0 and π ± together for the fits, the mesonic components overlap more. This alternative behavior is thought to be the effect of different kinematical ranges. See more on fitted parameters and χ 2 /NDF in Table A1 and on kinematical ranges in Table A2 in the Appendixes A and B.    Table 1. Here the energy evolution of the parameter T( √ s) shows an increasing trend with the mass of the hadron species. The obtained parameter values supports the idea of a mass hierarchy effect: the higher the mass, m i , the larger T 2i .
One can realize from Figure 2a,b, by comparing them in the √ s < 3 TeV regime, that massive protons and kaons present the smaller change in q i , and their masses are closer to the lattice QCD crossover temperature T ≈ 170 MeV [67]. Light pions deviate more as increasing the energy, and the obtained T π is smaller, around ≈ 100 MeV. It is consistent with our picture that, lighter particle can suffer larger fluctuations, which increases the parameter q i following Equation (19).
Based on the √ s evolution of the experimental fit curves in Figure 2, we could predict the parameter values for the soon-to-be available LHC-energy collisions at √ s = 13 TeV and 14 TeV. These energies are indicated on both panels with vertical lines. According to the the assumptions given by the ansatz Formulae (29) and (30) and the fit parameters from Table 1 we summarized these values in Table 2 for √ s = 13 TeV. These data were used to plot the 13 TeV center-of-mass energy prediction on the bottom right panel of Figure 1. Note, √ s = 14 TeV data is expected to have very similar values within errors.
In Figure 3 we show the fitted q i and T i values for different hadron species at the center-of-mass energy values listed above. In agreement with reference [28], we observe that the non-extensivity parameter q i on Figure 3a is less sensitive to the hadron mass, however the importance of the center-of-mass energy of the colliding system is remarkable. As we have seen already on Figure 2, pions have somewhat larger non-extensitivity than more massive hadrons: Figure 3 also highlights that protons present weaker c.m. energy dependence than mesons. The parameter T i reflects an opposite mass-hierarchy ordering, on Figure 3b. The more massive hadron, the larger T i value: Nevertheless, we observe only a weak center-of-mass energy dependence. Table 2. Predictions for the Tsallis-Pareto parameters for √ s = 13 TeV (left) and 14 TeV (right), for hadrons i ∈ π ± , π 0 , K ± , p, andp, based on the Formulae (29) and (30). . Figure 3. The fitted q i (a) and T i (b) values for each hadron type, i ∈ π ± , π 0 , K ± , K 0 s , p, andp, c.f. [28] 4.1. The (T, q) Parameter Space for Identified Hadrons Summarizing our obserations we can conclude that the center-of-mass energy evolution of the fit parameters works well with our logarithmic evolution ansatz.

•
The q 2i and T 2i parameters are getting slightly larger with the larger hadron mass, m i , and applying Formulae (29) and (30) the evolution is described nicely in the whole tested energy range, The obtained q i ( √ s) function increases with √ s in the range 1.07-1.17 indicating the deviation from the Boltzmann-Gibbs case where q = 1. The deviation from this "thermodynamical limit" case grows as the center-of-mass energy gets higher values. However large statistical errorbars correspond to the lack of statistics in specific particle identification methods of the measurements. (See more in Appendix B.) • The T i ( √ s) kinetical temperature parameters almost keep constant values, with the following hadron (mass) hierarchy: T π = 120-140 MeV, T K = 120-200 MeV, and T p = 70-240 MeV.
We plot the parameters q i and T i on the Figure 4. The fitted parameters gather in the T i ∈ [70,240] MeV and q i ( √ s) ∈ [1.07, 1.17] parameter space, which is indicated by the shaded area. In Figure 4b, while keeping the shaded area, we included the fit-result of theoretical calculations as well. We used two model to get the identified hadron spectra series, namely PYTHIA8 [68,69] and kTpQCD_v20 [70]. We chose several c.m. energy values and the pseudorapidity region, |η| < 0.5 for our calculations, and finally we applied the same fit procedure as described in Section 5. We plotted the parameters q i and T i together with the experimental data-fitted shaded region.  Figure 4. (a) The parameter space T i − q i extracted from experimental p T spectra for hadron types i ∈ π ± , π 0 , K ± , p, andp; (b) The PYTHIA and kTpQCD_20 calculated spectra fit results on the same hadron types added to the measurement fit points.
The calculated points are denoted by empty symbols for each identified hadron at several center-of-mass energies. Theoretical data fits partially overlap with the shaded area defined by the experimental fits, although several points are deviating.

PYTHIA8
We generated 10M events using PYTHIA8 [68,69] Lund's high-energy Monte Carlo event generator to simulate the identified hadron spectra. Points for (charge-averaged) pion, kaon and protons spread in the q i ∈ [1.08, 1,23] range, wider than the experimental points. In contrast to that, the T i ∈ [80, 150] MeV corresponding to the experimental values on the left panel. Deviating points of the PYTHIA8 results are those which lack sufficient statistics at the highest transverse momenta. Here, the tail of the distribution is indefinite, thus q i values fall outside of the experimental q i ( √ s) ∈ [1.07, 1.17] parameter space. One recognizes that pions deviate less, since they have the highest statistics among all, followed by kaons and protons. We note that the deviance of q i disappear as we exclude the low-statistic data at the highest momenta at each energy value, while the consistency with T i remains.

kTpQCD_v20
Hadron spectra calculated within the framework of perturbative QCD were also used utilizing kTpQCD_v20 [70]. These calculations deliver similar results for all hadron species, because of the similar (polynomial) fragmentation parametrizations. Concerning the correspondence between perturbative QCD results and experimental data fits, both T i and q i are running out of the experimental regime in a similar way. Deviation from the measurement-based data is most remarkable at low c.m. energies, where the p T range of the spectra is too short due to the limited phase-space. The domain of pQCD is p T > 1.5 GeV/c and the maximal energy is typically p T < √ s/2. This limited range makes the fits more doubtful.
We also investigated the center-of-mass energy dependence of the fit parameters calculated theoretically from the the PYTHIA8 [68,69] and kTpQCD_v20 [70] models. In Figure 5, we compare the experimentally observed √ s-dependence from Figure 2 (solid lines) to these theoretical model results (data points). Figure 5a is for parameter q i .

Non-extensivity, q i :
According to the above observations, the perturbative QCD points are close to each other, due to the similar fragmentation function parametrization for the hardon species, mostly at the tail of the distributions at the highest energies and momenta-where experimental and theoretical data meet each other. With kTpQCD_v20 pions and kaons have similar slopes in log( √ s). PYTHIA8 results violate the inequality (31) and are larger: q PYTH I A,i > q EXP,i . However, the log( √ s) evolution has the same trend and similar slopes-except for protons.

Temperature-like, T i :
The kTpQCD_v20 points for each hadron species are close to each other and meet the temperature values only at the highest-energy regime. In this case the formula (32) represents a trend opposite to the perturbative QCD calculations. Theoretical fit parameters deviate here appreciably. We count this for the non-applicability of the pQCD at the low-momentum regime, p T < 1.5 GeV/c, where the spectra are more thermal-like. On the other hand, PYTHIA8 works well for both the soft and hard regimes for the light hadrons. The √ s evolution follows the experimentally observed trend, only a small offset is present for kaons and protons.
We conclude that these model calculations proved their validity in several ways [71,72]. Nevertheless, these pictures are not fully consistent with the fit parameters obtained from the data. In other words, comparison of theoretical models should be made with care within their region of validity.

Comparison with the Improved Quark-Coalescence Model
As we have explained in Section 3.4, the quark-coalescence model was developed for heavy-ion collisions originally, but it was improved by the previously introduced Tsallis distribution. In the followings we endeavor to extend this idea also for smaller systems, such as proton-proton collisions.

Connecting Non-Extensivity with the Quark-Coalescence Model
According to the coalescence picture, the observed of (q meson − 1)/(q baryon − 1) should be 3/2 [73]. In Figure 6 we plot the ratio χ ij = (q i − 1)/(q j − 1) as a function of the center-of-mass energy, √ s. Figure 6a presents the ratios of experimental data points compared to the fit curve ratios of π/p, K/p and K/p listed in Table 1. A monotonic, increasing trend of the ratios is clearly seen at the lower energies and saturation can be expected in the most energetic reactions. There are two important observations that is worth note: 1. the (q meson − 1)/(q baryon − 1) fit curves lie below the dashed line with the value of 3/2 within the √ s ∈ [62.4 GeV, 10 TeV] c.m. energy range; 2. the χ Kπ kaon-pion ratio shoots over the expected value, 1, a bit.
[GeV] s  Figure 6. (a) Ratio of (q meson − 1)/(q baryon − 1) plotted in the function of √ s as data points and fit lines. (b) The PYTHIA and kTpQCD_20 calculated spectra fit results on the same hadron types added to the measurement fit lines.
On Figure 6b we plotted the theoretically calculated points along with the experimentally fitted solid lines. It is not surprising, that theoretical curves present quite flat functions for all combination of the ratios, since both the PYTHIA8's Lund fragmentation and the fragmentation function parametrizations in the kTpQCD_v20 are based on the constituent quark model and the infinite momentum frame is assumed as well. Thus, there is no room for the evolution apart from a constant hadron-mass effect. This can result only in a shift of the q i values as we have seen on the left panel of Figure 5. Deviation from the constancy appears only at the lowest energies, but as we have seen earlier in Section 4.1, at low energies both PYTHIA8 and kTpQCD_v20 have limited phase space. The average values of the ratios χ ij = (q i − 1)/(q j − 1) are summarized in Table 3 for i, j ∈ {π, K, p}. Table 3. The average values of the hadron spectra parameter ratios, χ ij = (q i − 1)/(q j − 1), obtained from theoretical models PYTHIA8 and kTpQCD_v20. The kTpQCD_v20-and PYTHIA8-calculated (q i − 1)/(q j − 1) points have the same order: For χ Kπ both models give the same value, 10% larger than the improved coalescence expectation of 1. For χ meson,baryon kTpQCD_v20 has slightly higher values than PYTHIA8, but both are far below the expected value 3/2. Comparing the experimental fit curves and the theoretically calculated points, both theory meets the experimental values of χ Kπ . However, for χ meson,baryon the kTpQCD_v20 model agrees with the experimental values only at the highest √ s c.m. energies.
In summary the improved quark-coalescence model prediction might be reached only beyond the LHC energies, now they seem to support the smaller values. Allegedly, constituent-quark scaling is a high-√ s property. Experimental data support the trends, the very hadronizaton model needs further investigation.

Investigating the T Slope in the Quark-Coalescence Model
In the quark-coalescence model, T hadron = T parton = T, thus T meson = T baryon is also assumed, c.f. Section 3.4. Using the Tsallis distribution we consider the logarithmic slope of E i spectra: This may explain the mass ordering found in Equation (32). A possible way to read off this effect would be to determine the slope of (m T,i − m i ) spectra. Estimating E i by √ s − m i one obtains results as seen in Figure 7.   (34), fitted on the theoretical PYTHIA8 and kTpQCD_v20 data (empty points) and on the experimental values (solid points), for all investigated hadron species.

Summary and Discussion
In this study we analyzed identified hadron spectra measured in proton-proton collisions from RHIC to LHC energies in the range 62.4 GeV ≤ √ s ≤ 7 TeV. We showed that the Tsallis-Pareto distributions originated from non-extensive thermodynamics describe the spectra very well in wide m T regions, typically at p T 10-20 GeV/c using the distribution in the form of Equation (20).
We provided a comprehensive and detailed analysis of the state-of-the-art experimental data which will be used also to make predictions about the forthcoming 13 TeV and 14 TeV spectra. The ∼ log( √ s)-like evolution of the parameters q i and T i were tested on the identified hadron spectra data measured for charge averaged π ± , π 0 , K ± , p, andp. We observed that both the non-extensivity parameter q i and temperature-like T i parameters agree with the suggested QCD-inspired evolution pattern. However, the temperature has almost a constant value within the investigated center-of-mass energy regime. We found a mass-ordered hierarchy in the evolution parameters of the experimental fits, i.e., lighter hadron spectra have the more non-extensive q i > 1 and heavier hadron spectra fit with larger T i values. However, we note that the deduced T i values might be sensitive not just the hadron mass but also on the chosen distribution function. This might result different values extracted from similar data, e.g., in reference [27].
We compared the experimental data fit results with theoretical predictions. The c.m. energy evolution of the fit parameters were calculated by PYTHIA8 [68,69] and kTpQCD_v20 [70]. We found theoretical fit parameters to be more compact in the (T, q) space than the experimental ones: T i ∈ [80, 240] MeV, while non-extensivity is wider than the measurement-based q i ∈ [1.08, 1,23] region. The most deviating points arise from limitations of the theoretical models, i.e., where statistics is low or the phase-space is limited. Energy evolution in the theoretical models were investigated as well.
In agreement with our expectations we conclude that (i) for the √ s evolution, kTpQCD_v20 agrees more with the power-law related non-extensivity parameter q i ; (ii) PYTHIA8 results correspond well with the measured T i ( √ s) evolution.
The study of these models reflected the lack of the proper handling of the hadron-mass, since all assumptions fail for more massive hadron species.
At the highest energies and momenta, in the infinite momentum frame, constituent quark number scaling is assumed to get stronger. To test this idea in the framework of the non-extensive approach, we applied and investigated an improved quark-coalescence model, inserting Tsallis-like energy distribution kernels where constituent quark scaling appears explicitly. Experimental data present a slight monotonic-increase with c.m. energy, but the saturation ridge of the ratio (q meson − 1)/(q baryon − 1) is lower than predicted by the coalescence-theory (3/2) while the reference ratio (q meson − 1)/(q meson − 1) is only slightly apart from the expected value 1. The fit parameters calculated by PYTHIA8 and kTpQCD_v20 models both have almost no √ s evolution, but only the ratio values for light mesons are in agreement with the experimental data especially at the highest LHC energies.
In summary, our detailed analysis aimed to investigate how we can provide physical meaning for experimentally-fitted parameters, based on well-known theoretical models and phenomena. Our results motivate us to improve the model of hadronization in high-energy collisions, using spectra with exponential shape at low-p T , keeps the power-law tail at thigh p T , and takes care of the meson/baryon spectra ratios and/or the experimentally observed (q meson − 1)/(q barion − 1).

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript:

Appendix A
In Table A1 we show the parameters fitted to all used datasets [34,35,[57][58][59][60][61][62][63][64][65]. The parameters q i , T i and A i from Equation (20). and the χ 2 /ndf values of the given fit are listed for each identified hadron and for each √ s center-of-mass energy value. Table A1. The fitted q, T and A parameters and the χ 2 /ndf value of the fit for identified hadrons π 0 , π ± , K ± and pp. The values are grouped according to the hadrons such that the change in respect to the center-of-mass energy can be compared easily.

Appendix B
In Table A2 the used datasets, their kinematical properties and the corresponding references are listed. As we already mentioned in Section 4. that the spectra were measured in wide range of kinematical variables, although these values varies in different experiments.