Next Article in Journal
Exploring the Influence of Oblateness on Asymptotic Orbits in the Hill Three-Body Problem
Previous Article in Journal
Introducing the Leal Method for the Approximation of Integrals with Asymptotic Behaviour: Special Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Četaev Condition for Nonholonomic Systems

by
Federico Talamucci
DIMAI, Dipartimento di Matematica e Informatica “Ulisse Dini”, Università degli Studi di Firenze, Via S. Marta 3, 50139 Firenze, Italy
AppliedMath 2025, 5(1), 29; https://doi.org/10.3390/appliedmath5010029
Submission received: 28 January 2025 / Revised: 5 March 2025 / Accepted: 10 March 2025 / Published: 13 March 2025

Abstract

:
In the context of holonomic systems, the identification of virtual displacements is clear and consolidated. This provides the possibility, once the class of displacements have been coupled with Newton’s equations, for us to write the correct equations of motion. This method combines the d’Alembert principle with Lagrange formalism. As far as nonholonomic systems are concerned, the conjecture that dates back to C ˇ etaev actually defines a class of virtual displacements through which the d’Alembert–Lagrange method can be applied again. A great deal of literature is dedicated to the C ˇ etaev rule from both the theoretical and experimental points of view. The absence of a rigorous (mathematical) validation of the rule inferable from the constraint equations has been declared to have expired in a recent publication; one of our objectives is to produce a critical comment on this stated result. Finally, we explore the role of the C ˇ etaev condition within the significant class of nonholonomic homogeneous constraints.

1. Introduction

Our study concerns discrete systems constrained by nonholonomic constraints. Among the various approaches used to deal with the problem (virtual displacements, variational principles in differential or integral form, algebraic and geometric methods), our attention is directed to the d’Alembert–Lagrange principle (d’A–L P.). The classic formulation of the principle will be presented in the next Section; here, we address the issue from a substantial and qualitative point of view.
As is known (see, for instance, [1] or [2]), the formulation of d’A–L P. starts from Newton’s equations of motion and selects a particular category of displacements δ r (we will define this notation later on), which are consistent with the dynamics of the system (virtual displacements). The situation is clear and universally accepted when we are dealing with systems constrained by geometric conditions (holonomic constraints): the displacements are distinctly identifiable and spontaneously linked to simple concepts in geometry (tangent space of a manifold).
The approach through d’A–L P. is not so common in the literature when we move on to the more complex category of kinematic constraints, i.e., constraints which involve the velocities of the system and which cannot be reduced to geometric constraints via integration. Starting from the fundamental text [3], the problem of how to define displacements for nonholonomic systems is rarely addressed, and alternative approaches (methods based on differential geometry or on a variational principle) might prevail in the study of nonholonomic systems. We refer to [4] for a useful review and an updated state of the art with respect to virtual displacements in nonlinear nonholonomic constraints.
In examining the role and potentiality of the d’A–L P. when dealing with nonholonomic systems, we find a clear line of demarcation dividing the linear case and the nonlinear one; if, in the first case, the application of the d’A–L P. method is a natural and simple extension of the holonomic model, in the latter case, the task of assigning displacements to the system is more difficult and unclear. From a mathematical point of view, the question is simple to introduce; the concern is to deduce appropriate integer conditions in terms of δ r starting from a set of constraints on the state ( r , r ˙ ) of the system, which cannot be integrated. The main difficulty lies in the fact that the displacements that work for a straightforward derivation of the equations of motion are not necessarily possible (i.e., compatible with the constraints) displacements.
Nevertheless, it must be said that the topic of nonlinear kinematic constraints is quite problematic, and it is not confined to the perspective of the d’A–L P. we are focusing on; actually, starting from the very existence of physical models of this type, the issues under debate concern the concrete feasibility of physical models [5], the distinct role from control forces [6], the axiomatic and theoretical features that provide a valid generalization of the linear case [7], and the mathematical aspect of expressing a certain condition in multiple equivalent ways and in different sets of coordinates [8], to mention just a few. In addition to the references mentioned above, we indicate the texts [9,10] and articles [11,12,13,14,15], which examine the topic of nonlinear constraints from a historical and axiomatic point of view.
Returning to the themes of this work, our analysis aims to examine a condition that establishes the class of virtual displacements δ r in a very simple way, which is known as the C ˇ etaev rule ([16], and we also refer to [17,18]). Naturally, as we will recall later, the rule properly extends the existing conditions for the holonomic case and the linear kinematic case. An interesting aspect is that historically, the hypothesis was born with the aim of making the d’A–L P. and the Gauss principle equivalent in order to achieve agreement through the Hertz–Helder principle.
Going through the specific literature, we understand that the C ˇ etaev rule is in the state of a postulate; that is, there does not seem to be a rigorous theoretical justification starting from the laws of mechanics [4]. At the same time, the debate is also open in the context of experimental tests, and there are conflicting opinions even when the hypothesis is tested directly with experimental procedures [19,20]. For this reason, we were impressed by the content of [21], where a theoretical argument is intended to extend the validity of the d’A–L P. to treat general nonholonmic systems; essentially, the C ˇ etaev rule is claimed to be derived (through mathematical steps) directly from the constraint equations, and in this way, the displacements defined by the rule itself are fully justified. The procedure is also extended to higher-order constraints, which involve the accelerations of the system.
The dissertation in [21] is undoubtedly outstanding and exhaustive since it explores the most remarkable aspects of the theory on nonholonomic constraints (including various types of principles, the debated transpositional rule, ans the pseudovelocity formalism). This is why we find it appropriate, as the initial purpose of our work, to dwell on the mathematical tools by which the rule seems to find its demonstration.
The second aim of this paper is to investigate the effects of the C ˇ etaev rule on relevant physical aspects, particularly the overlap of virtual displacements and instantaneous velocities, the vanishing of virtual work carried out by the constraint forces, and the definition of a Hamilton function restricted to the independent velocities.
For the sake of clarity, we present the organization of the work and indicate the main steps as follows:
  • In Section 2, we present the formal method and ordinary method of virtual displacements, starting from Newton’s equations of motion. We explicitly write the equations of motion for systems constrained with nonlinear kinematic constraints whenever the C ˇ etaev condition is assumed to hold.
  • The first paragraph of Section 3 is devoted to commenting on the theoretical justification of the C ˇ etaev rule asserted in [21]. In the next two paragraphs, we analyze the role of this rule with respect to virtual displacements such as possible instantaneous velocities, virtual work, and the energy of the system. A special condition with respect to the explicit constraint equations (condition (40)) turns out to outline the right class of nonlinear constraints for which the extension of the method virtual displacements is appropriate.
  • In Section 4, we see that the just-mentioned condition identifies the constraints formulated by homogeneous functions (of any degree) with respect to the generalized velocities. Finally, the results are commented on in Section 5.

2. The Mathematical Model

2.1. The d’Alembert–Lagrange Principle

We consider a system of N material points governed by Newton’s equations
m s r . . s = F s + R s , s = 1 , , N ,
where r s is the radius vector in R 3 locating the s-th point, m s is the mass, F s is the corresponding active force, and R s is the unknown force (exerted on the s-th point) due to restrictions imposed to the system.
A virtual displacement δ r s is a variation of the configuration of the system at a given moment of time which is compatible with the constraints imposed on the system [3]. The standard formulation of the equations of motion using the d’A–L P. passes through the definition of ideal constraint (we use the summation symbol Σ for greater clarity given that the range of indices is changeable):
s = 1 N R s · δ r s = 0 ,
so that it can be written as
s = 1 N m s r . . s F s · δ r s = 0 ,
from which the correct equations of motion should be deduced. If generalized independent coordinates q 1 , …, q n , n = 3 N are used, then δ r s = j = 1 n r s q j δ q j for each s = 1 , , N , and (3) is
j = 1 n d d t T q ˙ j T q j F ( j ) δ q j = 0 ,
where the kinetic energy T = 1 2 s = 1 N m s r ˙ s 2 is a function of the generalized coordinates q = ( q 1 , , q n ) and the generalized velocities q ˙ = ( q ˙ 1 , , q ˙ n ) by means of
r s = r s ( q , t ) , r ˙ s ( q , q ˙ , t ) = j = 1 n r s q j q ˙ j + r s t , s = 1 , , N ,
and F ( j ) = s = 1 N F s · r s q j is the generalized force along the j-th direction, j = 1 , , n . The generalized constraint forces are R ( j ) = s = 1 N R s · r s q j , and in the Lagrangian coordinates, the condition (2) of ideal constraint is
j = 1 n R ( j ) δ q j = 0 .
Still within the standard premises of the problem, we recall that if part of the force comes from a generalized potential in such a way that
F P ( j ) = U q j d d t U q ˙ j , j = 1 , , n ;
then, the introduction of the Lagrangian function L ( q , q ˙ , t ) = T + U makes us write (4) as
j = 1 n d d t L q ˙ j L q j F N P ( j ) δ q j = 0 ,
where the term F N P ( j ) takes into account the remaining active forces not deriving from a potential.
Remark 1.
In the transfer from the Cartesian coordinates r s to the generalized ones, no further geometric constraint occurs; i.e., N = 3 n ; if this happens, the selection of a smaller number of generalized coordinates n < 3 N does not involve substantial changes other than the reformulation of the constraint forces R s .

2.2. The Constraint Conditions

We assume that the restrictions to the dynamics of the system can be formulated by the constraints equations
ψ ν ( r 1 , , r N , r ˙ 1 , , r ˙ N , t ) = 0 , ν = 1 , , k < 3 N ;
hence, we are considering kinematic constraints (linear or nonlinear) involving the positions and the velocities os the points. We will use the following notation: for a real-valued function f : R μ R , if f = f ( ζ ) , ζ = ( ζ 1 , , ζ μ ) , we denote by f ζ the R μ -vector f ζ 1 , , f ζ μ . Moreover, for a vector function f = ( f 1 , , f σ ) : R μ R σ , where f i = f i ( ζ ) : R μ R for each i = 1 , , σ , we denote by f ζ the σ × μ Jacobian matrix with entries f ζ i , j = f i ζ j , i = 1 , , σ , j = 1 , , μ .
The constraints are independent according to the following assumption:
the k vectors ψ ν r ˙ R 3 N , ν = 1 , , k are independent ,
where r ˙ = ( r ˙ 1 , , r ˙ N ) R 3 N . Equivalently, the constraints are independent if the rank of the ( k × 3 N ) Jacobian matrix ψ r ˙ , ψ = ( ψ 1 , , ψ k ) , attains its maximum value k.
By means of (5), the constraint Equation (8) can be written in terms of the generalized coordinates as follows:
ϕ ν ( q , q ˙ , t ) = 0 , ν = 1 , , k .
Owing to the chain relation ϕ q ˙ = ψ r ˙ r ˙ q ˙ = ψ r ˙ r q , where ϕ = ( ϕ 1 , , ϕ k ) , we see that the constraints (10) are independent with respect to the generalized velocities q ˙ by virtue of (9) and the non-singularity of r / q since the generalized coordinates q are independent.
The crucial point can be summarized by the following question: as long as the d’Alembert–Lagrange approach (3) is exerted in a general context of nonholonomic (possibly nonlinear) constraints, which is the proper set of virtual displacements to define so that (2) or, equivalently, (6) is valid? Once this has been established, the right equations of motion can be explicitly deduced from (7).
It is natural to keep in mind the holonomic case (whose theory is long-established) as a starting point for the correct generalization: we recall that [1] in the case of geometric constraints of the following type:
ψ ν ( r 1 , , r N , t ) = 0 f ν ( q , t ) = 0
(the Lagrangian formulation by f ν is due to (5)). The consolidated theory, in this regard, provides the following category of virtual displacements:
j = 1 n f ν q j δ q j = 0 .
It is worth noting that setting the constraint in the form of (10) via differentiation:
ϕ ν ( q , q ˙ , t ) = f ˙ ν ( q , t ) = j = 1 n f ν q j q ˙ j + f ν t = 0 ,
(12) coincides with
δ ( c ) ϕ ν = j = 1 n ϕ ν q ˙ j δ q j = 0 .
The form (14) for identifying virtual displacements is known in the literature as the C ˇ etaev condition, and the superscript ( c ) recalls it. More generally, passing from the holonomic case to that of linear kinematic constraints:
ϕ ν ( q , q ˙ , t ) = j = 1 n A ν , j ( q , t ) q ˙ j + b ν ( q , t ) ,
the condition (14) formally reproduces the standard way of treating linear kinematic constraints based on the definition of virtual displacements:
j = 1 n A ν , j δ q j = 0
(indeed ϕ ν / q ˙ j = A ν , j ), which is introduced in most of the treatises on kinematic constraints (it suffices to mention [3]). Hence, the C ˇ etaev condition properly includes holonomic and linear kinematics constrained systems as special cases. Obviously, (16) is (12) when (15) is (13). A debated topic ([20,22,23] et al.) concerns precisely the fact of accepting condition (14) even in the general case of nonlinear kinematic constraints (10).
Remark 2.
Obviously, (14) is not the only way of defining the virtual displacements in terms of the functions (10); a second way which has received attention in the literature requires the introduction of the virtual variations δ q ˙ j , and as a whole, the virtual displacements must verify the condition
δ ( v ) ϕ ν = j = 1 n ϕ ν q j δ q j + ϕ ν q ˙ j δ q ˙ j = 0 .
The superscript ( v ) refers to “vakonomic” approach, which is founded on the theory presented in [24,25,26]. Even though we will not deal with (17), we briefly compare (14) and (17). First of all, the holonomic case (11) falls within (17) only if one sets ϕ ν = f ν in (17) so that (12) is obtained since f ν / q ˙ j = 0 . On the other hand, for holonomic constraints, the direct definition ϕ ν = f ν in (14) (rather than the derivative (13)) would not make sense. In other words, the two formulas match in the holonomic case f = f ( q , t ) as long as
δ ( v ) f ν = δ ( c ) f ˙ ν .
A second difference emerges in the case of linear kinematic constraints (15): on the one hand, δ ( c ) ϕ ν = 0 is (16); on the other hand, in the linear case (15) condition, (17) obtains
δ ( v ) ϕ ν = i , j = 1 n A ν , i q j q ˙ i + b ν q j δ q j + j = 1 n A ν , j δ q ˙ j = 0 ,
which is apparently very different.

2.3. The Equations of Motion

The coupling of the d’A–L. P. principle (7) and the C ˇ etaev condition leads straightforwardly to the equations of motion. A simple argument lies on the geometric meaning of (14): the whole set of virtual variations δ q builds the vector space orthogonal to the space generated by ϕ ν q ˙ , ν = 1 , , k . Since the latter vectors are independent (as we remarked just after (10)), their linear combinations cover the totality of the vectors orthogonal to all the δ q ; among them, we find the vector in round brackets in (4) so that it is necessarily
d d t L q ˙ j L q j F N P ( j ) = ν = 1 k λ ν ϕ ν q ˙ j , j = 1 , , n ,
with λ ν unknown multiplying factors (Lagrange multipliers). The just-written equations have to be coupled with the constraint Equation (10) to form a a system of n + k equations in the n + k unknowns q , λ ν , ν = 1 , , k . The constraint forces ( R ( 1 ) , , R ( n ) ) are smoooth according to (6).
The possibility of writing the equations of motion without the multipliers (which are uncomfortable unknown quantities) passes through the explicit version of (14): in a more abstract and general way, we indicate (14) as
j = 1 n β ν , j δ q j = 0 , ν = 1 , , k ,
where β ν , j are the elements of a k × n matrix with full rank k. Without losing generality, we can determine the displacements as a function of the first independent displacements δ q 1 , …, δ q m , m = n k :
δ q m + ν = r = 1 m γ ν , r δ q r , ν = 1 , , k ,
where m = n k and δ q 1 , , δ q m are arbitrary. By replacing (21) in (7) for each ν = 1 , , k and taking into account that δ q 1 , …, δ q m are independent parameters, the equations of motion can be written as
d d t L q ˙ r L q r + ν = 1 k γ ν , r d d t L q ˙ m + ν L q m + ν = F N P ( r ) + ν = 1 k γ ν , r F N P ( m + ν ) r = 1 , , m ,
to be combined, this time, with (10). The m + k = n equations contain n unknown functions q 1 , , q n ; in addition to the lower number of equations, the obvious advantage of the second system (19) with respect to (19) is the absence of the multipliers λ ν .
When (20) assumes the form (14); that is, β ν , j = ϕ ν q ˙ j , then the functions appearing in (21) are
γ ν , r = α ν q ˙ r , ν = 1 , , k , r = 1 , , m
with
α ν = q ˙ m + ν ( q 1 , , q n , q ˙ 1 , , q ˙ m , t ) , ν = 1 , , k
deduced from (10) by virtue of the non-singularity condition (save for enumerating the variables in a different way),
d e t ϕ q ˙ ( k ) 0 ϕ = ( ϕ 1 , , ϕ k ) , q ˙ ( k ) = ( q ˙ 1 , , q ˙ k )
(the adopted notation for the Jacobian matrix is explained just after Equation (8)). In order to check (23), it suffices to calculate the derivatives of ϕ ν = 0 with respect to the independent velocities q ˙ r , r = 1 , , m :
ϕ ν q ˙ r + s = 1 k ϕ ν q ˙ m + s α s q ˙ r = 0 r = 1 m ϕ ν q ˙ r δ q r + s = 1 k ϕ ν q ˙ m + s α s q ˙ r δ q r = 0
and to compare with (14), so that we can obtain
δ q m + ν = r = 1 m α ν q ˙ r δ q r , ν = 1 , , k
where the virtual variations δ q 1 , …, δ q r are independent.
The relations (24) assign to q ˙ 1 , …, q ˙ m the role of independent velocities, while the remaining ones q ˙ m + 1 , …, q ˙ n are dependent velocities. By virtue of (23), the equations of motion (22) take the following form
d d t L q ˙ r L q r + ν = 1 k α ν q ˙ r d d t L q ˙ m + ν L q m + ν = F N P ( r ) + ν = 1 k α ν q ˙ r F N P ( m + ν ) r = 1 , , m .

3. The C ˇ etaev Rule

3.1. A Reported Proof of the C ˇ etaev Condition

As we read in [22], the equations of motion (19) can be obtained from Hertz’s principle of least curvature [27] or from a variational principle [28]. On the other hand, if we adopt the technique of virtual displacements for the d’A–L P. (4), the conjecture (14) has an axiomatic role and seems to have no real theoretical justification. For this reason, a claimed demonstration [21] based on the constraint equations caught our attention.
The proof in question (Section III, B of [21]) can be reported as follows. By differentiating (8) with respect to t and taking into account the partition among independent generalized velocities q ˙ i , i = 1 , , m and dependent ones q ˙ m + s , s = 1 , , k , we can write
j = 1 n ϕ ν q j q ˙ j + i = 1 m ϕ ν q ˙ i q . . i + s = 1 k ϕ ν q ˙ m + s q . . m + s + ϕ ν t = 0 , ν = 1 , , k ,
whence we deduce, by deriving with respect to q . . r , r = 1 , , m :
ϕ ν q ˙ r + s = 1 k ϕ ν q ˙ m + s q . . m + s q . . r = 0 , r = 1 , , m .
Now, the point we criticize in [21] states (adapting the symbols to our notations): “Although the coordinate function q m + s = q m + s ( q 1 , , q m , t ) , s = 1 , , k is unknown for nonintegrable constraint (10), the dependent displacements
δ q m + s = i = 1 m q m + s q i δ q i = i = 1 m q ˙ m + s q ˙ i δ q i = i = 1 m q . . m + s q . . i δ q i
can be obtained in terms of the independent variations δ q r , r 1 , , m ”. The doubtful step fails, in our opinion, to correctly interpret what is not known via what cannot be obtained. In fact, the (known or unknown) dependence is attributed only to geometric constraints; it cannot be mathematically inferred from a non-integrable differential expression (10).
Returning to (29), it is clear that the first equality is the characteristic condition for the holonomic case and it corresponds to (12) written by explicit functions. The same equality leads to (14) (which holds in the holonomic case, as we have already noticed in (13)) in the following way:
j = 1 n ϕ ν q ˙ j δ q j = i = 1 m ϕ ν q ˙ i δ q i + s = 1 k ϕ ν q ˙ m + s δ q m + s = i = 1 m ϕ ν q ˙ i + s = 1 k ϕ ν q ˙ m + s q . . m + s q . . i δ q i = 0 .
The first step simply separates the independent variables from the dependent ones; the second step makes use of (29), last equality, in order to rewrite δ q m + s . Finally, the entire expression is null due to (28).
We stress that the key point is actually δ q m + k = i = 1 m q m + k q i δ q i , which is characteristic of holonomic systems, and, in principle, it is not necessarily compatible with (14). Using such an identity in a non-proper way is, in our opinion, the point at which the reasoning falls. This property, in fact, is valid for holonomic systems, but for them, we already know that the C ˇ etaev rule holds as the standard condition on displacements.
We further remark that if the procedure were rigorous, the second derivatives would not even be needed: the procedure is much simpler if we replace (28) with
i = 1 m ϕ ν q ˙ i + s = 1 k ϕ ν q ˙ m + s q ˙ m + s q ˙ i = 0 ,
and we conclude that
j = 1 n ϕ ν q ˙ j δ q j = i = 1 m ϕ ν q ˙ i δ q i + s = 1 k ϕ ν q ˙ m + s δ q m + s = i = 1 m ϕ ν q ˙ i + s = 1 k ϕ ν q ˙ m + s q ˙ m + s q ˙ i δ q i = 0
using, on this occasion, the second equality in (29).
The same argument is performed in [21], Section III, C, for higher order constraints of the type:
h ν ( q , q ˙ , q . . , t ) = 0
in order to prove the relation (similar to (14)):
j = 1 n h ν q . . j δ q j = 0 .
The derivative of the constraint equation involves the third derivative h . . . ν , and the property
δ q m + s = i = 1 m q m + s q i δ q i = i = 1 m q . . . m + s q . . . i δ q i ,
analogous to (29), is claimed in order to prove the validity of (30). Nevertheless in our opinion, “unknown” is confused with “not existing”, once again, at the moment in which the dependent functions q m + s ( q 1 , , q m , t ) have, although unknown, been assumed to exist. A comprehensive and remarkable point of reference in regard to higher-order constraints is [29].
Remark 3.
The claimed proof of (14) does not actually require (17) to hold, as one seems to understand from [21]; as a matter of fact, the reasoning starts from the differential expression g ˙ ν (the one preceding (28)), regardless of the condition in terms of displacements (17).
Still, according to [21], the theoretical justification of the C ˇ etaev rule for a general constraint ϕ ν ( q , q ˙ , t ) = 0 is supported by the fact that the same Equation (19) can be deduced from the Gauss principle; although we will not go into the details of the mathematical reasoning that justifies this derivation from the principle, we may observe that the validity of the identity
r q j = r ˙ q ˙ j = r . . q . . j
is invoked; whereas, in our opinion, it can be used only in the holonomic case. In the next paragraph, we will deal with the just-written identity.

3.2. An Argument in Support of the Rule

Regardless of a rigorous theoretical support, hypothesis (14) is widely considered in the literature for nonholonomic mechanical models; moreover, it is particularly suitable for extending the concept of virtual displacements for higher order constraints, as (30) suggests. Here we are going to add a heuristic consideration in favor of the condition (14).
The question can be formulated as follows: coming back to the notation (5), we consider a system r s ( q , t ) and the virtual displacements
δ r s = j = 1 n r s q j δ q r
in terms of the virtual variations δ q r . In the case of a holonomic system, the property r ˙ s q ˙ j = r s q j makes (31) equivalent to
δ r s = j = 1 n r ˙ s q ˙ j δ q r .
In nonholonomic systems, the additional kinematic constraints (10) prevent (31) and (32) from being equivalent since the velocities q ˙ 1 , …, q ˙ n are not independent.
The following property establishes that the definition of virtual displacements based on (14) is the only one that extends the equivalence between (31) and (32) to the case of nonholonomic constraints, provided that only the independent velocities q ˙ r , r = 1 , , m are considered.
Property 1.
Assume that (14) holds for the set of kinematic constraints (10) ν = 1 , , k imposed on the system r s ( q , t ) , s = 1 , , N . Then, δ r s , defined in (31), verifies
δ r s = r = 1 m r ˙ s q ˙ r δ q r , s = 1 , , N
where q ˙ 1 , …, q ˙ m is the set of independent velocities, as in (24).
Proof. 
Since, owing to (24),
r ˙ s = r = 1 m r s q r q ˙ r + ν = 1 k r s q m + ν α ν + r s t , s = 1 , , N ,
we see that
r ˙ s q ˙ r = r s q r + ν = 1 k r s q m + ν α ν q ˙ r , s = 1 , , N , r = 1 , , m ;
hence,
r = 1 m r ˙ s q ˙ r δ q r = r = 1 m r s q r + ν = 1 k α ν q ˙ r r s q m + ν δ q r = r = 1 m r s q r δ q r + ν = 1 k r s q m + ν r = 1 m α ν q ˙ r δ q r = δ q m + ν
by virtue of (26), and (33) is proven. □
The essential hypothesis is (14), which implies (26); otherwise, the property is not verified.

3.3. Virtual Displacements and Possible Instantaneous Velocities

A possible instantaneous velocity r ˙ ^ s is a velocity allowed by the constraints at a given instant of time. Its expression can be inferred from (5) by blocking the time t:
r ˙ ^ s = j = 1 n r s q j q ˙ j = r ˙ s r s / t , s = 1 , , N .
In a nonholonomic system constrained by (10), the corresponding explicit functions (24) assign (36) the form
r ˙ ^ s = r = 1 m r s q r q ˙ r + ν = 1 k r s q m + ν q ˙ m + ν = r = 1 m r s q r q ˙ r + ν = 1 k r s q m + ν α ν .
On the other hand, whenever the condition (14) is assumed to hold, the virtual displacements (31) must satisfy (26); hence,
δ r s = r = 1 n r s q r δ q r + ν = 1 k r s q m + ν δ q m + ν = r = 1 m r s q r + ν = 1 k r s q m + ν r = 1 m α ν q ˙ r δ q r
Let us define the functions
α ¯ ν ( q 1 , , q n , q ˙ 1 , , q ˙ m , t ) : = r = 1 m α ν q ˙ r q ˙ r ν = 1 , , k .
The comparison of (37) with (38) leads to the following.
Property 2.
Assume that (14) holds for the set (10) of kinematic constraints imposed on the system r s ( q , t ) , s = 1 , , N . Then, the virtual displacements δ r s and the possible velocities r ˙ ^ s , s = 1 , , N exhibit the same direction if and only if
α ¯ ν = α ν for any ν = 1 , k ,
on varying, in the same way, the independent generalized velocities q ˙ 1 , …, q ˙ m and the independent displacements δ q 1 , …, δ q m .
Indeed, in the case (40), the functions α ν , ν = 1 , , k , can be replaced by α ν = r = 1 m α ν q ˙ r q ˙ r , and the two expressions (37) and (38) are formally identical.
In the class of constraints satisfying (40), the C ˇ etaev rule (14) enables the overlap of virtual displacements and possible velocities. As a consequence, the condition of ideal constraint s = 1 N R s · δ r s = j = 1 n R ( j ) δ q j = 0 (see (2) and (6)) actually represents the absence of virtual work s = 1 N R s · r ˙ ^ s = j = 1 n R ( j ) q ˙ j = 0 whenever (39) is held.
Remark 4.
More generally, the following relation can be deduced from (19) and (27):
j = 1 n R ( j ) q ˙ j = ν = 1 k ( α ν α ¯ ν ) d d t L q ˙ m + ν L q m + ν F N P ( m + ν ) .
We see that the virtual work vanishes if (39) holds.
A further point which confers interest upon the condition (40) comes from the calculation of the Hamiltonian as the Legendre tranform of L with respect to all the generalized velocities q ˙ j , j = 1 , , n :
H ( q 1 , , q n , q ˙ 1 , , q ˙ n , t ) = j = 1 n q ˙ j L q ˙ j L .
Consider now the Hamiltonian restricted to independent generalized velocities:
H * ( q 1 , , q n , q ˙ 1 , , q ˙ m , t ) = r = 1 m q ˙ r L * q ˙ r L * ,
where
L * ( q 1 , , q n , q ˙ 1 , , q ˙ m , t ) = L ( q 1 , , q n , q ˙ 1 , , q ˙ m , α 1 ( · ) , , α k ( · ) , t )
is the restricted Lagrangian calculated by means of (24) and ( · ) stands for ( q 1 , , q n , q ˙ 1 , , q ˙ m , t ) .
The two functions, (42) and (43), do not coincide, meaning that H q ˙ m + ν = α ν H * . The exact relation, which can be easily checked, is
H * = H + ν = 1 k ( α ¯ ν α ν ) L q ˙ m + ν .
Once again, we see that the hypothesis (40) plays a unifying role in the definition since, in that case, the physical quantities and (42) and (43) do coincide; this allows the energy of the system to be uniquely defined.

4. Homogeneous Constraints

Rather than searching for a mathematical derivation of the elusive Cetaev condition (as long as it exists), we assume the C ˇ etaev rule (14) as a postulate and we examine in more depth the class of constraints fulfilling (40): actually, Properties 1 and 2 of the previous Section assign to nonholonomic systems of the class (40) the same features as we meet in holonomic and linear nonholonomic systems. Hence, as long as the C ˇ etaev rule holds, condition (40) provides a category of constraints which best “fit in” with the rule, which extend in a natural way the theory of holonomic systems. We will conclude (Proposition 1) that this category of constraints coincides with (10), where the functions ϕ ν are homogeneous functions (even of different degrees) with respect to the generalized velocities q ˙ .
We recall that a real-valued function f ( ξ 1 , , ξ ) defined on a domain D R is a positive homogeneous function of degree σ if
f ( λ ξ 1 , , λ ξ ) = λ σ f ( ξ 1 , , ξ ) ( ξ 1 , , ξ ) D and λ > 0 .
Here, we pay attention to constraints which are homogeneous functions with respect to the generalized velocities q ˙ j , j = 1 , , n ; that is, with respect to (10):
ϕ ν ( q , λ q ˙ , t ) = λ ν σ ϕ ν ( q , q ˙ , t ) λ > 0 , ν = 1 , , k ,
where the degree σ ν can be different for each constraint equation. The vast majority of examples and applications in literature of nonholonomic systems with kinematic restrictions belong to the category of constraints (46).
Example 1.
Linear kinematic constraints (15) with b ν = 0 are positive homogeneous functions of degree 1 with respect to the variables q ˙ (in this case, (45) holds also for λ < 0 ) in particular holonomic constraints (13) for f ν , not depending explicitly on time. As an example, for two points, P 1 and P 2 , at a constant distance ℓ, the geometric constraint ( q 1 q 4 ) 2 + ( q 2 q 5 ) 2 + ( q 3 q 6 ) 2 2 = 0 (where ( q 1 , q 2 , q 3 ) and ( q 4 , q 5 , q 6 ) are the Cartesian coordinates of P 1 and P 2 , respectively) is converted by derivation to the homogeneous function of degree 1 ( q 1 q 4 ) 2 ( q ˙ 1 q ˙ 4 ) + ( q 2 q 5 ) 2 ( q ˙ 2 q ˙ 5 ) + ( q 3 q 6 ) 2 ( q ˙ 3 q ˙ 6 ) = 0 .
Example 2.
Given a system of two material points, P 1 and P 2 , the following nonholonomic restrictions, i.e.,
( i ) P ˙ 1 P ˙ 2 = 0 parallel velocities ( i i ) P ˙ 1 · P ˙ 2 = 0 perpendicular velocities ( i i i ) | P ˙ 1 | = | P ˙ 2 | same magnitude of the velocities ( i v ) B ˙ P 1 P 2 = 0 the velocity of midpoint B is parallel to the joining line ( v ) B ˙ · P 1 P 2 = 0 the velocity of B is perpendicular to the joining line ( v i ) P ˙ 1 · P 1 P 2 = P ˙ 2 · P 1 P 2 = 0 the velocities are perpendicular to the joining line
correspond to homogeneous constraints; actually, in fixing the Lagrangian parameters q as the Cartesian coordinates P 1 = ( q 1 , q 2 , q 3 ) , P 2 = ( q 4 , q 5 , q 6 ) , the constraints are
( i ) q ˙ 2 q ˙ 6 q ˙ 3 q ˙ 5 = 0 , q ˙ 3 q ˙ 4 q ˙ 1 q ˙ 6 = 0 , q ˙ 1 q ˙ 5 q ˙ 2 q ˙ 4 = 0 ( i i ) q ˙ 1 q ˙ 4 + q ˙ 2 q ˙ 5 + q ˙ 3 q ˙ 6 = 0 ( i i i ) q ˙ 1 2 + q ˙ 2 2 + q ˙ 3 2 q ˙ 1 2 + q ˙ 2 2 + q ˙ 3 2 = 0 ( i v ) ( q 3 q 6 ) ( q ˙ 2 + q ˙ 5 ) ( q 2 q 5 ) ( q ˙ 3 + q ˙ 6 ) = 0 ( q 1 q 4 ) ( q ˙ 3 + q ˙ 6 ) ( q 3 q 6 ) ( q ˙ 1 + q ˙ 4 ) = 0 ( q 2 q 5 ) ( q ˙ 1 + q ˙ 4 ) ( q 1 q 4 ) ( q ˙ 2 + q ˙ 5 ) = 0 ( v ) ( q 1 q 4 ) ( q ˙ 1 + q ˙ 4 ) + ( q 2 q 5 ) ( q ˙ 2 + q ˙ 5 ) + ( q 3 q 6 ) ( q ˙ 3 + q ˙ 6 ) = 0 ( v i ) ( q 1 q 2 ) q ˙ 1 + ( q 2 q 5 ) q ˙ 2 + ( q 3 q 6 ) q ˙ 3 = 0 ( q 1 q 2 ) q ˙ 4 + ( q 2 q 5 ) q ˙ 5 + ( q 3 q 6 ) q ˙ 6 = 0
and they all show positive homogeneous functions of degree 1 or 2 with respect to the variables q ˙ .
Remark 5.
The just-mentioned conditions may appear abstract, complying only with theoretical requirements. We refer to [8] for a useful and accurate description of devices (rods, blades, wheels, sleds, etc.), which put in practice the constraints listed in Example 2 or a combination of them. Another example of a physically realizable nonlinear nonholonomic mechanical system is the nonholonomic pendulum proposed in [5]. On the other hand, examples of nonlinear nonholonomic models constrained by nonhomogeneous velocity conditions and corresponding to physically realizable systems are, to our knowledge, uncommon, if not fully absent, in literature.
The connection between the condition (40) and the homogeneity of the constraint functions is explained by the following:
Proposition 1.
These three statements are equivalent:
( 1 )
The constraint functions ϕ ν of (10), ν = 1 , , k , are positive homogeneous functions with respect to the variables ( q ˙ 1 , , q ˙ n ) , even of different degrees σ ν .
( 2 )
Any set of explicit functions α ν defined in (24) and deduced from (10) are positive homogeneous functions of degree 1 with respect to the variables ( q ˙ 1 , , q ˙ m ) .
( 3 )
The condition (40) holds for any set of explicit functions α ν , ν = 1 , , k , calculated from (10).
Proof. 
(Essential steps): the implications ( 2 ) ( 3 ) correspond to the Euler’s homogeneous function Theorem, stating that the definition (45) is equivalent to the condition
i = 1 ξ i f ξ i ( ξ 1 , , ξ ) = σ f ( ξ 1 , , ξ ) ( ξ 1 , , ξ ) D ,
which coincides with (40) for σ = 1 and ( ξ 1 , , ξ ) = ( q ˙ 1 , , q ˙ m ) . Furthermore, the implications ( 1 ) ( 2 ) can be proved by virtue of the fact that each derivative f x i i , i = 1 , , , of a homogeneous function f of degree σ is a homogeneous function of degree σ 1 . □
Remark 6.
A significant point in favor of the category (46) of homogeneous constraints is that the formal structure is invariant if a transformation of Lagrangian coordinates q ¯ = q ¯ ( q ) is applied; indeed, the induced linear transformation of the generalized velocities q ¯ ˙ = q ¯ q q ˙ (where the quantity in round brackets indicates the Jacobian matrix) makes the property (46) still valid for the generalized velocities q ¯ ˙ .

5. Conclusions and Next Investigation

The d’Alembert–Lagrange principle combined with the class of displacements (14) offers a simple way of inferring the equations of motion for nonholonomic systems, even with nonlinear kinematic constraints. Actually, the linear conditions (14) on the δ q j allow us an easy transition from the principle (7) to the equations of motion (19) simply by using an argument of linear algebra. A further advantage of (14) combined with the d’A–L P. is that it is not required to express about the possible commutation δ q ˙ j d d t δ q j (this is actually a debated question), which, instead, is indispensable whenever (17) is assumed, and necessarily combined with the integral formulation of a principle, owing to the presence of δ q ˙ j .
A long-standing problem is whether there exists the possibility of deriving the set (14) from the constraint Equation (10). In our opinion, the problem has not been resolved in [21], and the C ˇ etaev rule remains, at least for the moment, without a theoretical explanation. However, this circumstance does not detract from a series of advantages that the rule offers: The use of the d’A–L P. allows the equations of motion to be written in a simple and direct way. The generalization to higher orders (see (30)) occurs through the δ q j alone; therefore, nothing changes from a formal point of view; many other advantageous aspects could be mentioned. We also highlighted how the validity of (39) (which, as reported in Proposition 1, is characteristic of homogeneous constraints of any order) places the mechanical system in a natural physical context, where the displacements coincide with the virtual velocities, the virtual work of the constraint forces is zero, and the energy of the system can be univocally defined (see (41) and (44)). The class of displacements (14) joined with the requirement (39) (homogeneous constraints) appears to be the natural extension of the standard holonomic case to the general case of nonholonomic nonlinear constraints.
The conditions listed in Example 2 show various nonholonomic constraints (some of them nonlinear) of systems that are certainly not marginal in the context of kinematic restrictions. This encourages us to think that (39) essentially covers the set of physically feasible nonholonomic constraints. From a mathematical point of view, the topic to be investigated is the possibility of formulating a zero level set through homogeneous functions. Obviously, the topic concerns stationary constraints and time-dependent ones requires a separate study.
A second issue we are investigating concerns the correlation between the two conditions (14) and (17); on the one hand, it is simple to write the mathematical identity that links them (the so-called transpositional relation); on the other, it is not obvious, for example, which category of systems admits both displacements, or what the role of the commutation rule is. In recent decades, this debate has been animated in the literature [4,22,30].
A further interesting point is to compare the procedure offered by the d’Alembert–Lagrange principle with different approaches (as variational methods) which generalize standard methods for holonomic systems to the case of nonholonomic systems, especially with nonlinear constraints. A stimulating starting point is the recent paper [31], where the Hamiltonian and the action functional are extended by introducing an extra variable performing a sort of dissipation. The corresponding Lagrange problem leads to equations comparable with (19). Furthermore, the method used in [31] is suitable to deal with the energy question, which we simply mention via Formulae (27)–(44).
In the present work, we focused on the theoretical questions about the Cetaev condition and the link with homogeneous constraints omitting a numerical approach: moving to these kinds of questions (Lagrangian mechanics versus vakonomic mechanics (transpositional relation)), neither numerical simulations nor experimental validations provide strong support for the theoretical conclusions.

Funding

This research received no external funding.

Data Availability Statement

No datasets were generated or analysed during the current study.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Gantmacher, F.R. Lectures in Analytical Mechanics; Mir Publisher: Moskow, Russia, 1970. [Google Scholar]
  2. Pars, L.A. A Treatise on Analytical Dynamics; Heinemann Educational Books Ltd.: London, UK, 1965. [Google Scholar]
  3. Neĭmark, J.I.; Fufaev, N.A. Dynamics of Nonholonomic Systems; Translations of Mathematical Monographs; American Mathematical Society: Providence, RI, USA, 1972; Volume 33. [Google Scholar]
  4. Li, S.M.; Berakadar, J. A generalization of the Chetaev condition for nonlinear nonholonomic constraints: The velocity-determined virtual displacement approach. Rep. Math. Phys. 2009, 63, 179–189. [Google Scholar] [CrossRef]
  5. Benenti, S. The non–holonomic double pendulum, an example of non-linear non-holonomic system. Regul. Chaotic Dyn. 2011, 1, 417–442. [Google Scholar] [CrossRef]
  6. Zeković, D.N. Dynamics of mechanical systems with nonlinear nonholonomic constraints—I The history of solving the problem of a material realization of a nonlinear nonholonomic constraint. Z. Angew. Math. Mech. 2011, 91, 883–898. [Google Scholar] [CrossRef]
  7. Flannery, M.R. The enigma of nonholonomic constraints. Am. J. Phys. 2005, 73, 265–272. [Google Scholar] [CrossRef]
  8. Zeković, D.N. Dynamics of mechanical systems with nonlinear nonholonomic constraints—III Analysis of motion. Z. Angew. Math. Mech. 2011, 93, 550–574. [Google Scholar] [CrossRef]
  9. Lurie, A.I. Analytical Mechanics; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
  10. Papastavridis, J.G. Analytical Mechanics: A Comprehensive Treatise on the Dynamics of Constrained Systems; World Scientific: Singapore, 2014. [Google Scholar]
  11. Benenti, S. A general method for writing the dynamical equations of nonholonomic systems with ideal constraints. Regul. Chaotic Dyn. 2008, 13, 283–315. [Google Scholar] [CrossRef]
  12. Borisov, A.V.; Mamaev, I.S.; Bizyaev, I.A. Dynamical systems with non–integrable constraints, vakonomic mechanics, sub–Riemannian geometry, and non–holonomic mechanics. Russ. Math. Surv. 2017, 72, 783–840. [Google Scholar] [CrossRef]
  13. Krupková, O. The nonholonomic variational principle. J. Phys. A Math. Theor. 2009, 42, 185201. [Google Scholar] [CrossRef]
  14. Marle, C.M. Various approaches to conservative and nonconservative nonholonomic systems. Rep. Math. Phys. 1998, 42, 211–229. [Google Scholar] [CrossRef]
  15. Mei, F. Nonholonomic mechanics. Appl. Mech. Rev. 2000, 11, 283–305. [Google Scholar] [CrossRef]
  16. Četaev, N.G. On the Gauss Principles; Papers on Analytical Mechanics; Science Academy: Washington, DC, USA, 1962; Volume 323. [Google Scholar]
  17. Rumyantsev, V.V. On the Chetaev principle. Dokl. Akad. Nauk SSSR 1973, 210, 787–790. (In Russian) [Google Scholar]
  18. Rumyantsev, V.V. Forms of Hamilton’s principle for nonholonomic systems. Mech. Autom. Robot. 2000, 2, 1035–1048. [Google Scholar]
  19. Lewis, A.D.; Murray, R.M. Variational principles for constrained systems: Theory and experiment. Int. J. Non-Linear Mech. 1995, 30, 793–815. [Google Scholar] [CrossRef]
  20. Xu, S.S.; Li, S.M.; Berakdar, J. Appell–Hamel dynamical system: A nonlinear test of the Chetaev and the vakonomic model. ZAMM Z. Angew. Math. Mech. 2007, 87, 692–697. [Google Scholar] [CrossRef]
  21. Flannery, M.R. d’Alembert–Lagrange analytical dynamics for nonholonomic systems. J. Math. Phys. 2011, 52, 032705. [Google Scholar] [CrossRef]
  22. Lemos, N.A. Complete inequivalence of nonholonomic and vakonomic mechanics. Acta Mech. 2022, 233, 47–56. [Google Scholar] [CrossRef]
  23. Li, S.M.; Berakadar, J. On the validity of the vakonomic model and the Chetaev model for constraint dynamical systems. Rep. Math. Phys. 2007, 60, 107–115. [Google Scholar] [CrossRef]
  24. Kozlov, V.V. Dynamics of systems with nonintegrable constraints. I. Mosc. Univ. Mech. Bull. 1982, 37, 27–34. [Google Scholar]
  25. Kozlov, V.V. Dynamics of systems with nonintegrable constraints. II. Mosc. Univ. Mech. Bull. 1982, 37, 74–80. [Google Scholar]
  26. Kozlov, V.V. Dynamics of systems with nonintegrable constraints. III. Mosc. Univ. Mech. Bull. 1983, 38, 40–51. [Google Scholar]
  27. Rund, H. The Hamilton–Jacobi Theory in the Calculus of Variations; Van Nostrand: London, UK, 1966. [Google Scholar]
  28. Saletan, E.J.; Cromer, A.H. A variational principle for nonholonomic systems. Am. J. Phys. 1970, 34, 406–408. [Google Scholar] [CrossRef]
  29. Krupková, O. Higher order mechanical systems with constraints. J. Math. Phys. 2000, 41, 5304–5324. [Google Scholar] [CrossRef]
  30. Yang, W.; Lifu, L.; Zhongwei, L. On mechanical property of constraint. Appl. Math. Mech. (Engl. Ed.) 1995, 16, 1095–1103. [Google Scholar] [CrossRef]
  31. Liu, Q.; Torres, P.J.; Wang, C. Contact Hamiltonian dynamics: Variational principles, invariants, completeness and period behavior. Ann. Phys. 2018, 395, 26–44. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Talamucci, F. On the Četaev Condition for Nonholonomic Systems. AppliedMath 2025, 5, 29. https://doi.org/10.3390/appliedmath5010029

AMA Style

Talamucci F. On the Četaev Condition for Nonholonomic Systems. AppliedMath. 2025; 5(1):29. https://doi.org/10.3390/appliedmath5010029

Chicago/Turabian Style

Talamucci, Federico. 2025. "On the Četaev Condition for Nonholonomic Systems" AppliedMath 5, no. 1: 29. https://doi.org/10.3390/appliedmath5010029

APA Style

Talamucci, F. (2025). On the Četaev Condition for Nonholonomic Systems. AppliedMath, 5(1), 29. https://doi.org/10.3390/appliedmath5010029

Article Metrics

Back to TopTop