1. Introduction
In the wide class of differential equations, a distinct category consists of those equations where the state at present time depends on the past evolution of the system (the so-called functional differential equations); in this class, one can identify the situation where the maximum of the state on a past interval of time is involved, namely the differential equations with maxima. Such problems have been studied for a long time, starting with an example in the automatic control of some technical systems in Popov [
1], and their study includes various qualitative properties, such as oscillatory properties, stability, periodicity, or approximation for differential, difference, or integral equations (see Bainov, Hristova [
2], Cernea [
3] or Otrocol, Rus [
4] for a comprehensive discussion of the topic).
Additionally, the theory of differential equations where the usual derivative is replaced by the Stieltjes derivative with respect to a left-continuous non-decreasing function 
g (Pouso, Rodriguez [
5]) significantly grew in the recent few years (e.g., Frigon, Pouso [
6], Monteiro, Satco [
7], Marraffa, Satco [
8], Pouso, Márquez Albés [
9], Satco, Smyrlis [
10,
11]) due to its interesting real-world applications (e.g., Fernandez et al. [
12], Frigon, Pouso [
6], Pouso, Márquez Albés [
9]).
This growth is also motivated by the fact that Stieltjes differential equations are deeply related to measure differential problems (Cao, Sun [
13], Di Piazza et al. [
14], Federson et al. [
15], Monteiro, Satco [
16], Pouso, Rodriguez [
5], Satco [
17]); thus, their study allows one to cover usual differential problems (for an absolutely continuous map 
g), discrete problems (when 
g is a sum of step functions), impulsive equations (for 
g being a sum of an absolutely continuous map with step maps), dynamic equations on time scales (Federson et al. [
15]), and generalized differential equations (see Slavík [
18], Kurzweil [
19], Schwabik [
20]).
Additionally, the classical Filippov lemma (Aubin, Cellina [
21]) allows one to find a solution of a differential inclusion which satisfies a specific estimation comparing to a given “quasi-solution”. It was generalized in many directions (to the case of impulsive functional inclusions in Djebali et al. [
22], to general topologic and measure spaces in Borisenko [
23]—see also Tolstonogov [
24]—to measure differential problems in Fryszkowski, Sadowski [
25], to Stieltjes differential inclusions in Satco, Smyrlis [
11] and to inclusions with maxima in Cernea [
3], to give only a few examples).
In this work, having in mind the major implications of Filippov-type lemmas in obtaining qualitative features of differential problems (for instance, to relaxation properties, e.g., Marraffa, Satco [
26], Cernea [
3]), a Filippov-type result is obtained for differential inclusions with supremum driven by the Stieltjes derivative:
      here, 
 is non-decreasing and left-continuous, and 
 and 
 for every 
.
If g is continuous, then the supremum is attained, and one can speak about differential problems with maxima. On the other hand, as we shall see by an example, the supremum is not necessarily a maximum when g is discontinuous.
As a corollary, the first existence result for Lipschitz differential inclusions with supremum is deduced in this wide setting of differential problems driven by Stieltjes derivative.
Notably, Stieltjes differential inclusions with a constant (resp. variable) delay can be thus investigated by taking  with h a non-negative constant (respectively, ).
Moreover, the set-valued framework is taken into consideration (more general than the single-valued setting of differential equations), where differential problems with maxima or supremum were studied, as far as the authors know, only in the case where 
g is the identity map (e.g., Cernea [
3]) or when the fractional derivative is considered (Cernea [
27]).
Dynamic problems on time scales can be seen as Stieltjes differential problems (Frigon, Pouso [
6]); therefore, an application to dynamic inclusions on time scales with supremum is also included. It is important to note that the obtained Filippov-type lemma is new (comparing to the only such result, given in Rafaqat et al. [
28]) even in the case where the supremum of the map is not present in the multifunction on the right-hand side.
As a final remark concerning the generality of the outcome, it has to be mentioned that several known results are covered by taking 
F a single-valued map (for usual differential problems with maxima Bainov, Hristova [
2] or impulsive equations supremum Hristova, Georgieva [
29]), or by particularizing 
 and 
 (for instance, ([
3], Theorem 1) can be inferred when 
, 
 and 
).
  2. Notations and Auxiliary Results
Let 
 be a left-continuous non-decreasing function. The measurability with respect to (w.r.t.) the 
-algebra defined by 
g will be called 
g-measurability, and the Stieltjes measure generated by 
 will be denoted by 
 (see ([
30], Example 6.14) or ([
31], paragraph 1.5)).
The Lebesgue–Stieltjes (LS-) integrability w.r.t. 
g ([
31]; also see [
32]) represents the abstract Lebesgue integrability w.r.t. the Stieltjes measure 
. Let 
 be the space of real functions LS-integrable w.r.t. 
g, endowed with the usual topology given by the norm
      
      The 
g-topology on 
 is the topology with the basis of the class of all sets
      
Definition 1.  A function  is said to be g-continuous ([6]) if it is continuous w.r.t. the g-topology, i.e., if for any  and any  there exists  such that  In [
6], it was proved that a 
g-continuous function might not have the right limit 
 at the discontinuity points of 
g and it might not be even bounded. For this reason, the space 
 of real bounded, 
g-continuous functions was considered ([
6]). It was proved that, endowed with the norm
      
      it becomes a Banach space.
Unlike for maps continuous in the usual topology, the supremum might not be necessarily a maximum for g-continuous functions, as it can be seen in the following example.
Example 1.  Let  be given byand  byIt can be seen that f is bounded and for every  there exists , such that whenever  satisfy , one has . This implies that f is g-continuous. Yet, obviously, f does not attain its maximum on the interval .  A notion of differentiability with respect to a left-continuous non-decreasing function introduced in [
5] by an idea in [
33] will be used in what follows.
Definition 2.  The Stieltjes (or g-) derivative of a function  with respect to g at a point  is  if the limits exist. The formula is meaningless on the set
      
      which is, by [
5], 
-null (it is worthwhile to mention that in the same way as in ([
12], Definition 3.7), the definition of the 
g-derivative can be extended to cover also the points of this set).
The particular cases of the theory of Stieltjes differential equations are obtained by taking particular functions g. For instance, when , one gets ordinary differential equations. In the case where g is a sum of step functions, one finds difference equations. For the case where g is a sum of the identical function with step functions, impulsive problems are obtained problems. So, the advantage of using Stieltjes derivatives is that they allow one to include a wide class of problems in one study.
This notion of derivatives has intensively been used in solving various problems where abrupt modifications (corresponding to discontinuity points of 
g) and stationary intervals of time (corresponding to intervals where 
g is constant) occur, such as [
6,
34] or [
9].
The properties to be imposed on the solutions of the considered problem are described below.
Definition 3.  A function  is called g-absolutely continuous (g-AC; see [5]) if for every  there is  such that for any collection  of non-overlapping subintervals of   Obviously, the g-absolute continuity involves g-continuity.
The fundamental theorem of calculus for LS-integrals offers a characterization of g-AC maps in terms similar to usual absolutely continuous maps.
Theorem 1.  ([5], Theorem 5.4) A map  is g-absolutely continuous if and only if it is g-differentiable -a.e.,  is LS-integrable w.r.t. g and  For elementary notions of set-valued analysis, the reader is referred to [
21,
35] or [
36]. 
 will denote the family of nonempty compact subsets of 
. If 
, the Hausdorff–Pompeiu metric is defined by
      
      where the (Pompeiu) excess of the set 
A over the set 
C is
      
  3. Main Results
Let  be non-decreasing and left-continuous; thus,  and the g-measurable functions  satisfy  for every .
This section aims to provide a Filippov-type result for the problem .
In the same way it was carried out in ([
4], Lemma 3.1) for continuous maps, the following estimation can be proved.
Lemma 1.  For every ,  Proof.  The assertion easily follows from the inequality
        
        by passing to the following supremum:
        
        □
 The main result of this paper is presented in what follows.
Theorem 2.  Let  have the following properties:
For every ,  is g-measurable;
There exist  such that for -a.e.  and for all , 
Then, given a g-absolutely continuous function , if  satisfiesthere exists a solution  of  such that  Proof.  The well-known Filippov method of proof will be adapted to the present setting, as in [
25]. A Cauchy sequence 
 will be defined along with a corresponding sequence 
 of 
g-absolutely continuous functions, which will bring us to a solution of 
 with the required property.
Thus, let 
 be given by 
 and
        
        (it is known that 
 exists 
-a.e. by Theorem 1 and that it belongs to 
).
By hypothesis,
        
        thus, by the classical Kuratowski–Ryll–Nardzewski theorem, one can choose a 
g-measurable selection 
 of the 
g-measurable ([
36], Chapter III) closed-valued multifunction
        
        i.e.,
        
        In particular, this implies that 
.
Now, denote by 
        it can be seen that
        
        and, using Lemma 1,
        
        Next, the right-hand side of the inequality is 
g-measurable (in fact, it belongs to 
), so if
        
        again by Kuratowski-Ryll-Nardzewski theorem, one can find a 
g-measurable selection 
 of the nonempty closed-valued 
g-measurable multifunction
        
        This implies that
        
        (so, 
) and we define 
 by 
, for each 
. It follows that
        
        One proceeds further in the same way to obtain a selection 
 of
        
        for every 
 and to define 
 by
        
        One can see as before that
        
        If one denotes by
        
        then for all 
,
        
        Similarly to [
25], it can be proved by mathematical induction that for every 
,
        
        where
        
        By applying ([
25], Proposition 4.5), one obtains
        
        hence, it follows that 
 is Cauchy, so it converges towards a function 
.
The sequence 
 is Cauchy as well, and so it tends to some 
. By passing to the limit in 
, one gets, by dominated convergence theorem,
        
        Using the hypotheses on 
F, for 
-a.e. 
 and every 
, there exists 
 such that
        
        it follows that
        
        Finally, as for each 
,
        
        the proof is over.    □
 In particular, the following corollary can be deduced (one thus generalizes ([
3], Theorem 1) which is available in the case where 
 on 
 and 
, 
).
Corollary 1.  Suppose that  is jointly measurable and Lipschitz in the last two arguments.
Ifthen given a g-absolutely continuous function , if  satisfiesthere exists a solution  of  such that  Proof.  The hypotheses of Theorem 2 are obviously satisfied, so all one has to do is remark that for any 
,
        
□
 It is useful to mention that the main theorem involves the following existence result for nonconvex Stieltjes differential inclusions with supremum.
Corollary 2.  Let  satisfy the assumptions of Theorem 2.
Then, the differential problem  has g-absolutely continuous solutions on .
 Remark 1.  Using Theorem 2, a relaxation result could be obtained for Stieltjes differential inclusions with supremum following the idea in [26]. Note that the method applied in [3] to this purpose is not applicable here since the key result connecting the adherence of the integral of a multifunction and the adherence of the integral of its closed convex hull ([37], Corollary 4.3) works only for nonatomic measures, while  is not nonatomic (more precisely, it has atoms whenever g has discontinuity points).    4. Application to Dynamic Inclusions on Time Scales with Supremum
It will be shown in what follows that Theorem 2 involves a Filippov-type result for dynamic inclusions on time scales with supremum.
Let 
 be a bounded time scale (i.e., a nonempty and closed subset of 
, with the usual topology). Suppose that 
, denote by 
 and suppose
      
      meaning that the time scale is typical ([
6]).
For the convenience of the reader, the basic notions of time-scale analysis will be briefly recalled (see [
38]).
The forward jump operator  is given by  (by convention, ).
A point 
 is called right-dense, resp. right-scattered, if 
, resp. 
. It was proved (in [
39], Lemma 3.1) that only, at most, countably many points (
) are right-scattered. The Lebesgue measure 
 on 
 was studied in [
39], as well as the Lebesgue 
-integral (see also  [
38,
40]).
(Ref. [
39], Theorem 5.2) asserts that if 
, for any 
,
      
      (the time-scale interval 
).
The idea of this section is to transform the problem on time scales into a Stieltjes differential problem; to achieve this aim, as in [
41], consider the left-continuous and non-decreasing map 
 given by
      
      which has the properties
      
      For any function 
, its Slavik extension ([
41]) is
      
      It is known (e.g., [
8]) that
      
Definition 4.  ([38]) The function  is called Δ
-differentiable at  if there is  (the Δ
-derivative of f at t), such that for any , there is a neighborhood of t on which  The focus in this section is on a dynamic inclusion on time scales with supremum:
      where 
 and 
 are 
-measurable and satisfy the condition 
 for each 
.
Definition 5.  A map  is said to be a solution of this problem if it is Δ-absolutely continuous and, moreover, there exists a Lebesgue Δ-integrable selection f of  with , -a.e. and .
 Let us recall ([
42]) the following notion.
Definition 6.  One says that  is Δ
-absolutely continuous if for every  one can find  such thatfor any family  of non-overlapping intervals with  and .  The key result for the connection between the dynamic problem on time scales and a Stieltjes differential problem is the following.
Lemma 2.  A function  is a solution of problem  if and only if  is a solution of  Proof.  For every solution 
x of 
, there exists a selection 
f of 
 such that
        
        In the same way as in [
6], one can prove that at any point 
 where the 
-derivative exists,
        
        hence, 
 for every 
. Additionally, taking into account the definition of the Lebesgue 
-measure ([
39], 
Section 2) and the fact that the 
g-derivative of 
 is not defined on 
 since 
g is constant (but it is well-known, from [
5], that 
), one gets
        
        Moreover, since 
 is compact, it follows that for every 
, we have
        
        and as in 
, the equality 
 holds, it follows that
        
        With the remark that 
, it can be stated that 
 is indeed a solution of 
.
Conversely, if 
y is a solution of 
, then the map 
 given by 
 for every 
 is a solution of the dynamic problem 
 since
        
        □
 Using this auxiliary result, it is now possible to infer a Filippov-type lemma for dynamic inclusions on time scales with supremum from Theorem 2 (in an obvious manner).
Theorem 3.  Let  be Δ-measurable and satisfy the condition  for each . Let  verify the following hypotheses:
For every ,  is Δ-measurable;
There exist  Lebesgue Δ
-integrable on  such that -a.e. on , for all , as follows: 
Then, given a Δ
-absolutely continuous function , if a Lebesgue Δ
-integrable map  satisfiesthere exists a solution  of  such that  Remark 2.  Even in the case where the supremum is not present in the problem , Theorem 3 gives a new Filippov-type result for dynamic inclusions on time scales (different from ([28], Theorem 4.2), where the estimation on the right-hand side can be expressed in terms of an exponential function adjusted to the time-scale setting—see ([38], Theorem 2.77)).