An Abstraction Technique for Verifying Shared-Memory Concurrency

Oortwijn, Wytse; Gurov, Dilian; Huisman, Marieke

doi:10.3390/app10113928


An Abstraction Technique for Verifying Shared-Memory Concurrency^†

by Wytse Oortwijn ¹,*, Dilian Gurov ² and Marieke Huisman ³

¹ Department of Computer Science, ETH Zurich, 8092 Zurich, Switzerland

² Department of Theoretical Computer Science, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden

³ Formal Methods and Tools, University of Twente, 7500 AE Enschede, The Netherlands

* Author to whom correspondence should be addressed.

† This paper is an extended version of our paper published in the 21st International Conference on Verification, Model Checking, and Abstract Interpretation, held in New Orleans, LA, USA, 19–21 January 2020.

Appl. Sci. 2020, 10(11), 3928; https://doi.org/10.3390/app10113928

Submission received: 30 April 2020 / Revised: 29 May 2020 / Accepted: 2 June 2020 / Published: 5 June 2020

(This article belongs to the Special Issue Static Analysis Techniques: Recent Advances and New Horizons)


Abstract:

Modern concurrent and distributed software is highly complex. Techniques to reason about the correct behaviour of such software are essential to ensure its reliability. To be able to reason about realistic programs, these techniques must be modular and compositional as well as practical by being supported by automated tools. However, many existing approaches for concurrency verification are theoretical and focus primarily on expressivity and generality. This paper contributes a technique for verifying behavioural properties of concurrent and distributed programs that balances expressivity and usability. The key idea of the approach is that program behaviour is abstractly modelled using process algebra, and analysed separately. The main difficulty is presented by the typical abstraction gap between program implementations and their models. Our approach bridges this gap by providing a deductive technique for formally linking programs with their process-algebraic models. Our verification technique is modular and compositional, is proven sound with Coq, and has been implemented in the automated concurrency verifier VerCors. Moreover, our technique is demonstrated on multiple case studies, including the verification of a leader election protocol.

Keywords:

concurrency verification; program logics; process algebra; code verification; abstraction

1. Introduction

Modern software is typically composed of multiple concurrent components that communicate via shared or distributed interfaces, for example via shared memory or via message passing. The concurrent nature of the interactions between (sub)components makes such software highly complex as well as notoriously difficult to develop correctly. To ensure the reliability of modern software, verification techniques are much needed to help software developers comprehend all possible concurrent system behaviours. To be able to reason about realistic programs, these techniques must be modular and compositional, as well as supported by automated verification tools.

Even though verification of concurrent and distributed software is a very active research field [1,2,3,4,5,6], most work in this line of research is essentially theoretical, and tends to focus primarily on contributing expressive program logics specialised in reasoning about advanced concurrency features like relaxed or weak memory, fine-grained concurrency, message passing interaction, etc. Although these logics are expressive, it is very challenging to integrate them into SMT-based automated verifiers such as VeriFast [7], VerCors [8] and Viper [9,10]. Instead, most of these works have to be applied in pen-and-paper style, or at best semi-automatically in the context of an interactive theorem prover like Coq [11,12] or Isabelle/HOL [13].

This article contributes a concurrency verification technique that applies directly at the level of program code and is supported by automated verifiers. However, rather than doing the verification fully at the level of program code, our approach allows program behaviour to be soundly abstracted into models that can be reasoned about externally, at a higher level where irrelevant implementation details are hidden, to (indirectly) prove properties about the program behaviour. The presented verification technique (1) has been implemented in VerCors, an automated SMT-based concurrency verifier; (2) is demonstrated on various (real-world) examples, including a leader election protocol (presented in Section 5); and (3) has a metatheory that is fully formalised and proven sound with the Coq proof assistant. Regarding (2), apart from the examples given in this article, more examples of our approach are given in [14], including the verification of a (reentrant) lock as well as a parallel GCD algorithm. Our technique has also been used in a real-world industrial case study [15] on the formal verification of a safety-critical traffic tunnel control system.

This article extends our earlier VMCAI’20 article [16]. With respect to this earlier article, we contribute a generalisation of the theory in [16], obtained by combining it with the techniques proposed in [17] and [18] into a single logical framework that is more general than the original. This unified framework is proven sound with Coq and is available online at [19].

1.1. Motivation

Reasoning about complex concurrent program behaviours is only practical if conducted at a suitable level of abstraction that hides implementation details that are irrelevant for the properties to prove. Furthermore, any real concurrent programming language with shared memory, threads and locks, has only very little algebraic behaviour. In contrast, process algebras offer an abstract, mathematically elegant way of expressing program behaviour. Process algebras have been used widely in the past for modelling and analysing the behaviour of concurrent programs at an adequate level of abstraction [20,21]. Our approach therefore uses a process algebra as a language for specifying program behaviour. Such a specification can be seen as a model, the properties of which can additionally be checked (say by interactive theorem proving, or by model checking against temporal logic formulas, which can be seen as even more abstract behavioural specifications). The main difficulty of this approach is dealing with the typical abstraction gap between program implementations and their abstract models. The unique contribution of our approach is that it bridges this gap by providing a deductive technique for formally linking programs with their process-algebraic models. These formal links preserve safety properties [22]; we leave the preservation of liveness properties for future work.

The key idea of the approach rests in the use of concurrent separation logic (CSL) to reason not only about data races and memory safety, which is standard [23,24], but also about process-algebraic models (that is, specified program behaviours), viewing the latter as resources that can be split and consumed. This results in a modular and compositional approach to establish that a program behaves as specified by its abstract model. Our approach is formally justified by (mechanically proven) correctness results stating that any verified program is a refinement of its abstract, process-algebraic model.

Process-algebraic models are composed out of individual actions that abstract atomic behaviours of program components. Our approach allows specifying program components to follow a particular sequence/pattern of actions—a protocol. One can then reason about the interaction behaviour of different program components by reasoning about the composition of their models, for example by using a model checker for process algebra, like mCRL2 [25]. This approach of specifying the interactions of program components is different from classical Hoare logic, which is purely transformational in the sense that it considers verified (terminating) program components essentially as transformers from states satisfying the specified precondition to states satisfying the specified postcondition.

A benefit of our combined approach compared to model checking is that it allows reasoning soundly about both data and control-oriented properties in a single framework. Model checkers typically specialise in reasoning about temporal, control-oriented specifications (e.g., send actions must always be matched by a recv), and generally have limited support for handling data due to the risk of state-space explosions. Hoare logic based techniques, on the other hand, tend to specialise in reasoning about data specifications (e.g., a sorting function should yield a sorted permutation of its input), and are typically limited in their capabilities to reason about control-flow properties. Since realistic concurrent systems often deal with both data and control-flow, it is beneficial to be able to reason about both in a single framework. Additionally, our technique addresses the typical “abstraction gap” problem of model checking: is the model actually a faithful abstraction of the modelled system? We propose techniques to formally link programs to their abstract models, allowing one to prove that all program behaviours that should be captured by the abstract model are indeed soundly abstracted.

1.2. Contributions

The main contributions of this extended article are:

A verification technique to reason about the behaviour of shared-memory concurrent programs that is modular, compositional, and proven sound. This article extends [16] by generalising its verification technique and combining it with the core ideas of [17,18]. In particular, it extends the process algebra specification language with summations, support for input parameters, and the assertional processes of [17], which shall all be introduced later, in Section 3.
A full Coq development of the formalisation as presented in Section 3, together with a soundness proof of the approach. The Coq sources and their documentation are available at [19].
Several examples that demonstrate this new (unified) verification approach, including a leader election protocol case study discussed in Section 5.

1.3. Outline

The remainder of this article is organised as follows. First, Section 2 illustrates our technique on a small Owicki–Gries example program, before Section 3 gives theoretical justification of the verification technique. In particular, Section 3.1 introduces the process algebra specification language, after which Section 3.2 introduces the programming language on which the approach is formalised. Section 3.3 defines and discusses the syntax and semantics of the assertion language, which is a concurrent separation logic with special constructs to handle process-algebraic models. Section 3.4 discusses the proof system and Section 3.5 its soundness. Section 4 gives details on how the verification technique is implemented in the concurrency verifier VerCors, and briefly elaborates on the Coq development. Section 5 demonstrates the approach on a larger case study: the verification of a classical leader election protocol. Finally, Section 6 discusses related work and Section 7 concludes.

2. Approach

Before going into the formal details of the approach, let us first illustrate it on a simple example. Our approach allows abstractly specifying concurrent program behaviour as process-algebraic models. Processes are composed of atomic, indivisible actions. In our approach the actions are logical descriptions of shared-memory modifications: they describe what changes the program is allowed to make to a specified region of shared memory—the program heap. These actions are then linked to the concrete instructions in the program code that perform the memory updates. These links between program components and their abstract models are established deductively, using a concurrent separation logic that is presented later. Well-known techniques for process-algebraic reasoning can then be applied to guarantee safety properties over all possible state changes, as described by their compositions of actions. The novelty of the approach is that these safety properties can then be relied upon in the program logic due to the established formal connection between program components and their process-algebraic models.

2.1. Example Program

Consider the following example program, which is a simple variant of the classical concurrent Owicki–Gries program [26].

atomic \{X := [E]; [E] := X + 4\} ‖ atomic \{Y := [E]; [E] := Y * 4\}

This program consists of two threads: one atomically increments the value at heap location E by four, while the other atomically multiplies the value at E by four. The notation $[E]$ denotes heap dereferencing, with E an expression whose evaluation determines the heap location to dereference.

The challenge is to modularly deduce the classical Owicki–Gries postcondition: after termination the value at heap location E is either $4 * ({old}_{E} + 4)$ or $(4 * {old}_{E}) + 4$ (depending on the interleaving of threads), where ${old}_{E}$ is the “old value at E”, that is, the value at E in the pre-state of the computation.
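To make the two possible outcomes concrete, the following minimal sketch simulates the example program in Python. It is an illustration only: the heap is modelled as a dictionary, and the function names `incr`, `mult` and `final_values` are hypothetical. Because both blocks are atomic, the only interleavings are the two possible orders of the blocks.

```python
from itertools import permutations


def incr(heap, loc, n=4):
    # Models: atomic { X := [E]; [E] := X + 4 }
    x = heap[loc]
    heap[loc] = x + n


def mult(heap, loc, n=4):
    # Models: atomic { Y := [E]; [E] := Y * 4 }
    y = heap[loc]
    heap[loc] = y * n


def final_values(old):
    """Run both orders of the two atomic blocks and collect the final values at E."""
    results = set()
    for order in permutations((incr, mult)):
        heap = {"E": old}  # fresh heap for every interleaving
        for block in order:
            block(heap, "E")
        results.add(heap["E"])
    return results
```

For an initial value of 1, the sketch yields the two values (1 + 4) * 4 and 1 * 4 + 4, which are exactly the two disjuncts of the postcondition.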

Well-known classical techniques for dealing with such concurrent programs [27] include auxiliary state [26] and interference abstraction via rely-guarantee reasoning [28]. Modern program logics employ more intricate constructs, like atomic Hoare triples [5] in the context of TaDa, or higher-order ghost state [29] in the context of Iris. However, the mentioned classical approaches typically do not scale well, whereas such modern, theoretical approaches are hard to integrate into (semi-)automated SMT-based verifiers such as VeriFast or VerCors.

In contrast, our approach makes a balanced trade-off between expressivity and usability: it is scalable as well as implemented in an automated deductive verifier.

The approach consists of the following three steps:

Step 1: Define a process-algebraic model $OG = (incr (4) ‖ mult (4)) \cdot ? (b_{post})$ that is composed out of two actions, $incr$ and $mult$, that abstract the two atomic sub-programs;
Step 2: Verify that the $OG$ process indeed satisfies the Owicki–Gries postcondition, $b_{post}$; and
Step 3: Deductively verify that $OG$ is a correct behavioural specification of the program’s execution flow. That is, verify that every atomic state change that is executed by a run of the program has a corresponding action in $OG$.

The following paragraphs give more detail on these three steps.

2.1.1. Step 1: Specifying Program Behaviour

The first step is to construct a behavioural specification OG of the example program. The OG process is defined to be the parallel composition of the actions $incr (4)$ and $mult (4)$, which specify the behaviour of the atomic increment and multiplication in the program, respectively. In our approach, program behaviour is specified logically, by associating a contract with every action. For the example program, incr and mult would have the following contracts:

\begin{matrix} guard true; & guard true; \\ effect x = \old (x) + n; & effect x = \old (x) * n; \\ action incr (int n); & action mult (int n); \end{matrix}

Any action contract consists of a guard and an effect. The guard of an action specifies the condition under which the action is allowed to be executed. In the above example, the guard of both incr and mult is specified to be true, meaning that both these actions may unconditionally be performed. The effect clause of an action specifies the way the action is allowed to change the (program) state. Observe that incr and mult are indeed abstractions of the two atomic sub-programs, and that the effect clauses of these actions are abstract specifications of how the program updates the heap. (Note that one could think of the guards and effects of actions as pre- and postconditions, respectively. However, they are not strictly the same, hence the slightly different terminology. For the sake of process-algebraic analysis all action contracts can be assumed to hold, while on the program level one has to prove that the sets of instructions that correspond to the action satisfy the action contract, as will be explained in a moment.) Note that both these abstract specifications contain a free variable x, which is a process-algebraic variable that is later linked to a concrete heap location in the program (this will be $[E]$). Moreover, the increment and multiplication by 4 have now been generalised to an arbitrary integer n.
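One way to picture these contracts is as pairs of predicates over the old and new value of x. The sketch below is an assumption-laden illustration in Python, not VerCors syntax: `ActionContract`, `INCR`, `MULT` and `complies` are hypothetical names, and the effect predicate takes the old value explicitly in place of the `\old (x)` notation.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class ActionContract:
    guard: Callable[[int, int], bool]        # guard(x, n): may the action fire?
    effect: Callable[[int, int, int], bool]  # effect(old_x, new_x, n): allowed change?


# Contracts of the two actions from the example (guard true in both cases).
INCR = ActionContract(guard=lambda x, n: True,
                      effect=lambda old_x, x, n: x == old_x + n)
MULT = ActionContract(guard=lambda x, n: True,
                      effect=lambda old_x, x, n: x == old_x * n)


def complies(contract, old_x, new_x, n):
    """True if the state change old_x -> new_x is permitted by the contract."""
    return contract.guard(old_x, n) and contract.effect(old_x, new_x, n)
```

For instance, `complies(INCR, 3, 7, 4)` holds, whereas `complies(INCR, 3, 12, 4)` does not: changing x from 3 to 12 is a multiplication by 4 and is therefore only allowed by `MULT`.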

These two actions may be composed into a full behavioural specification of the example program, by also assigning a top-level contract to OG:

requires true;

process OG (int n) := (incr (n) ‖ mult (n)) \cdot ? (x = (\old (x) + n) * n \lor x = \old (x) * n + n);

Notice that the $OG$ process has the form $(incr (n) ‖ mult (n)) \cdot ? (b_{post})$ with $b_{post}$ the Owicki–Gries postcondition. Here · denotes sequential composition, and $? (b_{post})$ is an assertion process. These assertions are the main subject of process-algebraic reasoning: we verify that no asserted property is ever violated. Here we specify that $? (b_{post})$ holds after executing $incr (n)$ and $mult (n)$ in any order.

The $OG$ process also has a precondition that could potentially impose restrictions on the values of n. But for this Owicki–Gries example we do not have any such restrictions. Note that postconditions (that is, ensures clauses) are encoded as assertional processes, as done above.

2.1.2. Step 2: Process-Algebraic Reasoning

The next step is to verify that OG satisfies all properties b that are encoded as assertions $? (b)$, which can be reduced to standard process-algebraic analysis. Intuitively we say that $OG$ is verified if, starting from any state satisfying $OG$’s requires clause, the process can never reach an asserted property b that does not hold. We shall later give a more formal definition of what it means for a process to be verified with respect to its precondition, in Section 3.4.2.

The standard approach to analysing $OG$ would be to first linearise it to the bisimilar process $incr (n) \cdot mult (n) \cdot ? (b_{post}) + mult (n) \cdot incr (n) \cdot ? (b_{post})$, where + denotes non-deterministic choice and $b_{post}$ is again the Owicki–Gries postcondition, and then to reason about all branches of this linearised process. By “reasoning about all branches” we intuitively mean establishing that every assertion encountered during any execution of a process is a logical consequence of the series of effects preceding that assertion. A formal definition is provided later in Section 3.1. VerCors currently does the analysis by encoding the linearised process as input to the Viper verifier [10]. VerCors can indeed automatically prove that OG satisfies the asserted property.
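The branch-wise reasoning can be illustrated with a small bounded check in Python. This is only a sketch under stated assumptions: it tests a finite range of values rather than proving the property for all integers, it is unrelated to how VerCors and Viper discharge the obligation, and the names `b_post`, `BRANCHES`, `EFFECTS`, `branch_holds` and `og_verified` are hypothetical. Since the effects of incr and mult determine the new value of x uniquely, each effect is modelled as a function.

```python
def b_post(old_x, x, n):
    # Owicki-Gries postcondition: x = (old(x) + n) * n  or  x = old(x) * n + n
    return x == (old_x + n) * n or x == old_x * n + n


# The two branches of the linearised process, each followed by ?(b_post).
BRANCHES = [("incr", "mult"), ("mult", "incr")]

# Effects of the actions, written as functions from the old to the new x.
EFFECTS = {"incr": lambda x, n: x + n, "mult": lambda x, n: x * n}


def branch_holds(branch, old_x, n):
    """Apply the effects of one branch in order, then evaluate the assertion."""
    x = old_x
    for action in branch:
        x = EFFECTS[action](x, n)
    return b_post(old_x, x, n)


def og_verified(values):
    """Bounded check: every branch satisfies b_post for all sampled x and n."""
    return all(branch_holds(br, x, n)
               for br in BRANCHES for x in values for n in values)
```

Calling `og_verified(range(-10, 11))` checks both branches for all combinations of x and n in that range; the first branch ends in `(old_x + n) * n` and the second in `old_x * n + n`, matching one disjunct each.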

2.1.3. Step 3: Deductively Linking Processes to Programs

The key idea of our approach is that, by analysing how contract-complying action sequences change the values of process-algebraic variables, we may indirectly reason about how the content at heap location $[E]$ evolves over time. So the final step is to project this process-algebraic reasoning onto program behaviour, by annotating the program.

Figure 1 shows the required program annotations. First, x is connected to $[E]$ by initialising a new model M on line 2 that executes according to $OG (4)$. The actions $incr$ and $mult$ are then linked to the corresponding sub-programs on lines 5–7 and 11–13 by identifying action blocks in the code, using special program annotations. We use these $action$ annotations to verify in a thread-modular way that the left thread performs the $incr (4)$ action (on lines 5–7) and that the right thread performs $mult (4)$ (lines 11–13). As a result, when the program reaches the $query$ annotation on line 15, only the $? (b_{post})$ process is left on the process level: the $incr (4) ‖ mult (4)$ part has already been executed alongside the program. Since the Owicki–Gries postcondition $b_{post}$ has already been proven externally in the previous step, the program logic may rely on its validity. Moreover, since we tracked the contents at heap location $[E]$ on the process level as the variable x, one may indirectly conclude that the heap at location $[E]$ has evolved as described by $OG$. In other words, using program annotations we prove that the program is a refinement of $OG$, meaning that we get the asserted property in the logic, on line 17.

Finally, the $finish$ annotation on line 16 indicates that the model has been fully reduced at that point, and thus may be disposed of. This is for technical reasons: the program logic does some bookkeeping while dealing with process-algebraic abstractions, and $finish$ causes this bookkeeping to be cleaned up. This is discussed in greater detail in Section 3.4.2.

3. Formalisation

We now give theoretical justification of the verification approach and explain the underlying logical machinery. First, Section 3.1 and Section 3.2 briefly discuss the syntax and semantics of process-algebraic models and programs, respectively. Then Section 3.3 presents the program logic as a concurrent separation logic with assertions that allow program behaviour to be specified as a process-algebraic model. Section 3.4 formally introduces and discusses the proof rules. Finally, Section 3.5 discusses soundness of the approach. All these components have been fully formalised in Coq, including the soundness proof of the logic. Section 4 elaborates on the Coq development of the meta-theory, as well as on tool support, developed for the VerCors concurrency verifier.

3.1. Process-Algebraic Models

Program abstractions are defined using the following ACP-style [30] process-algebraic specification language, where $x, y, z, \dots \in ProcVar$ are process-algebraic variables; $v, w, \dots \in Val$ are values from an infinite domain $Val$; and $a, \dots \in Act$ are (process-algebraic) actions.

Definition 1

(Processes).
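Reading the constructs off the description in this subsection, the grammar of processes can be sketched as follows. This is a reconstruction from that description rather than a verbatim copy of the definition; in particular, the order of the alternatives and the use of the symbol ⌊⌊ for the left-merge are assumptions.

```latex
P, Q \in Proc ::= \varepsilon \mid \delta \mid a(e) \mid\ ?(b)
  \mid P \cdot Q \mid P + Q \mid P \parallel Q
  \mid P \mathbin{\lfloor\!\lfloor} Q \mid \Sigma_{x} P \mid b : P \mid P^{*}
```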

Clarifying the different connectives and constructs, $ε$ is the empty process, which has no behaviour. The $δ$ process is the deadlocked process, which neither progresses nor terminates. Processes of the form $a (e)$ are actions, which model the basic, observable (shared-memory) system behaviours. Actions are parameterised by data, in the form of expressions e. The process $P \cdot Q$ is the sequential composition of P and Q, whereas $P + Q$ is their non-deterministic choice. The parallel composition of processes P and Q is written $P ‖ Q$. The process $P ⌊⌊ Q$ is the left-merge of P and Q, which is similar in spirit to parallel composition, except that ⌊⌊ insists that the left-most process P proceeds first. The left-merge is an auxiliary connective commonly used to axiomatise parallel composition [31], by having $P ‖ Q = P ⌊⌊ Q + Q ⌊⌊ P$. The process $Σ_{x} P$ is the infinite summation $P [x / v_{0}] + P [x / v_{1}] + \dots$ over all values $v_{0}, v_{1}, \dots \in Val$. Any summation $Σ_{x} P$ is a binder for the summation variable x. In the remainder we assume without loss of generality that all variables bound by summation are unique (since any such variables can be renamed to unique ones if this is not yet the case). Sometimes $Σ_{x_{0}, \dots, x_{n}} P$ is written to abbreviate $Σ_{x_{0}} \dots Σ_{x_{n}} P$. The conditional (guarded) process $b : P$ behaves as P if the Boolean condition b holds, and otherwise behaves as $δ$. Furthermore, $P^{*}$ is the repetition, or iteration, of P, and denotes a sequence of zero or more P’s. The infinite iteration of P is derived to be $P^{ω} ≜ P^{*} \cdot δ$. Finally, $? (b)$ is the assertive process, which is very similar to guarded processes: $? (b)$ is behaviourally equivalent to $δ$ in case b does not hold. However, assertive processes have a special role in our approach: they are the main subject of process-algebraic analysis, as they encode the properties b to verify, as logical assertions. Moreover, they are a key component in connecting process-algebraic reasoning with deductive reasoning, as their properties can be relied upon in the deductive proofs of programs via the $query b$ ghost command.

3.1.1. Action Contracts

The presented verification approach uses processes in the presence of data, which is implemented via action contracts. Action contracts consist of pre- and postconditions, which we refer to as guards and effects, respectively, and which logically describe the state changes that are imposed by the corresponding action. In the remainder of this article, each action is assumed to have an action contract assigned to it. Instead of defining syntax for writing these contracts, the following two functions are assumed for obtaining the pre- and postcondition, respectively, of an action (from $Act$) and its data parameter (from $ProcExpr$).

\begin{matrix} guard : Act \to ProcExpr \to ProcCond \\ effect : Act \to ProcExpr \to ProcCond \end{matrix}

Both these conditions are of type $ProcCond$, which is the domain of Boolean expressions over process-algebraic variables. Note that, since actions are parameterised by data (see Definition 1), both $guard$ and $effect$ take a second argument to account for the input parameter, which is of type $ProcExpr$, the type of arithmetic expressions over process-algebraic variables.

Here $Act \to ProcExpr \to ProcCond$ should be read as $Act \to (ProcExpr \to ProcCond)$ and interpreted as a curried function type. That is, it is the set of functions mapping $Act$ to the set of functions mapping $ProcExpr$ to $ProcCond$.

3.1.2. Free Variables and Substitution

A function ${fv}_{e} : ProcExpr \to 2^{ProcVar}$ is used to determine the set of free process-algebraic variables in expressions as usual, and likewise for ${fv}_{b} (b)$ and ${fv}_{P} (P)$ for Boolean expressions b and processes P. We often omit the subscripts and simply write $fv (\cdot)$ whenever the context allows it. The definitions of ${fv}_{e}$, ${fv}_{b}$ and ${fv}_{P}$ are mostly standard and thus deferred to [19]. Noteworthy however are:

\begin{matrix} {fv}_{P} (a (e)) ≜ {fv}_{b} (guard a e) \cup {fv}_{b} (effect a e) \\ {fv}_{P} (Σ_{x} P) ≜ {fv}_{P} (P) \setminus \{x\} \\ {fv}_{P} (? (b)) ≜ {fv}_{b} (b) \end{matrix}

Substitution is written $e^{'} [x / e]$ (and likewise for Boolean expressions and processes) and has a standard definition: replacing any occurrence of x inside $e^{'}$ by the expression e. Noteworthy is that substitutions inside action processes $a (e)$ do not affect the action contracts: $a (e^{'}) [x / e] ≜ a (e^{'} [x / e])$.
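As an illustration of free variables and substitution, the following Python sketch works on a tiny expression representation. The encoding is an assumption made purely for this example: expressions are nested tuples tagged `"val"`, `"var"`, or a binary operator name, and an action process is the tuple `("act", name, e)`; none of these names come from the formalisation in [19].

```python
def fv(e):
    """Free variables of an expression given as a nested tuple."""
    tag = e[0]
    if tag == "val":
        return set()
    if tag == "var":
        return {e[1]}
    return fv(e[1]) | fv(e[2])  # binary operators such as "add" or "mul"


def subst(e, x, r):
    """e[x / r]: replace every occurrence of the variable x in e by r."""
    tag = e[0]
    if tag == "val":
        return e
    if tag == "var":
        return r if e[1] == x else e
    return (tag, subst(e[1], x, r), subst(e[2], x, r))


def subst_action(action, x, r):
    """a(e')[x / r] = a(e'[x / r]): only the parameter changes, not the action."""
    _, name, param = action
    return ("act", name, subst(param, x, r))
```

For example, with `e = ("add", ("var", "x"), ("val", 1))`, the call `subst(e, "x", ("var", "y"))` yields `("add", ("var", "y"), ("val", 1))`, whose only free variable is y.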

3.1.3. Operational Semantics

The denotational semantics of process-algebraic expressions ${[[\cdot]]}_{e} : ProcExpr \to ProcStore \to Val$ and conditions ${[[\cdot]]}_{b} : ProcCond \to ProcStore \to Bool$ is defined in the standard way, as total functions that evaluate to $Val$ and $Bool$, respectively. The set $σ \in ProcStore ≜ ProcVar \to Val$ is the domain of process stores, which are used to give an interpretation to all process-algebraic variables. The overloaded notations $[[e]] σ$ and $[[b]] σ$ are used instead of ${[[e]]}_{e} σ$ and ${[[b]]}_{b} σ$ wherever the context allows it. Moreover, $[[e]]$ is sometimes written instead of ${[[e]]}_{e} σ$ when e is closed (i.e., when $fv (e) = \emptyset$), and likewise for $[[b]]$.

The operational semantics of the process algebra language is expressed as a labelled binary small-step reduction relation $\overset{α}{⟶} \subseteq ProcConf \times ProcLabel \times ProcConf$ over process configurations, defined as $ProcConf ≜ Proc \times ProcStore$, that is, pairs of processes and process stores. The labels $α$ of the reduction rules are defined as follows: $α \in ProcLabel ::= a (v) | assn$. Transitions labelled $a (v)$ are reductions of actions, whereas $assn$ indicates reductions of assertions.

Before giving the reduction rules we first define a notion of successful termination $P ↓$ of processes P. Successful termination is only defined for processes that are well-formed. A process P is defined to be well-formed if all action parameters (the e’s in $a (e)$) and conditions (the b’s in $b : Q$) occurring inside P are closed.

Definition 2

(Successful termination).

Intuitively, any process P can terminate successfully if P has the choice to have no further behaviour. This means that $ε$ can always successfully terminate (↓-EPSILON), as it has no behaviour, while $δ$ can never successfully terminate. Iteration $P^{*}$ can always successfully terminate (↓-ITER) as it may choose not to start iterating and thereby to behave as $ε$.
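The intuition behind successful termination can be sketched as a recursive predicate in Python. Only the cases for ε, δ and iteration are taken from the text above; the cases for choice, sequential and parallel composition, and guarded processes are assumptions that follow the usual rules for process algebras with ε. Left-merge, summation and assertions are deliberately left out. Processes are encoded as tagged tuples, and the condition of a guarded process is assumed to be already evaluated to a Python Boolean (recall that conditions are closed in well-formed processes).

```python
def terminates(p):
    """Sketch of the successful-termination predicate on tagged tuples."""
    tag = p[0]
    if tag == "eps":       # epsilon has no behaviour, so it can terminate
        return True
    if tag in ("delta", "act"):  # deadlock and actions cannot terminate directly
        return False
    if tag == "star":      # iteration may choose to perform zero rounds
        return True
    if tag == "alt":       # assumed: a choice terminates if either branch can
        return terminates(p[1]) or terminates(p[2])
    if tag in ("seq", "par"):  # assumed: both components must be able to terminate
        return terminates(p[1]) and terminates(p[2])
    if tag == "cond":      # assumed: guard must hold and the body must terminate
        return bool(p[1]) and terminates(p[2])
    raise ValueError(f"unsupported process constructor: {tag}")
```

For example, `("alt", ("act", "incr", 4), ("eps",))` can terminate because it may choose the ε branch, whereas `("seq", ("act", "incr", 4), ("eps",))` cannot, since the action has to be performed first.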

The small-step reduction rules of process configurations are given below. As with the definition of successful termination, these reduction rules also require processes to be well-formed.

Definition 3

(Reductions of process configurations).

Most of the reduction rules are standard in spirit [32]. However, the handling of actions and their contracts makes this process algebra language non-standard. More specifically, the non-standard ⟶-ACT reduction rule for action handling permits the state $σ$ to change in any way, as long as these changes comply with the action contract. We will later use the ⟶-ACT rule to connect shared-memory updates in programs to action contract-complying state changes on the process level.

Moreover, the notion of successful termination is used to define the reduction rule for sequential composition, ⟶-SEQ-R, which is standard in process algebra languages with $ε$ [33]. (An alternative to the explicit use of successful termination is to introduce internal ($τ$-)transitions for the reductions of $ε$. However, this might make the remaining formalisation less elegant, for example by requiring a notion of weak bisimilarity, instead of the notion of strong bisimilarity that is introduced later in this section.)

3.1.4. Process-Algebraic Verification

Process-algebraic verification in our approach amounts to verifying that all reachable assertional processes $? (b)$ are always satisfied, which we are interested in so that the program logic can rely on the b’s. Any process configuration $(P, σ)$ fails to verify, or exhibits a fault, which we write $↯ (P, σ)$, if it can directly violate an assertion. Verifying a process, i.e., checking for fault absence, could for example be reduced to checking the $μ$-calculus formula $[{true}^{*} \cdot ↯] false$, meaning “no faults are ever reachable”, e.g., using the mCRL2 model checker, where ↯ is modelled as an explicit fault state.

Fault exhibition is defined inductively as follows.

Definition 4

(Faulting process configuration).

Any process configuration $(P, σ)$ is defined to be safe, denoted as $✓ (P, σ)$, if it can never reach a faulting configuration. More formally:

Definition 5

(Safe process configurations). The $✓ \subset ProcConf$ predicate is coinductively defined such that, whenever $✓ (P, σ)$ holds, then (1) $\neg ↯ (P, σ)$; and (2) for any $P^{'}$, $σ^{'}$ and α, if $(P, σ) \overset{α}{⟶} (P^{'}, σ^{'})$, then $✓ (P^{'}, σ^{'})$.

Definition 6

(Verified processes). Any well-formed process P is defined to be verified with respect to a (pre)condition b, which is written $⊨ \{b\} P$, if $\forall σ . [[b]] σ \Rightarrow ✓ (P, σ)$.

3.1.5. Bisimulation

Our verification approach allows handling process-algebraic models up to (strong) bisimulation.

Definition 7

(Bisimulation). Any binary relation $R \subseteq Proc \times Proc$ over processes is defined to be a bisimulation relation if, whenever $P R Q$, then:

(1): $P ↓$ if and only if $Q ↓$ .
(2): $↯ (P, σ)$ if and only if $↯ (Q, σ)$ , for any σ.
(3): For any σ, $P^{'}$ , $σ^{'}$ and α, if $(P, σ) \overset{α}{⟶} (P^{'}, σ^{'})$ , then there exists a $Q^{'}$ such that $(Q, σ) \overset{α}{⟶} (Q^{'}, σ^{'})$ and $P^{'} R Q^{'}$ .
(4): For any σ, $Q^{'}$ , $σ^{'}$ and α, if $(Q, σ) \overset{α}{⟶} (Q^{'}, σ^{'})$ , then there exists a $P^{'}$ such that $(P, σ) \overset{α}{⟶} (P^{'}, σ^{'})$ and $P^{'} R Q^{'}$ .

Any two processes P and Q are defined to be bisimilar, or bisimulation equivalent, written

P ≅ Q

, if and only if there exists a bisimulation relation

R

such that

P R Q

. Bisimilarity expresses that both processes exhibit the same behaviour, in the sense that their action sequences describe the same state changes. Bisimilarity is an equivalence relation. Furthermore, bisimilarity is a congruence for all process-algebraic connectives.

Successful termination

P ↓

can intuitively be understood as P being bisimilar to the process

ε + P

, that is, by having the choice to have no further behaviour.

Proposition 1.

If

P ↓

then

P ≅ ε + P

.

Lemma 1.

If

P ≅ Q

and

✓ (P, σ)

, then

✓ (Q, σ)

.

Figure 2 gives a list of bisimulation equivalences that hold for our process algebra language. Note that the left-merge connective

is not strictly needed, in the sense that our approach does not rely on it, but can be used to prove for example that

a (e) ‖ a^{'} (e^{'})

is bisimilar to

a (e) \cdot a^{'} (e^{'}) + a^{'} (e^{'}) \cdot a (e)

.
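
To make the transfer conditions of Definition 7 concrete, the following sketch decides strong bisimilarity on a small, finite labelled transition system by pruning a candidate relation until it is stable. It is a simplification under stated assumptions: data state σ and fault exhibition (condition 2) are left out, actions are plain labels, and the names `bisimilar`, `TRANS` and `FINAL` are hypothetical.

```python
def bisimilar(p, q, trans, final):
    """Naive greatest-fixpoint check of strong bisimilarity on a finite LTS.

    trans: dict mapping each state to a set of (label, successor) pairs
    final: set of states that can successfully terminate
    """
    states = list(trans)
    # Start from all pairs that agree on termination (condition 1), then prune.
    rel = {(s, t) for s in states for t in states
           if (s in final) == (t in final)}

    def matches(s, t, flip):
        # Every move of s must be answered by an equally labelled move of t.
        for label, s2 in trans[s]:
            if not any(l == label and
                       ((t2, s2) if flip else (s2, t2)) in rel
                       for l, t2 in trans[t]):
                return False
        return True

    changed = True
    while changed:
        changed = False
        for (s, t) in list(rel):
            # Conditions 3 and 4 of Definition 7.
            if not (matches(s, t, False) and matches(t, s, True)):
                rel.discard((s, t))
                changed = True
    return (p, q) in rel

TRANS = {
    # a || b, drawn as a diamond
    "par": {("a", "par_b"), ("b", "par_a")},
    "par_b": {("b", "done")},
    "par_a": {("a", "done")},
    "done": set(),
    # a.b + b.a, drawn as a tree
    "seq": {("a", "seq_b"), ("b", "seq_a")},
    "seq_b": {("b", "end1")},
    "seq_a": {("a", "end2")},
    "end1": set(),
    "end2": set(),
    # a.(b + c) versus a.b + a.c
    "u": {("a", "u1")},
    "u1": {("b", "done"), ("c", "done")},
    "w": {("a", "w1"), ("a", "w2")},
    "w1": {("b", "done")},
    "w2": {("c", "done")},
}
FINAL = {"done", "end1", "end2"}
```

Here `bisimilar("par", "seq", TRANS, FINAL)` relates the interleaving of two actions to the choice between their two orderings, whereas the states `"u"` and `"w"` are distinguished because the choice between b and c is made at different moments.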

3.2. Programs

Our verification approach is formalised on the following simple concurrent pointer language, where

X, Y, \dots \in Var

are (program) variables.

Definition 8

(Expressions, conditions, abstraction binders, commands).

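\begin{array}{rcl}
E \in Expr & ::= & v \mid X \mid E + E \mid E - E \mid \dots \\
B \in Cond & ::= & true \mid false \mid \neg B \mid B \land B \mid E = E \mid E < E \mid \dots \\
Π \in AbstrBinder & ::= & \{x_{0} \mapsto E_{0}, \dots, x_{n} \mapsto E_{n}\} \\
C \in Cmd & ::= & \mathsf{skip} \mid X := E \mid X := [E] \mid [E] := E \mid C; C \mid X := \mathsf{alloc} \; E \mid \mathsf{dispose} \; E \mid \\
& & \mathsf{if} \; B \; \mathsf{then} \; C \; \mathsf{else} \; C \mid \mathsf{while} \; B \; \mathsf{do} \; C \mid \mathsf{atomic} \; C \mid \mathsf{inatom} \; C \mid C ‖ C \mid \\
& & X := \mathsf{process} \; (λ x . P) (E) \; \mathsf{over} \; Π \mid \mathsf{action} \; E \; a (E) \; \mathsf{do} \; C \mid \mathsf{inact} \; C \mid \\
& & \mathsf{finish} \; E \mid \mathsf{query} \; E
\end{array}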

This language is a variation of the language proposed by O’Hearn [24] and Brookes [23]. In particular, we extend their language with specification-only commands (code annotations) for handling process-algebraic models. These commands are coloured blue. Note that the blue colouring does not have any semantic meaning; it only indicates which language constructs are specification-only. Moreover, we use the terms command and program interchangeably.

3.2.1. Standard Language Constructs

The notation

[E]

stands for heap dereferencing, where E is an expression whose evaluation determines the heap location to dereference. The commands

X : = [E]

and

[E] : = E^{'}

denote heap reading and writing: they read from, and write to, the heap at location E, respectively. Moreover,

X : = alloc E

allocates a free heap location and writes the value represented by E to it, whereas

dispose E

deallocates the heap location at E.

Regarding concurrency, the command

C_{1} ‖ C_{2}

is the statically-scoped parallel composition of

C_{1}

and

C_{2}

and expresses their concurrent execution. In the sequel, we sometimes refer to commands that are put in parallel as different threads; for example

C_{1}

and

C_{2}

in the above. Moreover,

atomic C

expresses a statically-scoped lock: it represents the atomic execution of C, that is, without interference of other threads. The command

inatom C

represents partially executed atomic programs: ones that are currently being executed, where C is the remaining program that still has to be executed atomically. Such commands are sometimes referred to as “runtime syntax”, as they are not written by users of the language, but are instead an artefact of program execution.

3.2.2. Specification-Only Constructs

The instructions that are displayed in blue are the specification-only language constructs, for handling process-algebraic models in the logic. These instructions are ignored during regular program execution and are essentially handled as if they were code comments.

Specification-wise,

X : = process (λ x . P) (E) over Π

initialises a new process-algebraic model P in the proof system that takes a single input argument named x, namely (the evaluation of) the expression E. This model is used (1) as a specification of how a particular region of shared memory, specified by

Π

, is allowed to evolve over time; and (2) to support reasoning over the model to indirectly prove properties of how the heap evolves. The

Π

component is an abstraction binder, which is also defined in Definition 8 and is used to connect process-algebraic variables to heap locations in the program. In particular, the abstraction binders make the connections/links between process-algebraic state and shared-memory program state (that is, heap locations). In the sequel, we often use abstraction binders as if they were finite partial mappings,

Π : ProcVar ⇀_{fin} Expr

, from process-algebraic variables to the expressions whose evaluation determines the corresponding heap location. Finally, the variable X identifies the process-algebraic model after initialisation.

The command

finish E

is used to finalise the process-algebraic model identified by E in the logic, given that it can successfully terminate. Finalisation is later explained in more detail, in Section 3.4.

The specification command

action E a (E^{'}) do C

is used to link the execution of programs with the execution of process-algebraic models. More specifically, it executes the program C in the context of the model identified by E, as the process-algebraic action a that takes (the evaluation of)

E^{'}

as an input argument. The soundness argument of the program logic establishes a refinement relation between programs and their models, and this relation is established by synchronising program execution with process execution, with help of these action blocks.

The

inact

C command denotes a partially executed action program: one that still has to execute C. Like

inatom

, this command can only occur during runtime and is not written by users.

Lastly,

query

E is used to connect process-algebraic reasoning to deductive reasoning: it allows the deductive proof of the program to rely on (or assume) properties that are proven to hold (or guaranteed) on the process-algebraic model identified by E, via process-algebraic analysis. These are the properties that are encoded as assertions

? (\cdot)

in this model. Of course, this would require linking process-algebraic state to program state, which we come to later, in Section 3.3 and Section 3.4.

3.2.3. Free Variables and Substitution

We use the standard (overloaded) notations

FV (E)

,

FV (B)

,

FV (Π)

and

FV (C)

to refer to the set of free program variables in the given (Boolean) expression E and B, abstraction binder

Π

, and command C, respectively. Moreover, the notation

E [X / E^{'}]

denotes the replacement of every occurrence of the program variable X by the expression

E^{'}

inside E; and likewise for Boolean expressions, abstraction binders, and commands. The full definitions of

FV (\cdot)

and

(\cdot) [X / E]

are mostly standard, and therefore deferred to [19].
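
A minimal sketch of the two operations on expressions is given below. The full definitions are in [19]; here expressions are assumed to be nested tuples with the hypothetical tags `"val"`, `"var"`, `"add"` and `"sub"`, and the function names `fv` and `subst` are chosen for illustration only.

```python
def fv(e):
    """Free program variables of an expression given as a nested tuple."""
    tag = e[0]
    if tag == "val":
        return set()
    if tag == "var":
        return {e[1]}
    return fv(e[1]) | fv(e[2])      # binary operators "add" and "sub"

def subst(e, x, e_new):
    """E[X / E']: replace every occurrence of variable x in e by e_new."""
    tag = e[0]
    if tag == "val":
        return e
    if tag == "var":
        return e_new if e[1] == x else e
    return (tag, subst(e[1], x, e_new), subst(e[2], x, e_new))

# The expression (X + 1) - Y
EXPR = ("sub", ("add", ("var", "X"), ("val", 1)), ("var", "Y"))
```

Since expressions contain no binders, every variable occurrence is free and substitution needs no capture-avoidance.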

3.2.4. User Programs

As just discussed, our simple programming language contains runtime syntax—instructions that are not written by users but are only introduced during runtime. Commands that are free of such runtime constructs are called user commands.

Definition 9

(User commands). Any command C is defined to be a user command, denoted

user (C)

, if C does not contain sub-commands of the forms

inatom C^{'}

and

inact

C^{'}

, for any command

C^{'}

.

3.2.5. Wellformedness

Moreover, our verification approach only applies to well-formed commands. Notably, our technique requires that, for any program of the form

action

_

do

C

and

inact

C, the inner action program C only contains a subcategory of commands, excluding atomic commands and specification-only constructs, in particular nested action blocks. The latter is needed since actions must be atomically observable by environmental threads. This restriction is captured by the following definition.

Definition 10

(Basic programs, well-formed programs). Any command C is defined to be basic, denoted

basic (C)

, if C does not contain any atomic sub-programs, i.e.,

atomic

or

inatom

, nor specification-specific language constructs, i.e.,

process

,

action

,

inact

,

finish

, or

query

.

A command C is defined to be well-formed, denoted

wf (C)

, if, for any command

action

_

do

C^{'}

or

inact

C^{'} that occurs in C, it holds that

basic (C^{'})

.

Lemma 2.

basic (C)

implies

wf (C)

for any command C.
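
The predicates of Definitions 9 and 10 can be sketched as simple traversals of a command tree. The tuple encoding below (a tag followed by arguments, with `CHILDREN` recording which argument positions hold sub-commands) is an assumption made for this example and is not part of the formalisation.

```python
# Positions of sub-commands for each command tag (hypothetical encoding).
CHILDREN = {
    "seq": (1, 2), "par": (1, 2), "if": (2, 3), "while": (2,),
    "atomic": (1,), "inatom": (1,), "action": (4,), "inact": (1,),
}
RUNTIME = {"inatom", "inact"}
NON_BASIC = {"atomic", "inatom", "process", "action", "inact", "finish", "query"}

def children(c):
    return [c[i] for i in CHILDREN.get(c[0], ())]

def subcommands(c):
    """Yield c and all commands nested inside it."""
    yield c
    for child in children(c):
        yield from subcommands(child)

def user(c):
    return all(d[0] not in RUNTIME for d in subcommands(c))

def basic(c):
    return all(d[0] not in NON_BASIC for d in subcommands(c))

def wf(c):
    # Bodies of action blocks and of partially executed actions must be basic.
    return all(basic(child)
               for d in subcommands(c) if d[0] in ("action", "inact")
               for child in children(d))

BODY = ("seq", ("write", "E1", "E2"), ("skip",))
GOOD = ("action", "X", "a", "E", BODY)
BAD = ("action", "X", "a", "E", ("atomic", ("skip",)))
RUNNING = ("inact", BODY)
```

In this encoding `GOOD` is a well-formed user command, `BAD` is rejected because its body contains an atomic block, and `RUNNING` is well-formed but not a user command. A basic command contains no action blocks at all, so `wf` holds for it vacuously, which mirrors Lemma 2.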

3.2.6. Operational Semantics

The denotational semantics of expressions

[[E]] s

and conditions

[[B]] s

are again defined in the standard way, and evaluate to

Val

and

Bool

, respectively, where

s \in Store ≜ Var \to Val

is a (program) store that gives an interpretation to all program variables.

The operational semantics of programs is defined in terms of a binary small-step reduction relation

⇝ \subseteq Conf \times Conf

between program configurations. A program configuration

\mathcal{C} = (C, h, s) \in Conf ≜ Cmd \times Heap \times Store

is a triple, consisting of a command C as well as a heap h that models shared memory, and a store

s \in Store

that models thread-local memory. Any program configuration of the form

(skip, h, s)

is defined to be final or terminated. Heaps

h \in Heap ≜ Val ⇀_{fin} Val

are defined to be finite partial mappings from values to values. Heap locations are themselves values, so that they can be assigned to, and read from, local variables, and thus be handled as any value. The function

dom : Heap \to 2^{Val}

denotes the mapped domain of a given heap, so that

dom (h) ≜ \{v \mid h (v) \neq undefined\}

.

Definition 11

(Small-step operational semantics of programs).

Most of the transition rules are standard; see for example [34]. The update notation

s [X \mapsto v]

defines a store that is equal to s, except that X is mapped to v. A similar notation is used for heaps, namely

h [v_{1} \mapsto v_{2}]

. Moreover, the notation

h \ v

denotes the removal of the entry at v in h.
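
These notations map directly onto finite dictionaries. The sketch below assumes stores and heaps are Python dicts; the helper names `update`, `remove` and `dom` are hypothetical.

```python
def update(mapping, key, value):
    """s[X -> v] or h[v1 -> v2]: a copy that differs only at `key`."""
    new = dict(mapping)
    new[key] = value
    return new

def remove(heap, loc):
    """Heap removal: a copy of the heap without the entry at `loc`."""
    return {k: v for k, v in heap.items() if k != loc}

def dom(heap):
    """The set of locations at which the heap is defined."""
    return set(heap)
```

All three helpers return fresh values and leave their arguments unchanged, matching the functional reading of the notation.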

An interesting aspect of the operational semantics is that atomic programs are executed using a small-step reduction strategy (via

⇝

-INATOM-STEP and

⇝

-INATOM-SKIP), rather than a big-step execution, which is more customary. This is done for technical reasons: it simplifies the establishment of a simulation/refinement between programs and their models. Consequently, we use a notion of a locked program to define the transition rules for atomic programs. Any command C is said to be (globally) locked if C executes an atomic program, i.e., if C has

inatom C^{'}

as a subprogram for some

C^{'}

.

Definition 12

(Locked programs). Any command C is locked if

locked (C)

holds, where

locked \subset Cmd

is defined as follows, by structural recursion on C:

locked (C) ≜ \begin{cases} true & \text{if } C = inatom \; C^{'} \\ locked (C_{1}) & \text{if } C = C_{1}; C_{2} \\ locked (C_{1}) \lor locked (C_{2}) & \text{if } C = C_{1} ‖ C_{2} \\ locked (C^{'}) & \text{if } C = inact \; C^{'} \\ false & \text{otherwise} \end{cases}

The rules

⇝

-PAR-L and

⇝

-PAR-R for parallel composition allow a thread to make an execution step only if the other thread is not locked, thereby preventing thread interference while executing atomic programs. One might ask whether this handling of locks could not potentially lead to deadlock scenarios, for example by encountering configurations

(C_{1} ‖ C_{2}, h, s)

during runtime for which both

locked (C_{1})

and

locked (C_{2})

hold. However, we will later see and prove that no such deadlocks can be reached, given that one starts with an initial configuration that contains a user program.
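
Definition 12 and the side conditions of the parallel rules can be sketched as follows. Commands are assumed to be encoded as tuples whose first component is a tag such as `"inatom"`, `"seq"` or `"par"`; this encoding and the helper names `may_step_left` and `may_step_right` are hypothetical.

```python
def locked(c):
    """Definition 12 by structural recursion on a tuple-encoded command."""
    tag = c[0]
    if tag == "inatom":
        return True
    if tag == "seq":
        return locked(c[1])          # only the first component is executing
    if tag == "par":
        return locked(c[1]) or locked(c[2])
    if tag == "inact":
        return locked(c[1])
    return False

def may_step_left(c):
    """Lock side condition for a step of the left thread of C1 || C2."""
    assert c[0] == "par"
    return not locked(c[2])

def may_step_right(c):
    """Lock side condition for a step of the right thread of C1 || C2."""
    assert c[0] == "par"
    return not locked(c[1])

T1 = ("seq", ("inatom", ("skip",)), ("skip",))   # inside an atomic block
T2 = ("atomic", ("skip",))                       # atomic block not yet entered
PROG = ("par", T1, T2)
```

In `PROG` only the left thread may step: it holds the lock, so the right thread has to wait until the partially executed atomic block has finished.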

Furthermore, the specification-only language constructs do not affect the state of the program (neither the heap nor the store) and are essentially handled as if they were comments. Notice, however, that commands of the form

action

_

do

C

are first reduced to

inact

C before C is executed. This is done for technical reasons, as this makes it more convenient to later establish a simulation relation between execution steps of programs and processes.

The semantics of programs has the following preservation properties.

Lemma 3.

Program execution preserves basicality and wellformedness:

1.: If $basic (C)$ and $(C, h, s) ⇝ (C^{'}, h^{'}, s^{'})$ , then $basic (C^{'})$ .
2.: If $wf (C)$ and $(C, h, s) ⇝ (C^{'}, h^{'}, s^{'})$ , then $wf (C^{'})$ .

3.2.7. Fault Semantics

Apart from an operational semantics, we also define a fault semantics for programs [35] that classifies runtime errors that may occur during program execution. Its definition uses two auxiliary functions,

acc (C, s)

and

writes (C, s)

, for obtaining the set of heap locations that can be accessed or written to, respectively, in a next reduction step of C. Their definitions are deferred to [19] as well, as they are quite lengthy and not essential for understanding the definition of the fault semantics.

The fault semantics of program configurations

\mathcal{C}

is expressed as a predicate

↯ (\mathcal{C})

that is inductively defined as follows.

Definition 13

(Fault semantics of programs).

Intuitively, a program configuration exhibits a fault if it (1) accesses unallocated memory, or (2) is deadlocked, or (3) allows performing a data-race.

More specifically, ↯-READ expresses that heap reading

X : = [E]

faults if the heap location at E is unoccupied. For the same reason, also heap writing (↯-WRITE) and heap deallocation (↯-DISPOSE) may fault. The ↯-PAR-L rule expresses that any parallel program

C_{1} ‖ C_{2}

can fault if

C_{1}

can fault, given that

C_{2}

is not locked, or the other way around (↯-PAR-R covers the other direction). Program configurations that hold multiple global locks are also considered to be faulting, by ↯-DEADLOCK. Finally, the fault semantics encodes the definition of a data-race, via ↯-RACE-1 and ↯-RACE-2. To clarify, any configuration

(C, h, s)

exhibits a data-race if C has (at least) two threads that can both access a common location in h in the next reduction step, where at least one of these accesses is a write.
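
The data-race condition can be illustrated with a small check on access sets. The sketch assumes that the sets of locations each thread can access or write in its next step are already given (in the paper they are computed by the auxiliary functions deferred to [19]), that every written location also counts as accessed, and it omits the side conditions on locks. The function name `races` is hypothetical.

```python
def races(acc1, writes1, acc2, writes2):
    """True if both threads can touch a common location and one access is a write."""
    return bool((writes1 & acc2) | (writes2 & acc1))

# Thread 1 reads location 1; thread 2 writes location 1.
READ_1 = ({1}, set())
WRITE_1 = ({1}, {1})
WRITE_2 = ({2}, {2})
```

Two concurrent reads of the same location are not a race, and neither are writes to distinct locations; a read and a write to the same location are.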

We will later see that the soundness argument of our program logic covers that verified programs are free of faults. More specifically, we will prove that, for any program C for which a proof can be derived, we have that C is fault-free with respect to any heap h and store s that satisfy C’s precondition, and moreover, that every configuration that is reachable from

(C, h, s)

is also fault-free.

Finally, to show that the operational semantics of programs is coherent with respect to faults, we prove that the operational semantics is progressive for all non-faulting program configurations.

Theorem 1

(Progress of

⇝

). For any program configuration

\mathcal{C}

for which

\neg ↯ (\mathcal{C})

holds, either

\mathcal{C}

is final, or there exists a configuration

\mathcal{C}^{'}

such that

\mathcal{C} ⇝ \mathcal{C}^{'}

.

3.3. Assertions

The assertion language of our verification approach is defined by the following grammar.

Definition 14

(Assertions).

\begin{array}{rcl}
t \in PointsToType & ::= & std \mid proc \mid act \\
P, Q, R, \dots \in Assn & ::= & B \mid \forall X . P \mid \exists X . P \mid P \lor Q \mid P * Q \mid *_{i \in I} P_{i} \mid P - * Q \mid E {\overset{π}{↪}}_{t} E \mid \\
& & {Proc}_{π} (E, \tilde{P}, Π) \mid \tilde{P} \approx \tilde{Q}
\end{array}

Assertions can be built from plain Boolean expressions B, and may contain several standard connectives from predicate logic: universal and existential quantifiers, and disjunction. Moreover, logical conjunction (∧) is replaced by the separating conjunction * from Concurrent Separation Logic (CSL). The

*_{i \in I} P_{i}

connective is the iterated separating conjunction over a finite index set I, and stands for

P_{0} * \dots * P_{n}

, given that

I = \{0, \dots, n\}

. The

- *

connective is known as the magic wand and is used to describe hypothetical judgments, much like the logical implication from predicate logic.

Apart from these standard CSL connectives, the assertion language contains three different heap ownership predicates

{\overset{π}{↪}}_{t}

, with

π \in Q

a rational number that represents a fractional permission, and t the heap ownership type, as well as an ownership predicate

{Proc}_{π}

for program abstractions. Finally

\tilde{P} \approx \tilde{Q}

intuitively means that

\tilde{P}

and

\tilde{Q}

are bisimilar processes with respect to the current state.

The definitions of free variables

FV (P)

of assertions

P

, and substitution

P [X / E]

in

P

, are the standard ones and are therefore deferred to [19]. Assertions that are free of

{\overset{π}{↪}}_{t}

and

{Proc}_{π}

predicates are called pure. Any assertion that is not pure is said to be spatial.

3.3.1. Heap Ownership

The assertion

E_{1} {\overset{π}{↪}}_{t} E_{2}

is the heap ownership assertion and expresses that the heap contains the value represented by the expression

E_{2}

at heap location

E_{1}

. Moreover,

π

and t together determine the access rights to this heap location. In more detail, depending on the ownership type t, the

{\overset{π}{↪}}_{t}

ownership predicates express different access rights to the associated heap location:

Standard heap ownership. $E_{1} {\overset{π}{↪}}_{std} E_{2}$ is the standard heap ownership predicate from (intuitionistic) separation logic that provides read-access whenever $0 < π < 1$ , and write-access in case $π = 1$ . Moreover, the subscript $std$ indicates that the associated heap location $E_{1}$ is not bound to any process-algebraic model. We say that a heap location $v \in Val$ is bound by, or subject to, a program abstraction, if there is an active program abstraction with a binder $Π$ that contains a mapping to v, that is, $v \in dom (Π)$ .
Process heap ownership. $E {\overset{π}{↪}}_{proc} E^{'}$ is the process heap ownership predicate, which indicates that the heap location at E is bound by an active process-algebraic abstraction, but in a purely read-only manner. More precisely, ${\overset{π}{↪}}_{proc}$ assertions exclusively grant read-access, even in case $π = 1$ .
Action heap ownership. $E {\overset{π}{↪}}_{act} E^{'}$ is the action heap ownership predicate, which indicates that the heap location E is bound by an active process-algebraic model, and is used in the context of an action block, in a read/write manner.

Observe that action points-to assertions

{\overset{π}{↪}}_{act}

essentially give the same access rights as

{\overset{π}{↪}}_{std}

assertions. Nevertheless, they are both needed, to be able to distinguish between bound and unbound heap locations in the logic. For example, the program logic must not allow the deallocation of memory that is currently bound to (protected by) an active process-algebraic model, as this would be unsound.

Moreover, even though

{\overset{π}{↪}}_{proc}

predicates never grant write access, we will later see that the proof system allows

{\overset{π}{↪}}_{proc}

predicates to be upgraded to

{\overset{π}{↪}}_{act}

inside action blocks, and

{\overset{π}{↪}}_{act}

again provides write access when

π = 1

. More precisely,

E {\overset{1}{↪}}_{proc} E^{'}

predicates grant the capability to regain write access to E, in the context of an action program. This system of upgrading enforces that all modifications to E happen in the context of

action E_{abstr} a (E_{abstr}^{'}) do C

commands, so that the modifications are protected and can be recorded by the program abstraction identified by

E_{abstr}

, as the action a.

In addition to these three heap ownership predicates, we derive a fourth such predicate, called the process–action heap ownership predicate. This ownership predicate is equivalent to

{\overset{π}{↪}}_{act}

only if

π

denotes write access, and otherwise it is equivalent to

{\overset{π}{↪}}_{proc}

.

Definition 15

(Process–action heap ownership).

E_{1} {\overset{π}{↪}}_{procact} E_{2} ≜ \begin{cases} E_{1} {\overset{π}{↪}}_{act} E_{2} & \text{if } π = 1 \\ E_{1} {\overset{π}{↪}}_{proc} E_{2} & \text{otherwise} \end{cases}

This derived predicate is for later use, in the proof system of our program logic. Finally, the notation

E {\overset{π}{↪}}_{t} -

is sometimes used as shorthand for

\exists X . E {\overset{π}{↪}}_{t} X

, where

X \notin FV (E)

.

3.3.2. Process Ownership

The

{Proc}_{π} (E, \tilde{P}, Π)

assertion expresses ownership of a program abstraction that is identified by E, where the abstraction is represented by the process

\tilde{P}

. Ownership in this sense means that the thread has knowledge of the existence of the process-algebraic model

\tilde{P}

, as well as the right to execute as prescribed by this model. The mapping

Π

connects the abstract model to the concrete program by mapping the process-algebraic variables in the abstraction to heap locations in the program, as discussed before. And last, the fractional permission

π

is needed to implement the ownership system of program models. Fractional permissions are only used here to be able to reconstruct the full

{Proc}_{1}

predicate. We shall later see that

{Proc}_{π}

predicates can be split and merged along

π

and parallel compositions inside

\tilde{P}

, and be consumed in the proof system by

action

programs.

Even though reasoning about process-algebraic models is done purely on the level of process-algebraic state, in the program logic it is allowed to mix program state with process-algebraic state. This is indicated by the tilde above the

\tilde{P}

, which means that P can have both program variables and process-algebraic variables. Such processes are called hybrid processes and are defined as follows.

Definition 16

(Hybrid expressions, conditions and processes).

These hybrid processes thus allow mixing process-algebraic reasoning with deductive reasoning using our program logic. The function

fv (\tilde{P})

is used for obtaining the set of free process-algebraic variables in

\tilde{P}

, and

FV (\tilde{P})

for obtaining all free program variables in

\tilde{P}

(and likewise for

\tilde{E}

and

\tilde{B}

).

We shall later see that the program logic allows replacing processes

\tilde{P}

inside

{Proc}_{π} (E, \tilde{P}, Π)

predicates by bisimilar ones. However, note that one cannot use the standard notion of bisimilarity as defined in Definition 7 for this in case

\tilde{P}

has any program variables occurring freely in it. To resolve this, we include a relation

\tilde{P} \approx \tilde{Q}

in the assertion language, stating that

\tilde{P}

and

\tilde{Q}

are bisimilar while taking into account any (pure) information that is available from the context. This is further clarified in Section 3.3.7, after we have discussed the models of the logic.

3.3.3. Models of the Program Logic

Before Section 3.3.7 discusses the semantics of assertions, this section first introduces permission heaps and process maps, which form the basis for the models of our concurrent separation logic. Permission heaps extend ordinary program heaps (i.e.,

Heap

) to capture the three different types t of heap ownership, whereas process maps capture the state and ownership of process-algebraic abstractions.

Let us start by introducing fractional permissions, which are used in the definitions of both permission heaps and process maps.

3.3.4. Fractional Permissions

In the assertion language, all heap/process ownership predicates have an associated rational number

π \in Q

. These are used to express the “amount” of ownership that is available to the corresponding heap location or program model.

We define a rational number

π

to be a (Boyland) fractional permission in case

π \in {(0, 1]}_{Q}

[36]. The original work of Boyland uses fractional permissions to distinguish between write access (

π = 1

) and read access (

0 < π < 1

) to some shared resource. However, in our work this is slightly different, since the fractional access permissions

π

annotated to

{\overset{π}{↪}}_{proc}

predicates never provide write access.

To conveniently handle fractional permissions, we define basic notions of validity (

{valid}_{Q}

) and disjointness (

⊥_{Q}

) of rational numbers, as follows.

Definition 17

(Permission validity, Permission disjointness).

\begin{array}{rcl}
{valid}_{Q} (π) & ≜ & 0 < π \leq 1 \\
π_{1} ⊥_{Q} π_{2} & ≜ & 0 < π_{1} \land 0 < π_{2} \land π_{1} + π_{2} \leq 1
\end{array}

The predicate

{valid}_{Q} : Q \to Prop

determines whether the given rational number is within the range

{(0, 1]}_{Q}

, that is, is a valid Boyland fractional permission. (Here

Prop

is the sort of propositions.) The binary relation

⊥_{Q} : Q \to Q \to Prop

determines disjointness of two rationals. Disjoint rational numbers do not overlap, in the sense that both operands are fractional permissions, and so is their sum.

Lemma 4.

{valid}_{Q}

and

⊥_{Q}

satisfy the following properties.

1.: If $π_{1} ⊥_{Q} π_{2}$ , then $π_{2} ⊥_{Q} π_{1}$ , ${valid}_{Q} (π_{1})$ , and ${valid}_{Q} (π_{1} + π_{2})$ .
2.: If $π_{1} ⊥_{Q} π_{2}$ and $(π_{1} + π_{2}) ⊥_{Q} π_{3}$ , then $π_{2} ⊥_{Q} π_{3}$ and $π_{1} ⊥_{Q} (π_{2} + π_{3})$ .
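
Definition 17 and Lemma 4 can be exercised with exact rational arithmetic. The sketch below uses Python's `fractions.Fraction`; the function names and the test grid `GRID` are assumptions made for this example, and checking a finite grid is of course only a sanity check, not a proof of the lemma.

```python
from fractions import Fraction
from itertools import product

def valid_q(pi):
    """pi is a valid (Boyland) fractional permission."""
    return 0 < pi <= 1

def disjoint_q(p1, p2):
    """Both operands are positive and their sum does not exceed 1."""
    return 0 < p1 and 0 < p2 and p1 + p2 <= 1

# Quarters from -1/4 to 5/4, so the grid also contains invalid permissions.
GRID = [Fraction(n, 4) for n in range(-1, 6)]

def lemma4_holds_on_grid():
    # Property 1: symmetry, and validity of the left operand and of the sum.
    for p1, p2 in product(GRID, repeat=2):
        if disjoint_q(p1, p2):
            if not (disjoint_q(p2, p1) and valid_q(p1) and valid_q(p1 + p2)):
                return False
    # Property 2: disjointness can be re-associated.
    for p1, p2, p3 in product(GRID, repeat=3):
        if disjoint_q(p1, p2) and disjoint_q(p1 + p2, p3):
            if not (disjoint_q(p2, p3) and disjoint_q(p1, p2 + p3)):
                return False
    return True
```

For instance, two halves are disjoint and add up to the full permission, whereas a half and three quarters are not disjoint.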

3.3.5. Permission Heaps

The models of our program logic use permission heaps to give a semantic meaning to heap ownership. Permission heaps and their heap cells are defined as follows, and are slightly richer than ordinary program heaps

(Heap)

to be able to administer the access permissions and the different ownership types.

Definition 18

(Permission heap cells, Permission heaps).

\begin{array}{rcl}
hc \in PermHeapCell & ::= & free \mid {〈 v 〉}_{std}^{π} \mid {〈 v 〉}_{proc}^{π} \mid {〈 v_{1}, v_{2} 〉}_{act}^{π} \mid inv \\
ph \in PermHeap & ≜ & Val \to PermHeapCell
\end{array}

Permission heaps

ph

are defined to be total functions from values (representing heap locations) to permission heap cells,

hc

, which in turn are inductively defined to be one of the following:

$free$ , which is an unoccupied heap cell.
${〈 v 〉}_{std}^{π}$ , which is a standard heap cell that stores the value $v \in Val$ . Standard heap cells are the models of the standard heap ownership predicates, ${\overset{π}{↪}}_{std}$ .
${〈 v 〉}_{proc}^{π}$ , which is a process heap cell that stores the value v. These are used as models of the ${\overset{π}{↪}}_{proc}$ ownership predicates.
${〈 v_{1}, v_{2} 〉}_{act}^{π}$ , which is an action heap cell that stores the value $v_{1}$ . Action heap cells are used as the models for the ${\overset{π}{↪}}_{act}$ predicates. Moreover, action heap cells store a second value $v_{2}$ . This extra value is maintained for technical reasons, to help in establishing soundness of the program logic. The value $v_{2}$ is referred to as a snapshot value: a copy of the original value stored by the heap cell, that is made when an action block was entered.
$inv$ , which is an invalid, or corrupted, permission heap cell.

Note that, unlike program heaps, permission heaps are defined to be total functions, where the heap cells have an explicit notion of being

free

. This is done to give permission heaps and their cells nicer algebraic properties. The unit permission heap is defined to be

𝟙_{ph} ≜ λ v \in Val . free

, containing

free

at every entry. Furthermore, permission heap cells also have an explicit notion of being invalid. Invalid heap cells

inv

represent the erroneous result of composing two incompatible heap cells.

We now define several operations on permission heaps.

Validity. Any permission heap

ph

is defined to be valid if the permissions of all

ph

’s heap cells are valid, where

free

is always valid and

inv

is never valid.

Definition 19

(Validity of permission heaps). A permission heap

ph

is defined to be valid, written

{valid}_{ph} (ph)

, if

{valid}_{hc} (p h (v))

holds for every

v \in Val

, where the

{valid}_{hc}

predicate is defined as follows.

{valid}_{hc} (hc) ≜ \begin{cases} true & \text{if } hc = free \\ {valid}_{Q} (π) & \text{if } hc = {〈 v 〉}_{std}^{π} \lor hc = {〈 v 〉}_{proc}^{π} \lor hc = {〈 v, v^{'} 〉}_{act}^{π} \text{ for some } v, v^{'} \\ false & \text{if } hc = inv \end{cases}

Disjointness. Two permission heaps

{ph}_{1}

and

{ph}_{2}

are disjoint if all their heap cells are pairwise compatible and their underlying permissions are disjoint.

Definition 20

(Disjointness of permission heaps). Two permission heaps,

{ph}_{1}

and

{ph}_{2}

, are disjoint, denoted

{ph}_{1} ⊥_{ph} {ph}_{2}

, if

{ph}_{1} (v) ⊥_{hc} {ph}_{2} (v)

holds for every

v \in Val

, where the

⊥_{hc}

relation is defined as follows.

{hc}_{1} ⊥_{hc} {hc}_{2} ≜ \begin{cases} {valid}_{hc} ({hc}_{2}) & \text{if } {hc}_{1} = free \\ {valid}_{hc} ({hc}_{1}) & \text{if } {hc}_{2} = free \\ π_{1} ⊥_{Q} π_{2} \land v_{1} = v_{2} & \text{if } {hc}_{1} = {〈 v_{1} 〉}_{std}^{π_{1}} \land {hc}_{2} = {〈 v_{2} 〉}_{std}^{π_{2}} \\ π_{1} ⊥_{Q} π_{2} \land v_{1} = v_{2} & \text{if } {hc}_{1} = {〈 v_{1} 〉}_{proc}^{π_{1}} \land {hc}_{2} = {〈 v_{2} 〉}_{proc}^{π_{2}} \\ π_{1} ⊥_{Q} π_{2} \land v_{1} = v_{2} \land v_{1}^{'} = v_{2}^{'} & \text{if } {hc}_{1} = {〈 v_{1}, v_{1}^{'} 〉}_{act}^{π_{1}} \land {hc}_{2} = {〈 v_{2}, v_{2}^{'} 〉}_{act}^{π_{2}} \\ false & \text{otherwise} \end{cases}

Disjoint union. The following operation defines the disjoint union (i.e., the composition) of two permission heaps.

Definition 21

(Disjoint union of permission heaps). The disjoint union

{ph}_{1} ⊎_{ph} {ph}_{2}

of any two permission heaps

{ph}_{1}, {ph}_{2}

is defined to be the permission heap

λ v \in Val . {ph}_{1} (v) ⊎_{hc} {ph}_{2} (v)

, with

⊎_{hc}

defined as follows.

{hc}_{1} ⊎_{hc} {hc}_{2} ≜ \begin{cases} {hc}_{1} & \text{if } {hc}_{2} = free \\ {hc}_{2} & \text{if } {hc}_{1} = free \\ {〈 v_{1} 〉}_{std}^{π_{1} + π_{2}} & \text{if } {hc}_{1} = {〈 v_{1} 〉}_{std}^{π_{1}} \land {hc}_{2} = {〈 v_{2} 〉}_{std}^{π_{2}} \land v_{1} = v_{2} \\ {〈 v_{1} 〉}_{proc}^{π_{1} + π_{2}} & \text{if } {hc}_{1} = {〈 v_{1} 〉}_{proc}^{π_{1}} \land {hc}_{2} = {〈 v_{2} 〉}_{proc}^{π_{2}} \land v_{1} = v_{2} \\ {〈 v_{1}, v_{1}^{'} 〉}_{act}^{π_{1} + π_{2}} & \text{if } {hc}_{1} = {〈 v_{1}, v_{1}^{'} 〉}_{act}^{π_{1}} \land {hc}_{2} = {〈 v_{2}, v_{2}^{'} 〉}_{act}^{π_{2}} \land v_{1} = v_{2} \land v_{1}^{'} = v_{2}^{'} \\ inv & \text{otherwise} \end{cases}

Note that

⊎_{hc}

only gives a non-corrupted entry when applied to two compatible heap cells. Furthermore,

free

is neutral with respect to

⊎_{hc}

while

inv

is absorbing.

Below are the most important properties of validity, disjointness and disjoint union.

Lemma 5.

(The analogous operations on permission heap cells have the exact same properties.)

1.

{ph}_{1} ⊎_{ph} ({ph}_{2} ⊎_{ph} {ph}_{3}) = ({ph}_{1} ⊎_{ph} {ph}_{2}) ⊎_{ph} {ph}_{3}

.

2.

{ph}_{1} ⊎_{ph} {ph}_{2} = {ph}_{2} ⊎_{ph} {ph}_{1}

.

3.

ph ⊎_{ph} 𝟙_{ph} = ph

.

4.

If

{ph}_{1} ⊥_{ph} {ph}_{2}

, then

{valid}_{ph} ({ph}_{1} ⊎_{ph} {ph}_{2})

.

5.

If

{ph}_{1} ⊥_{ph} {ph}_{2}

and

({ph}_{1} ⊎_{ph} {ph}_{2}) ⊥_{ph} {ph}_{3}

, then also

(a): ${ph}_{2} ⊥_{ph} {ph}_{3}$ and
(b): ${ph}_{1} ⊥_{ph} ({ph}_{2} ⊎_{ph} {ph}_{3})$ .
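
Definitions 19 to 21 can be sketched executably as follows. The encoding is an assumption made for this example: the strings `"free"` and `"inv"` stand for the two special cells, occupied cells are tuples `(type, π, v)` or `("act", π, v, snapshot)`, and a permission heap is a dict in which every location that is not listed is implicitly free. All function names are hypothetical.

```python
from fractions import Fraction

FREE, INV = "free", "inv"

def valid_hc(hc):
    if hc == FREE:
        return True
    if hc == INV:
        return False
    return 0 < hc[1] <= 1                 # validity of the cell's permission

def disjoint_hc(c1, c2):
    if c1 == FREE:
        return valid_hc(c2)
    if c2 == FREE:
        return valid_hc(c1)
    if c1 == INV or c2 == INV:
        return False
    # Same ownership type and stored value(s), and disjoint permissions.
    same_shape = c1[0] == c2[0] and c1[2:] == c2[2:]
    return same_shape and 0 < c1[1] and 0 < c2[1] and c1[1] + c2[1] <= 1

def union_hc(c1, c2):
    if c2 == FREE:
        return c1
    if c1 == FREE:
        return c2
    if c1 == INV or c2 == INV:
        return INV
    if c1[0] == c2[0] and c1[2:] == c2[2:]:
        return (c1[0], c1[1] + c2[1]) + c1[2:]   # add up the permissions
    return INV                                    # incompatible cells

def _locations(ph1, ph2):
    return set(ph1) | set(ph2)

def valid_ph(ph):
    return all(valid_hc(hc) for hc in ph.values())

def disjoint_ph(ph1, ph2):
    return all(disjoint_hc(ph1.get(v, FREE), ph2.get(v, FREE))
               for v in _locations(ph1, ph2))

def union_ph(ph1, ph2):
    return {v: union_hc(ph1.get(v, FREE), ph2.get(v, FREE))
            for v in _locations(ph1, ph2)}

HALF = Fraction(1, 2)
PH_A = {1: ("std", HALF, 42)}
PH_B = {1: ("std", HALF, 42), 2: ("proc", Fraction(1), 7)}
PH_C = {1: ("std", HALF, 43)}
```

`PH_A` and `PH_B` are disjoint, and their union holds the full permission at location 1. `PH_A` and `PH_C` disagree on the stored value, so their union is corrupted at that location.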

3.3.6. Process Maps

The models of the logic also use process maps in addition to permission heaps, to give a semantic meaning to process ownership predicates

{Proc}_{π}

in the logic. Process maps and their entries are defined as follows, where binders

Λ

are finite partial mappings from process variables to heap locations (i.e., values). These binders are the models for the abstraction binders

Π

defined earlier in Definition 8.

Definition 22

(Process map entries, process maps, binders).

\begin{array}{rcl}
me \in ProcMapEntry & ::= & free \mid {〈 P, Λ 〉}^{π} \mid inv \\
pm \in ProcMap & ≜ & Val \to ProcMapEntry \\
Λ \in Binder & ≜ & ProcVar ⇀_{fin} Val
\end{array}

Process maps are total mappings from values (identifying program abstractions) to process map entries, which are, in turn, inductively defined to be one of the following three elements:

$free$ , which models unoccupied or free entries in $pm$ .
${〈 P, Λ 〉}^{π}$ , which is an occupied process map entry. These are used as models for the ${Proc}_{π} (E, \tilde{P}, Π)$ assertions, where E identifies the process map entry in $pm$ , and the binder $Λ$ is a model for $Π$ .
$inv$ , which denotes an invalid, or corrupted, process map entry.

As with permission heaps, process maps are defined as total functions with entries that can explicitly be

free

or

invalid

, as this provides desirable algebraic properties. Corrupted entries represent the erroneous result of taking the disjoint union of two incompatible, non-disjoint entries. The unit process map is defined to be

𝟙_{pm} ≜ λ v \in Val . free

, containing

free

at every entry.

We now define several operations and relations on process maps that are analogous to the operations defined earlier for permission heaps, starting with bisimilarity.

Bisimilarity. Any two process maps are said to be bisimilar, if all their entries are equal point-wise, or contain occupied entries with process components that are bisimilar.

Definition 23

(Process map bisimilarity). Two process maps

{pm}_{1}

and

{pm}_{2}

are defined to be bisimilar, denoted

{pm}_{1} ≅_{pm} {pm}_{2}

, if

{pm}_{1} (v) ≅_{mc} {pm}_{2} (v)

for every

v \in Val

, with the relation

≅_{mc}

defined as follows.

{me}_{1} ≅_{mc} {me}_{2} ≜ \{\begin{matrix} true & if {me}_{1} = free \land {me}_{2} = free \\ P_{1} ≅ P_{2} \land Λ_{1} = Λ_{2} \land π_{1} = π_{2} & if {me}_{1} = {〈 P_{1}, Λ_{1} 〉}^{π_{1}} \land {me}_{2} = {〈 P_{2}, Λ_{2} 〉}^{π_{2}} \\ true & if {me}_{1} = inv \land {me}_{2} = inv \\ false & otherwise \end{matrix}

Both

≅_{pm}

and

≅_{mc}

are equivalence relations. A notion of bisimilarity of process maps is needed in addition to ordinary equality since, for example, the disjoint union of process maps is neither associative nor commutative with respect to ordinary equality, whereas it is both with respect to bisimilarity. Moreover, we will later see that the program logic always allows replacing processes

\tilde{P}

inside

{Proc}_{π} (X, \tilde{P}, Π)

predicates by bisimilar ones, as discussed earlier. To handle such replacements at the semantic level, we allow process maps and their entries to be handled up to

≅_{pm}

and

≅_{mc}

, respectively.

Validity. Any process map

pm

is said to be valid intuitively if none of

pm

’s entries are corrupt and all occupied entries of

pm

hold a valid associated fractional permission.

Definition 24

(Process map validity). Any process map

pm

is defined to be valid, denoted

{valid}_{pm} (pm)

, if

{valid}_{mc} (pm (v))

holds for every

v \in Val

, with the

{valid}_{mc} : ProcMapEntry \to Prop

predicate defined as follows.

{valid}_{mc} (me) ≜ \{\begin{matrix} true & if me = free \\ {valid}_{Q} (π) & if me = {〈 P, Λ 〉}^{π} for some P and Λ \\ false & if me = inv \end{matrix}

It is not difficult to see that

𝟙_{pm}

is trivially valid, and that bisimilarity is validity-preserving, i.e.,

{valid}_{pm} ({pm}_{1})

and

{pm}_{1} ≅_{pm} {pm}_{2}

implies

{valid}_{pm} ({pm}_{2})

for every

{pm}_{1}

and

{pm}_{2}

; and likewise for

{valid}_{mc}

.

Disjointness. Two process maps are said to be disjoint if none of their entries are corrupt, and all fractional permissions of their entries are point-wise disjoint, as captured by the following definition.

Definition 25

(Process map disjointness). Any two process maps

{pm}_{1}

and

{pm}_{2}

are defined to be disjoint, denoted

{pm}_{1} ⊥_{pm} {pm}_{2}

, if

{pm}_{1} (v) ⊥_{mc} {pm}_{2} (v)

for every

v \in Val

, with the

⊥_{mc}

relation defined as follows.

{me}_{1} ⊥_{mc} {me}_{2} ≜ \{\begin{matrix} {valid}_{mc} ({me}_{1}) & if {me}_{2} = free \\ {valid}_{mc} ({me}_{2}) & if {me}_{1} = free \\ π_{1} ⊥_{Q} π_{2} \land Λ_{1} = Λ_{2} & if {me}_{1} = {〈 P_{1}, Λ_{1} 〉}^{π_{1}} \land {me}_{2} = {〈 P_{2}, Λ_{2} 〉}^{π_{2}} for some P_{1} and P_{2} \\ false & otherwise \end{matrix}

The intuition of disjointness is that disjoint process maps can safely be composed without corrupting any of their entries. Disjointness is a symmetric relation and is a congruence with respect to bisimilarity, meaning that

{pm}_{1} ⊥_{pm} {pm}_{2}

and

{pm}_{1} ≅_{pm} {pm}_{1}^{'}

and

{pm}_{2} ≅_{pm} {pm}_{2}^{'}

implies

{pm}_{1}^{'} ⊥_{pm} {pm}_{2}^{'}

.

Disjoint union. The following operation defines the disjoint union of two process maps (and of their entries).

Definition 26

(Disjoint union of process maps). The disjoint union of two process maps

{pm}_{1}

and

{pm}_{2}

is defined as

{pm}_{1} ⊎_{pm} {pm}_{2} ≜ λ v \in Val . {pm}_{1} (v) ⊎_{mc} {pm}_{2} (v)

, with

⊎_{mc}

defined as follows.

{me}_{1} ⊎_{mc} {me}_{2} ≜ \{\begin{matrix} {me}_{1} & if {me}_{2} = free \\ {me}_{2} & if {me}_{1} = free \\ {〈 P_{1} ‖ P_{2}, Λ_{1} 〉}^{π_{1} + π_{2}} & if {me}_{1} = {〈 P_{1}, Λ_{1} 〉}^{π_{1}} \land {me}_{2} = {〈 P_{2}, Λ_{2} 〉}^{π_{2}} \land Λ_{1} = Λ_{2} \\ inv & otherwise \end{matrix}

As with the disjoint union of permission heaps, the composition of incompatible process map entries produces a corrupted

inv

entry. The

free

entry is again neutral, whereas

inv

is absorbing (that is, composing

inv

with any entry results in

inv

). Disjoint union is a congruence with respect to bisimilarity, so that

{pm}_{1} ≅_{pm} {pm}_{2}

and

{pm}_{1}^{'} ≅_{pm} {pm}_{2}^{'}

implies

{pm}_{1} ⊎_{pm} {pm}_{1}^{'} ≅_{pm} {pm}_{2} ⊎_{pm} {pm}_{2}^{'}

.
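
Definitions 24 to 26 can be exercised on individual process map entries with the following Python sketch. It is an illustration under stated assumptions rather than the authors' formalisation: the class and function names are hypothetical, processes are opaque terms with `("par", P1, P2)` standing for P_1 ‖ P_2, binders are encoded as tuples of (process variable, location) pairs, and the fractional permission predicates {valid}_{Q} and ⊥_{Q} (defined earlier in the paper, outside this section) are assumed to mean 0 < π ≤ 1 and π_1 + π_2 ≤ 1 with both summands positive.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class Free:
    """Unoccupied process map entry."""


@dataclass(frozen=True)
class Inv:
    """Corrupted process map entry."""


@dataclass(frozen=True)
class Occupied:
    proc: object    # process term; ("par", P1, P2) encodes P1 || P2
    binder: tuple   # binder as a tuple of (process variable, location) pairs
    perm: Fraction  # fractional permission


def valid_perm(p):
    # Assumption: valid_Q(p) means 0 < p <= 1.
    return 0 < p <= 1


def disjoint_perm(p1, p2):
    # Assumption: p1 and p2 are disjoint if both are positive and sum to at most 1.
    return p1 > 0 and p2 > 0 and p1 + p2 <= 1


def valid_mc(me):
    if isinstance(me, Free):
        return True
    if isinstance(me, Occupied):
        return valid_perm(me.perm)
    return False  # inv is never valid


def disjoint_mc(me1, me2):
    if isinstance(me2, Free):
        return valid_mc(me1)
    if isinstance(me1, Free):
        return valid_mc(me2)
    if isinstance(me1, Occupied) and isinstance(me2, Occupied):
        return disjoint_perm(me1.perm, me2.perm) and me1.binder == me2.binder
    return False


def union_mc(me1, me2):
    if isinstance(me2, Free):
        return me1
    if isinstance(me1, Free):
        return me2
    if (isinstance(me1, Occupied) and isinstance(me2, Occupied)
            and me1.binder == me2.binder):
        # Processes are composed in parallel and permissions are added.
        return Occupied(("par", me1.proc, me2.proc), me1.binder,
                        me1.perm + me2.perm)
    return Inv()
```

In this encoding, `union_mc(a, b)` and `union_mc(b, a)` for two occupied entries differ syntactically (P ‖ Q versus Q ‖ P), which illustrates why commutativity of disjoint union is stated up to bisimilarity rather than equality. The operations on whole process maps are obtained by applying these entry-level functions point-wise.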

Lemma 6.

(The analogous operations on process map entries have the exact same properties.)

1.

{pm}_{1} ⊎_{pm} ({pm}_{2} ⊎_{pm} {pm}_{3}) ≅_{pm} ({pm}_{1} ⊎_{pm} {pm}_{2}) ⊎_{pm} {pm}_{3}

.

2.

{pm}_{1} ⊎_{pm} {pm}_{2} ≅_{pm} {pm}_{2} ⊎_{pm} {pm}_{1}

.

3.

pm ⊎_{pm} 𝟙_{pm} ≅_{pm} pm

.

4.

If

{pm}_{1} ⊥_{pm} {pm}_{2}

, then

{valid}_{pm} ({pm}_{1} ⊎_{pm} {pm}_{2})

.

5.

If

{pm}_{1} ⊥_{pm} {pm}_{2}

and

({pm}_{1} ⊎_{pm} {pm}_{2}) ⊥_{pm} {pm}_{3}

, then also

(a): ${pm}_{2} ⊥_{pm} {pm}_{3}$ , and
(b): ${pm}_{1} ⊥_{pm} ({pm}_{2} ⊎_{pm} {pm}_{3})$ .

3.3.7. Semantics of Assertions

Let us now define the semantic meaning of assertions. The semantics of assertions is defined in terms of a satisfaction relation

ph, pm, s, g ⊨ P

stating that the assertion

P

is satisfied by the model

(ph, pm, s, g)

. Its definition depends on an operation

[[\cdot]] : AbstrBinder \to Store \to Binder

for evaluating abstraction binders

Π = {x_{0} \mapsto E_{0}, \dots, x_{n} \mapsto E_{n}}

, that is defined as follows.

Definition 27

(Abstraction binder evaluation).

\begin{matrix} [[{x_{0} \mapsto E_{0}, \dots, x_{n} \mapsto E_{n}}]] s ≜ λ x . \{\begin{matrix} [[E_{i}]] s & if x = x_{i} for some 0 \leq i \leq n \\ undefined & otherwise \end{matrix} \end{matrix}
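
As a minimal illustration of Definition 27, the Python sketch below evaluates an abstraction binder in a store. The encoding is an assumption made for this example only: stores are dictionaries, expressions E_i are modelled as Python callables from stores to values, and the function name `eval_binder` is hypothetical. The resulting binder is a finite partial mapping, so "undefined" corresponds to an absent key.

```python
def eval_binder(abstr_binder, store):
    """Evaluate every expression of the abstraction binder in the given store.

    abstr_binder: dict from process variable names to expressions, where an
                  expression is modelled as a callable taking the store.
    store:        dict from program variable names to values.
    Returns a dict from process variable names to values (the binder).
    """
    return {x: expr(store) for x, expr in abstr_binder.items()}


# Example: x0 is bound to the value of X, and x1 to the value of Y + 1.
example_binder = {
    "x0": lambda s: s["X"],
    "x1": lambda s: s["Y"] + 1,
}
example_store = {"X": 10, "Y": 4}
result = eval_binder(example_binder, example_store)
```

Process variables outside the domain of the abstraction binder, such as a hypothetical `x2` here, are simply not keys of `result`.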

Moreover, recall that hybrid processes may contain both program variables and process-algebraic variables. The semantics of assertions relies on a closure operation for “closing” processes with respect to any program variable occurring in it. More specifically, given any hybrid process

\tilde{P}

and store s, the s-closure of

\tilde{P}

, written

\tilde{P} [s]

, is defined to be

\tilde{P} {[X / s (X)]}_{X \in FV (\tilde{P})}

, i.e., replacing every free program variable X in

\tilde{P}

by

s (X)

. This operation is “closing”

\tilde{P}

in the sense that

FV (\tilde{P} [s]) = \emptyset

and

\tilde{P} [s] \in Proc

.

Definition 28

(Semantics of assertions). The modelling relation

ph, pm, s, g ⊨ P

is defined by structural recursion on

P

as follows.

\begin{matrix} ph, pm, s, g ⊨ B & iff & [[B]] s \\ ph, pm, s, g ⊨ \forall X . P & iff & \forall v . (ph, pm, s [X \mapsto v], g [X \mapsto v]) ⊨ P \\ ph, pm, s, g ⊨ \exists X . P & iff & \exists v . (ph, pm, s [X \mapsto v], g [X \mapsto v]) ⊨ P \\ ph, pm, s, g ⊨ P_{1} \lor P_{2} & iff & ph, pm, s, g ⊨ P_{1} \lor ph, pm, s, g ⊨ P_{2} \\ ph, pm, s, g ⊨ P_{1} * P_{2} & iff & \exists {ph}_{1}, {ph}_{2} . {ph}_{1} ⊥_{ph} {ph}_{2} \land {ph}_{1} ⊎_{ph} {ph}_{2} = ph \land \\ & & \exists {pm}_{1}, {pm}_{2} . {pm}_{1} ⊥_{pm} {pm}_{2} \land {pm}_{1} ⊎_{pm} {pm}_{2} ≅_{pm} pm \land \\ & & {ph}_{1}, {pm}_{1}, s, g ⊨ P_{1} \land {ph}_{2}, {pm}_{2}, s, g ⊨ P_{2} \\ ph, pm, s, g ⊨ *_{i \in I} P_{i} & iff & ph, pm, s, g ⊨ P_{i_{0}} * \dots * P_{i_{n}} for I = {i_{0}, \dots, i_{n}} \\ ph, pm, s, g ⊨ P_{1} - * P_{2} & iff & \forall {ph}^{'}, {pm}^{'} . (ph ⊥_{ph} {ph}^{'} \land pm ⊥_{pm} {pm}^{'} \land {ph}^{'}, {pm}^{'}, s, g ⊨ P_{1}) \Rightarrow \\ & & ph ⊎_{ph} {ph}^{'}, pm ⊎_{pm} {pm}^{'}, s, g ⊨ P_{2} \\ ph, pm, s, g ⊨ E_{1} {\overset{π}{↪}}_{std} E_{2} & iff & {valid}_{Q} (π) \land \exists π^{'} . ph ([[E_{1}]] s) = {〈 [[E_{2}]] s 〉}_{std}^{π^{'}} \land π \leq π^{'} \\ ph, pm, s, g ⊨ E_{1} {\overset{π}{↪}}_{proc} E_{2} & iff & {valid}_{Q} (π) \land \exists π^{'} . ph ([[E_{1}]] s) = {〈 [[E_{2}]] s 〉}_{proc}^{π^{'}} \land π \leq π^{'} \\ ph, pm, s, g ⊨ E_{1} {\overset{π}{↪}}_{act} E_{2} & iff & {valid}_{Q} (π) \land \exists π^{'}, v . ph ([[E_{1}]] s) = {〈 [[E_{2}]] s, v 〉}_{act}^{π^{'}} \land π \leq π^{'} \\ ph, pm, s, g ⊨ {Proc}_{π} (E, \tilde{P}, Π) & iff & \exists {me}^{'} . me ⊥_{mc} {me}^{'} \land pm ([[E]] g) ≅_{mc} me ⊎_{mc} {me}^{'} \\ & & where me = {〈 \tilde{P} [s], [[Π]] s 〉}^{π} \\ ph, pm, s, g ⊨ \tilde{P} \approx \tilde{Q} & iff & \tilde{P} [s] ≅ \tilde{Q} [s] \end{matrix}

As usual, any separating conjunction

P_{1} * P_{2}

is satisfied by a model

(ph, pm, s, g)

if that model can both be split along

ph

and

pm

into two disjoint models, such that one satisfies

P_{1}

and the other satisfies

P_{2}

. The semantic meaning of iterated separating conjunctions can be expressed simply in terms of the interpretation of the binary separating conjunction. Magic wands

P_{1} - * P_{2}

are satisfied by a model if, for any disjoint extension of that model satisfying

P_{1}

, the extended model satisfies

P_{2}

.

Moving to the non-standard connectives: heap ownership assertions

E {\overset{π}{↪}}_{t} E^{'}

are satisfied if the permission heap holds an entry at location E that matches with the ownership type t, with an associated fractional permission that is at least

π

. Process ownership assertions

{Proc}_{π} (E, \tilde{P}, Π)

are satisfied if the process map holds a matching entry at the position described by E, with a fractional permission at least

π

, and a process that at least includes all the behaviours of the process

\tilde{P} [s]

. Finally,

\tilde{P} \approx \tilde{Q}

is satisfied if

\tilde{P}

and

\tilde{Q}

are bisimilar with respect to the current state. To give an example of the use of ≈, consider the assertion

{Proc}_{π} (E, 0 < X : \tilde{P}, Π) * X = 2

. One might wish to replace

0 < X : \tilde{P}

with

\tilde{P}

, considering that

X = 2

. But since

0 < X : \tilde{P}

is a process that includes program variables (namely X), one cannot immediately deduce that it is bisimilar to

\tilde{P}

according to Definition 7. However, we do have that

0 < X : \tilde{P} \approx \tilde{P} * X = 2

, since for every model

(ph, pm, s, g)

satisfying this assertion it holds that

s (X) = 2

. We shall later give entailment rules that allow such context-dependent bisimulation equivalences to be used to simplify processes inside

{Proc}_{π}

ownership predicates.
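
The "at least π" reading of heap ownership assertions in Definition 28 can be illustrated with a short Python sketch. It is a simplification under explicit assumptions: the permission heap is a dictionary whose missing keys model free cells, occupied cells are triples (type, value, permission), the extra component of act cells is omitted because it is existentially quantified, the location and value are already evaluated, and {valid}_{Q} (π) is assumed to mean 0 < π ≤ 1. The function name is hypothetical.

```python
from fractions import Fraction


def satisfies_points_to(ph, loc, value, kind, perm):
    """Check whether the permission heap satisfies a points-to assertion of
    ownership type `kind` with permission `perm` at location `loc`."""
    if not (0 < perm <= 1):   # assumed meaning of valid_Q(perm)
        return False
    cell = ph.get(loc)        # a missing key models a free cell
    if cell is None:
        return False
    cell_kind, cell_value, cell_perm = cell
    # The heap may hold more permission than the assertion claims.
    return cell_kind == kind and cell_value == value and perm <= cell_perm


# Example heap: location 1 holds 42 with standard ownership and permission 3/4.
example_ph = {1: ("std", 42, Fraction(3, 4))}
```

With `example_ph`, an assertion claiming permission 1/2 at location 1 is satisfied, while one claiming full permission is not. This mirrors the intuitionistic character of the logic: an assertion never demands that the model holds exactly the stated resources.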

Lemma 7.

The ⊨ modelling relation satisfies the following properties:

1.: $ph, pm, s, g ⊨ P$ and $pm ≅_{pm} {pm}^{'}$ implies $ph, {pm}^{'}, s, g ⊨ P$ .
2.: If $ph, pm, s, g ⊨ P$ , then for any ${ph}^{'}$ and ${pm}^{'}$ such that $ph ⊥_{ph} {ph}^{'}$ and $pm ⊥_{pm} {pm}^{'}$ it holds that $ph ⊎_{ph} {ph}^{'}, pm ⊎_{pm} {pm}^{'}, s, g ⊨ P$ .

Lemma 7.1 is essential for allowing replacing process-algebraic abstractions by bisimilar ones inside the program logic. Lemma 7.2 expresses monotonicity, and states that adding resources does not invalidate the satisfiability of any assertion (i.e., adding more resources makes the assertion “more true”). This is a key property of intuitionistic separation logic and is necessary for proving soundness of the weakening rule, which we introduce later in Section 3.4.1.

3.3.8. Semantic Entailment

Let the denotation

[[P]] ≜ {(ph, pm, s, g) | (ph, pm, s, g) ⊨ P}

be the set of all models that satisfy the assertion

P

. Given any two assertions

P

and

Q

, the assertion

P

is defined to semantically entail

Q

, denoted

P ⊨ Q

, if every model of

P

is also a model of

Q

, that is,

P ⊨ Q ≜ [[P]] \subseteq [[Q]]

. Semantic entailment is thus a preorder and a congruence for all connectives of the assertion language.

3.4. Proof System

This section introduces the proof system of our model-based verification technique, which consists of structural proof rules (Section 3.4.1) as well as Hoare proof rules (Section 3.4.2). This proof system essentially extends the CSL of [34] by adding permission accounting [36,37] and machinery for handling process-algebraic program abstractions.

3.4.1. Entailment Rules

Figure 3 presents the structural rules of the program logic. The notation

P ⊣ ⊢ Q

is a shorthand for both

P ⊢ Q

and

Q ⊢ P

, and indicates that the rule can be used in both directions.

The rules for the standard connectives are mostly as expected. PLAIN-DUPL expresses that plain expressions can freely be duplicated, whereas *-PLAIN shows that * has the same meaning as ∧ in the case of plain assertions. The rule *-WEAK shows that our concurrent separation logic is affine (intuitionistic) by allowing resources to be forgotten. The rules *-ASSOC and *-COMM express that the separating conjunction is associative and commutative, respectively, whereas *-TRUE allows composing any resource with

true

. The rule TRUE-INTRO is the introduction rule for

true

, while FALSE-ELIM is the elimination rule for

false

stating that anything can be derived from falsehood. The

- *

-INTRO and

- *

-ELIM rules show that magic wands can be used similarly to the modus ponens inference rule of propositional logic, with respect to *. The rules ∀-INTRO, ∀-ELIM, ∃-INTRO and ∃-ELIM are the standard introduction and elimination rules for universal and existential quantifiers. ITER-SPLIT-MERGE enables splitting and merging iterated separating conjunctions along the associated (finite) index set.

Clarifying the rules for handling heap ownership:

↪

-SPLIT-MERGE expresses that heap ownership predicates

{\overset{π}{↪}}_{t}

of any type t may be split (in the left-to-right direction) as well as be merged (right-to-left) along

π

. Note however, that multiple points-to predicates for the same heap location may only co-exist if they have the same ownership type, as indicated by the

↪

-INCOMPATIBLE rule. Any heap ownership assertion with an invalid fractional permission associated to it entails

false

by

↪

-INVALID. Furthermore, the *-PROCACT-SPLIT-MERGE inference rule states that iterated

procact

heap ownership predicates can be split into disjoint iterated

proc

and

act

predicates, or be merged into one such iteration.

Moving to the entailment rules for handling process-algebraic abstractions: PROC-≅ allows replacing any process by one that is bisimilar in the current context. The rules ≈-REFL, ≈-SYMM and ≈-TRANS show that context-dependent bisimilarity forms an equivalence relation in the logic with respect to separating conjunction, while ≈-CONG-∘, ≈-CONG-SUM, ≈-CONG-COND and ≈-CONG-ITER together show that ≈ is a congruence relation in the logic for all process-algebraic connectives. The rules ≈-COND-TRUE and ≈-COND-FALSE allow eliminating conditionals in any process (together with the PROC-≅ rule, that is). Moreover, ≈-SUM-ALT allows singling out one choice from a process-algebraic summation of choices. Notice here that one can pick any arbitrary program expression for singling out such a choice, which makes this rule particularly useful. Similarly to heap ownership, any process ownership with an invalid fractional permission entails

false

by Proc-INVALID. The rule ≈-TERM again makes explicit the intuitive meaning of successful termination, matching Proposition 1. Finally, Proc-SPLIT-MERGE allows splitting and merging process ownership predicates in the same style as

{\overset{π}{↪}}_{t}

, to distribute parallel processes over parallel threads. Notably, by splitting a predicate

{Proc}_{π_{1} + π_{2}} (X, \tilde{P_{1}} ‖ \tilde{P_{2}}, Π)

into two, both parts can be distributed over different concurrent threads in the program logic, so that thread i can establish that it executes as prescribed by its part

{Proc}_{π_{i}} (X, \tilde{P_{i}}, Π)

of the abstract model. Afterwards, when the threads join again the remaining partial abstractions can be merged back into a single

{Proc}_{π_{1} + π_{2}}

predicate. This system of splitting and merging thus provides a compositional, thread-modular way of verifying that programs meet their abstraction. The logical machinery of this is further discussed in Section 3.4.2.

Any deduction that can be derived using the rules of Figure 3 is sound in the standard sense:

Theorem 2

(Soundness of the entailment rules).

P ⊢ Q

implies

P ⊨ Q

.

3.4.2. Program Judgments

We now define program judgments and give the Hoare rules of the program logic. Judgments of programs are sequents (quintuples) of the form

Γ; R ⊢ {P} C {Q}

. The right-hand side is a traditional Hoare triple, whereas

R

is a resource invariant that captures resources available only to atomically executing programs (i.e., in executions that are free of thread interference), and

Γ

is an environment in the style of the interface specifications of [38]. These process environments are defined as follows.

Definition 29

(Process environments).

\begin{matrix} Γ \in ProcEnv : : = \emptyset | Γ, {\tilde{B}} P (x) \end{matrix}

That is, process environments are comma-separated sequences of pairs

{\tilde{B}} P (x)

of processes P and their precondition

\tilde{B}

, with x a placeholder variable for an input parameter that may occur freely in both P and

\tilde{B}

. Note that processes do not have postconditions here; if desired, one could encode process Hoare triples on top of these pairs as

{\tilde{B_{pre}}} P (x) {\tilde{B_{post}}} ≜ {\tilde{B_{pre}}} (P (x) \cdot ? (\tilde{B_{post}}))

—by adding a trailing assertion. Moreover, even though process environments are given as sequences, they are used as if they were (finite) sets, as is customary, in the sense that the order of their pairs is unimportant.

Process environments contain the contracts of the process-algebraic models defined for the program under verification. In particular, they allow for assume-guarantee style reasoning: the proof system may assume validity of these contracts when dealing with process-algebraic models, since they must be guaranteed externally, for example via interactive theorem proving or model checking, e.g., using mCRL2 [25]. Validity of process contracts and process environments is defined as follows.

Definition 30

(Validity of process environments). Any pair

{\tilde{B}} P (x)

of a process P and its precondition

\tilde{B}

is defined to be valid, denoted

⊨ {\tilde{B}} P (x)

, if

\forall s, σ . [[\tilde{B} [s]]] σ \Rightarrow (P, σ)

.

Any process environment Γ is defined to be valid if

⊨_{env} Γ

, which is a judgment inductively defined by the following two rules.

\frac{}{⊨_{env} \emptyset} \qquad \frac{⊨_{env} Γ \qquad ⊨ {\tilde{B}} P (x)}{⊨_{env} Γ, {\tilde{B}} P (x)}

Figure 4 and Figure 5 give the Hoare rules of the logic. The standard structural rules are essentially the same as the ones of classical CSL [34]. One minor difference is that HT-ATOMIC leaves

true

instead of “

emp

” after obtaining a resource invariant, since our logic is intuitionistic. (The assertion language of classical CSL contains an extra

emp

construct, for explicitly denoting that the heap is empty. In our intuitionistic version of the logic, resources are allowed to be thrown away using *-WEAK. As a consequence, assertions cannot express “precise” properties about the content of the heap, including emptiness of heaps.) Moreover, our assertion language does not contain the logical conjunction ∧ connective, but has separating conjunction instead.

3.4.3. Heap Ownership

The HT-READ rule states that reading from the heap is allowed with any type t of heap ownership

{\overset{π}{↪}}_{t}

, whereas heap writing (HT-WRITE) is only allowed with ownership predicates of type

std

or

act

. The HT-WRITE rule thus restricts

{\overset{π}{↪}}_{proc}

assertions to exclusively grant read-access to the associated location. We will in a moment see that the proof rule for

action

programs can upgrade

E {\overset{π}{↪}}_{proc} E^{'}

predicates to

E {\overset{π}{↪}}_{act} E^{'}

to regain write access to the heap location at E. This system of upgrading enforces that all modifications to E are captured by the program abstraction to which the heap location is subject, inside an action block. The rule HT-ALLOC for heap allocation generates a new points-to predicate of type

std

, indicating that the allocated heap location is not (yet) subject to any program abstraction. Heap deallocation (HT-DISPOSE) requires a full standard ownership predicate for the associated heap location, thereby making sure that the deallocation does not break any bindings of active program abstractions, which would be unsound.

3.4.4. Process Ownership

Figure 5 gives the Hoare rules for introducing, eliminating and updating process-algebraic abstractions. The HT-PROC-INIT rule handles initialisation of an abstract model P with input parameter y, over a set of heap locations as specified by

Π

. This rule requires standard heap write ownership for any heap location that is to be bound by P according to

Π

, and these are converted to

{\overset{1}{↪}}_{proc}

. Moreover, HT-PROC-INIT requires that the precondition B of P holds, which is constructed from

\tilde{B}

by replacing all process variables

x_{i}

by the values

E_{i}

at the corresponding heap locations as specified by

Π

. (Here we slightly abuse notation for ease of presentation. In the proof rule, we write

\tilde{B} {[x_{i} / E_{i}]}_{\forall i \in I}

for converting a condition

\tilde{B}

to a condition over only program variables, by substituting all free process variables

x_{i}

occurring in

\tilde{B}

by a program expression

E_{i}

. However, in our Coq formalisation we of course have a special operation for such conversions.) A

{Proc}_{1}

predicate with full permission is ensured, giving the current thread full ownership of the abstraction.

The HT-PROC-UPDATE rule handles updates to program abstractions, by performing an action

a (E^{'})

in the context of an

action E a (E^{'}) do C

program. This rule imposes four preconditions on handling

action

programs. First, a predicate of the form

{Proc}_{π} (E, a (E^{'}) \cdot \tilde{P} + \tilde{Q}, Π)

is required for some

π

. In particular, the process component must be of the form

a (E^{'}) \cdot \tilde{P} + \tilde{Q}

and therewith allow performing the a action. After performing a the process will be reduced to

\tilde{P}

, and

\tilde{Q}

will be discarded as the choice is made not to follow execution as prescribed by

\tilde{Q}

. To get processes in the required format, the entailment rules in Figure 3 can be used together with the bisimulation equivalences given earlier in Figure 2. To give an example, processes of the form

a (E^{'}) \cdot \tilde{P}

can always be rewritten to

a (E^{'}) \cdot \tilde{P} + δ

to obtain the required choice. Second,

{\overset{π}{↪}}_{proc}

predicates are required for any heap location that is bound by

Π

. These points-to predicates are needed to resolve the guard and effect of a. Third, a’s guard must indeed hold. (The notation

guard a E^{'}

is a shorthand for

(guard a z) [z / E^{'}]

for some fresh

z \in ProcVar

, and likewise for

effect a E^{'}

.) And last, the remaining resource

P

must hold as well.

Among the premises of HT-PROC-UPDATE is a proof derivation for the sub-program C, in which all required

{\overset{π_{i}}{↪}}_{proc}

predicates are “upgraded” to

{\overset{π_{i}}{↪}}_{act}

and thereby regain write access when

π_{i} = 1

. However, in case

π_{i} < 1

the upgrade does not give any additional privileges, since

{\overset{π_{i}}{↪}}_{proc}

provides read-access just the same. We found that these unnecessary conversions complicate the soundness proof. To avoid unnecessary upgrades, we convert all affected

{\overset{π_{i}}{↪}}_{proc}

predicates to

{\overset{π_{i}}{↪}}_{procact}

instead, which simplifies the correctness proof. The HT-PROC-UPDATE rule ensures a process ownership predicate that holds the resulting process

\tilde{P}

after execution of a. In addition, updates to the heap are ensured that comply with the postconditions of the proof derivation of C.

HT-PROC-FINISH handles finalisation of process-algebraic models that can successfully terminate. A predicate

{Proc}_{1} (E, ε + \tilde{P}, Π)

with full permission is required, implying that no other thread can have a fragment of the abstract model. The rule converts all bound

{\overset{1}{↪}}_{proc}

ownership predicates back to

{\overset{1}{↪}}_{std}

ownerships to indicate that these are no longer subject to the abstract model.

Lastly, HT-PROC-QUERY allows “querying” for properties

\tilde{B}

that are verified on the process algebra level. Recall that the main objective of process-algebraic analysis is to verify that all reachable assertions

? (\tilde{B})

hold. Observe that in this rule, the assertions may contain program variables in addition to process-algebraic variables, as these may have been introduced via summations (≈-SUM-ALT) or input parameters (HT-PROC-INIT). The soundness argument of the logic makes sure that the process-algebraic analysis can still be done fully on the process level (i.e., without relying on program state), meaning that this rule really makes a fusion between process-algebraic reasoning and deductive reasoning.

3.5. Soundness

This section defines the semantic meaning of program judgments and discusses the soundness proof of the program logic. This soundness proof has been mechanised using Coq, which was non-trivial and required substantial auxiliary definitions. This section discusses the most important auxiliary definitions and explains their use. For further proof details we refer to the Coq development [19].

The soundness theorem relates program judgments to the operational semantics of programs and boils down to the following: if (1) a proof

Γ; R ⊢ {P} C {Q}

can be derived for any program C and (2) the contracts in

Γ

of all abstract models of C are satisfied (proven externally), then C executes safely for any number of computation steps. Execution safety in this sense also includes that C does not fault for any number of reduction steps with respect to the fault semantics of programs; see Definition 13.

Our definition of execution safety extends the well-known inductive definition of configuration safety of Vafeiadis [34] by adding machinery to handle process-algebraic abstractions. The most important extension is a simulation argument between concrete program executions (with respect to

⇝

) and the executions of all active models (with respect to

\overset{α}{⟶}

). However, as the reduction steps of these two semantics do not directly correspond one-to-one, this simulation is established via an intermediate instrumented semantics referred to as the ghost operational semantics. This intermediate semantics is defined in Section 3.5.1 in terms of ghost transitions

⇝_{ghost}

that essentially define the lock-step execution of program transitions

⇝

and the transitions

\overset{α}{⟶}

of their abstractions. Our definition of “executing safely for n execution steps” includes that all

⇝

steps can be simulated by

⇝_{ghost}

steps, and vice versa, for n execution steps. Thus, the end-result is a refinement between programs and their abstractions.

In addition to establishing such refinements, our definition of execution safety must ensure that the HT-PROC-QUERY proof rule is sound. In other words, it must allow relying on any assertions embedded in the process-algebraic models in a sound manner, as these are (assumed to be) verified externally. To account for these assertions, the definition of execution safety needs to maintain the invariant that all active program abstractions preserve their execution safety as defined in Definition 5 for n execution steps, with respect to the current state of the program. (Recall that any process P is said to be safe according to this definition if P’s assertions always hold.) The details of maintaining this invariant are discussed further in Section 3.5.3.

Finally, Section 3.5.4 formally defines process execution safety—the semantic meaning of program judgments—and presents the exact soundness statement.

3.5.1. Ghost Operational Semantics

To establish the refinements between programs and their abstractions, an intermediate semantics is used that administers the states of all active program abstractions. This intermediate semantics is referred to in the sequel as the ghost operational semantics. The ghost semantics is expressed as a transition relation

⇝_{ghost} \subseteq GhostConf \times GhostConf

between ghost configurations

G = (C, h, pm, s, g) \in GhostConf

, which extend program configurations by two extra components, namely:

A process map $pm \in ProcMap$ that is used to administer the state of all active (initialised, but not yet finalised) process-algebraic abstractions; and
An extra store $g \in Store$ , referred to as a ghost store, as it is used to map variable names to process identifiers in the context of “ghost” instructions.

The ghost operational semantics uses two stores instead of one, to keep the administration of program data and specification-only (ghost) data strictly separated. Doing so eases establishing that variables referred to in ghost code do not interfere with regular program execution, and vice versa.

Ghost reductions essentially describe the lock-step execution of concrete programs (

⇝

steps) and their abstractions (

\overset{α}{⟶}

steps). Figure 6 presents an excerpt of the transition rules. This excerpt only contains the reduction rules related to program abstraction; all other rules are essentially the same as those of

⇝

, with the two extra configuration components simply carried over and left unchanged. Recall that the blue colourings are merely visual cues and do not have any special semantical meaning.

Clarifying the ghost reduction rules, GHOST-PROC-INIT instantiates a new program abstraction and stores it in a free entry in

pm

. GHOST-PROC-FINISH finalises program abstractions that are able to terminate successfully. The rules GHOST-ACT-INIT, GHOST-ACT-STEP and GHOST-ACT-END handle the execution of action blocks. Before discussing these, first observe that the ghost semantics maintains an extra component m in

inact

m C commands, containing (ghost) metadata: extra runtime information about the process-algebraic model in whose context the program C is being executed. Concretely, ghost metadata m is defined as a quadruple

m = (a, v, v^{'}, h) \in Act \times Val \times Val \times Heap

, consisting of:

The label a of the action that is being executed;
The input argument v for this action;
The identifier $v^{'}$ of the corresponding process-algebraic model in the process map, in which the action a is to be executed; and
A copy h of the heap, made when the program started to execute the action block; that is, when the $action$ program was reduced to $inact$ by $⇝$ .

The GHOST-ACT-INIT reduction rule starts executing an

action

block by reducing it to an

inact

program, thereby assembling and attaching ghost metadata. In particular, a copy of the heap is made at this point, so that the GHOST-ACT-END rule for finalising

inact

programs is able to access the original contents of the heap. This is needed to allow the abstraction to make a matching

\overset{α}{⟶}

step, in particular to determine the pre-state of such a step. To see how this works, first recall that the process-algebraic state of program abstractions is linked to concrete program state (entries in the heap) via the

Λ

binders maintained in process maps. Therefore, to be able to make an

\overset{α}{⟶}

step, the ghost-ACT-END rule first needs to construct process-algebraic state out of the current program state. This is done using the auxiliary function

| | \cdot | | : Binder \to Heap \to ProcStore

referred to as the abstract state reification function.

Definition 31

(Abstract state reification).

\| Λ \| (h) ≜ λ x \in ProcVar . \begin{cases} h (Λ (x)) & \text{if } x \in dom (Λ) \text{ and } Λ (x) \in dom (h) \\ 𝟙_{Val} & \text{otherwise} \end{cases}
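As an informal illustration of Definition 31, reification can be sketched in Python as follows. Binders and heaps are modelled as dictionaries, and `DEFAULT_VALUE` is a hypothetical stand-in for the default value used in the "otherwise" case; none of these names come from the formalisation.

```python
from typing import Callable, Dict

Heap = Dict[int, int]    # heap locations -> values
Binder = Dict[str, int]  # process variables -> heap locations (the binder Λ)
DEFAULT_VALUE = 0        # assumption: placeholder for the default element of Val


def reify(binder: Binder, heap: Heap) -> Callable[[str], int]:
    """Return the process store obtained by reifying `heap` through `binder`."""
    def store(x: str) -> int:
        # First case: x is bound and its location is allocated in the heap.
        if x in binder and binder[x] in heap:
            return heap[binder[x]]
        # Otherwise case: fall back to the default value.
        return DEFAULT_VALUE
    return store
```

For instance, with the binder `{"x": 10}` and the heap `{10: 7}`, the reified store maps `x` to 7 and every other process variable to the default value.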

The GHOST-ACT-STEP rule allows making reductions in the context of $inact$ programs. Finally, GHOST-QUERY handles reductions of assertions and synchronises any $\overset{assn}{⟶}$ reductions on the process level with reductions of queries on the program level, with respect to the reified program state.

3.5.2. Faulting Ghost Configurations

In addition to faulting program configurations (Definition 13) we also define a fault semantics for ghost configurations. This ghost fault semantics is expressed in terms of a predicate $↯_{ghost} (G)$ over ghost configurations $G$. Figure 7 gives an excerpt of the rules. Only the rules related to specification-only constructs are shown. All other rules are essentially the same as those of Definition 13. We shall later show and prove properties that connect the two faulting semantics.

Clarifying the ghost fault semantics: the initialisation of any process-algebraic model faults if there is no free entry available in $pm$ (by $↯_{ghost}$-PROC-FULL). The finalisation of program abstractions can fault if the corresponding entry in the process map (1) is either unoccupied or invalid (by $↯_{ghost}$-PROC-FINISH-1), or (2) contains a process-algebraic abstraction that is unable to successfully terminate (by $↯_{ghost}$-PROC-FINISH-2). Reductions within action blocks $inact$ m C may fault if (1) m does not refer to an abstraction (by $↯_{ghost}$-ACT-SKIP-1), or (2) the abstraction relies on process variables that have an incorrect binding (by $↯_{ghost}$-ACT-SKIP-2), or (3) the process is not able to make a matching step (by $↯_{ghost}$-ACT-SKIP-3), or (4) the subprogram C is able to fault (by $↯_{ghost}$-ACT-SKIP). Any $query$ program can fault under similar conditions as those of action programs.

The ghost semantics enjoys the same progress property as the standard operational semantics.

Theorem 3

(Progress of $⇝_{ghost}$). For any ghost configuration $G$ for which $\neg ↯_{ghost} (G)$ holds, either $G$ is final, or there exists a $G^{'}$ such that $G ⇝_{ghost} G^{'}$.

Moreover, it is quite straightforward to establish a forward simulation between $⇝$ and $⇝_{ghost}$. A matching backward simulation is ensured by the soundness argument of the program logic, as is customary for establishing refinements [39].

Lemma 8

(Forward simulation). The standard operational semantics and the fault semantics of programs are embedded in the ghost operational semantics and ghost fault semantics, respectively:

1.: If $(C, h, pm, s, g) ⇝_{ghost} (C^{'}, h^{'}, {pm}^{'}, s^{'}, g^{'})$ , then $(C, h, s) ⇝ (C^{'}, h^{'}, s^{'})$ .
2.: If $↯ (C, h, s)$ , then also $↯_{ghost} (C, h, pm, s, g)$ , for any $pm$ and g.

The above lemma also shows that the ghost fault semantics extends ↯. The soundness argument of the program logic establishes that verified programs do not fault with respect to $⇝_{ghost}$, and thus also do not fault with respect to $⇝$ by the above lemma.

3.5.3. Preservation of Process Execution Safety

As already hinted at in the preamble of this section, establishing soundness of the program logic requires maintaining an invariant stating that all active program abstractions retain their execution safety throughout program execution, with respect to Definitions 5 and 6. Since process maps are used to administer the status of all active program abstractions, we lift the notion of process configuration safety (Definition 5) to safety of process maps. Process map safety is expressed in terms of judgments of the form $h ⊨_{pm} pm$, stating that $pm$ is safe if all process-algebraic models stored in $pm$ execute safely with respect to Definition 5 together with a process store that is constructed (reified) from h.

Definition 32

(Process map safety).

h ⊨_{pm} pm ≜ \forall v \in Val . h ⊨_{mc} pm (v)

where $h ⊨_{mc} me$ is defined by case distinction on $me$, so that

h ⊨_{mc} me ≜ \begin{cases} true & \text{if } me = free \\ ✓ (P, \| Λ \| (h)) & \text{if } me = {〈 P, Λ 〉}^{π} \text{ for some } π \\ false & \text{if } me = inv \end{cases}

Free process cells are always safe whereas corrupted entries $inv$ are never safe. Moreover, both $⊨_{pm}$ and $⊨_{mc}$ are closed under bisimilarity of process maps and their entries, respectively.
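The case distinction of Definition 32 can be sketched in Python as follows. This is only an illustration under simplifying assumptions: process maps are finite dictionaries (identifiers that are absent are treated as free), the process safety predicate ✓ is passed in as a hypothetical callable `proc_safe`, and reification only covers the variables in the binder's domain.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Union

Heap = Dict[int, int]
Binder = Dict[str, int]
Store = Dict[str, int]

FREE, INV = "free", "inv"  # the two special process-map entries


@dataclass
class Occupied:
    """An occupied entry holding a process term P and a binder Λ."""
    process: object  # the process term, left opaque in this sketch
    binder: Binder


Entry = Union[str, Occupied]
ProcSafe = Callable[[object, Store], bool]  # stand-in for the predicate ✓


def reify(binder: Binder, heap: Heap, default: int = 0) -> Store:
    """Finite reification: read every bound variable from the heap."""
    return {x: heap.get(loc, default) for x, loc in binder.items()}


def entry_safe(heap: Heap, entry: Entry, proc_safe: ProcSafe) -> bool:
    """Occupied entries defer to proc_safe; free is safe; inv is unsafe."""
    if isinstance(entry, Occupied):
        return proc_safe(entry.process, reify(entry.binder, heap))
    return entry == FREE


def map_safe(heap: Heap, pm: Dict[int, Entry], proc_safe: ProcSafe) -> bool:
    """A process map is safe if every one of its entries is safe."""
    return all(entry_safe(heap, e, proc_safe) for e in pm.values())
```

With a `proc_safe` that requires the reified value of `x` to be positive, the same process map is safe for a heap storing 5 at the bound location and unsafe for a heap storing -2 there.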

Lemma 9.

1.: If $h ⊨_{pm} pm$ and $pm ≅_{pm} {pm}^{'}$ , then $h ⊨_{pm} {pm}^{'}$ .
2.: If $h ⊨_{mc} me$ and $me ≅_{mc} {me}^{'}$ , then $h ⊨_{mc} {me}^{'}$ .

In a moment we will also define a notion of execution safety for commands. This notion of program execution safety maintains the aforementioned invariant that $h ⊨_{pm} pm$ always holds throughout program execution, where h and $pm$ are constructed from the current state, at every execution step. This invariant is needed to establish soundness of the HT-PROC-QUERY proof rule.

However, one must be careful about how exactly to state this invariant, to allow re-establishing it after every computation step. In most cases re-establishing the invariant is straightforward. For example, $h ⊨_{pm} pm$ can be re-established after initialising a new program abstraction using the HT-PROC-INIT proof rule, by Definition 6 and by the structure of that proof rule. The invariant can also trivially be re-established after finalising an abstraction using HT-PROC-FINISH, as the abstraction is then no longer active and thereby removed from $pm$. However, computation steps that involve heap writing (i.e., handling of $[E] := E^{'}$ programs) may be problematic, as illustrated below.

Technicality 1

(Potential problems due to heap writing). To see the potential problem, consider the following code snippet.

Suppose that $h ⊨_{pm} pm$ holds on line 5. After computing line 6, the heap h holds the value $-2$ at location $[[E^{'}]] s$. Moreover, the process map $pm$ has not been changed, since the action program (lines 5–8) has not fully been executed yet. Nevertheless, $h [[[E^{'}]] s \mapsto -2] ⊨_{pm} pm$ may now be violated, as the $reset$ action can no longer be performed, since $x = -2$ after reification, while $reset$’s guard requires x to be positive.

The root of the problem is that the invariant should not necessarily have to hold during intermediate reduction steps while executing $action$ programs, but only at the pre- and poststate of such programs. Program execution safety will solve this by making a snapshot of the heap every time an action program is started (as in GHOST-ACT-INIT), and expressing the invariant over these snapshot heaps. Snapshots are recorded at the level of permission heaps, which already have the required structure to do this: action heap cells ${〈 v_{1}, v_{2} 〉}_{act}^{π}$ allow storing snapshot values $v_{2}$ alongside “concrete” values $v_{1}$. These snapshot values are used to construct snapshot heaps.

Definition 33

(Snapshot heaps). The snapshot of a permission heap is defined in terms of a total function ${⌊ \cdot ⌋}_{snapshot} : PermHeap \to Heap$, so that ${⌊ ph ⌋}_{snapshot} ≜ λ v \in Val . {⌊ ph (v) ⌋}_{snapshot}$, with

{⌊ hc ⌋}_{snapshot} ≜ \begin{cases} v & \text{if } hc = {〈 v 〉}_{proc}^{π} \text{ for some } π \\ v_{2} & \text{if } hc = {〈 v_{1}, v_{2} 〉}_{act}^{π} \text{ for some } v_{1} \text{ and } π \\ \text{undefined} & \text{otherwise} \end{cases}

The snapshot heap ${⌊ ph ⌋}_{snapshot}$ of any permission heap $ph$ only contains heap cells bound by process-algebraic models, and is constructed by taking the snapshot values of all $ph$’s action heap cells. As we shall see in a moment, the final invariant maintained by program execution safety will be ${⌊ ph ⌋}_{snapshot} ⊨_{pm} pm$, where $ph$ and $pm$ are taken from the models of the program logic and represent the current state of the program. This invariant, combined with establishing a refinement between the program and its abstract models, provides sufficient means for proving soundness of the program logic.

3.5.4. Adequacy

This section defines program execution safety and uses it to define the semantic meaning of program judgments, from which the soundness theorem (i.e., adequacy of the logic) can be formulated. Program execution safety extends the well-known notion of configuration safety of [34], by adding permission accounting, process-algebraic state, and the machinery introduced earlier in this section.

First, in order to help connect the models of the program logic to concrete program state, we define a concretisation function for permission heaps in the same style as snapshot heaps.

Definition 34

(Concretisation). Concretisation of permission heaps is defined as a total function ${⌊ \cdot ⌋}_{concr} : PermHeap \to Heap$, so that ${⌊ ph ⌋}_{concr} ≜ λ v \in Val . {⌊ ph (v) ⌋}_{concr}$, with ${⌊ ph (v) ⌋}_{concr}$ defined as

{⌊ hc ⌋}_{concr} ≜ \begin{cases} v & \text{if } hc = {〈 v 〉}_{std}^{π} \text{ for some } π \\ v & \text{if } hc = {〈 v 〉}_{proc}^{π} \text{ for some } π \\ v_{1} & \text{if } hc = {〈 v_{1}, v_{2} 〉}_{act}^{π} \text{ for some } π \text{ and } v_{2} \\ \text{undefined} & \text{otherwise} \end{cases}

The heap concretisation operator constructs (program) heaps out of permission heaps by simply discarding all internal structure regarding process-algebraic models. Only the information relevant for regular program execution is retained. The snapshot operator ${⌊ \cdot ⌋}_{snapshot}$ essentially does the same, but only retains heap cells bound to program abstractions and takes snapshot values whenever possible.
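The difference between Definitions 33 and 34 can be made concrete with a small Python sketch. The class names `Std`, `Proc` and `Act` are hypothetical encodings of the three kinds of occupied heap cells, and permission heaps are finite dictionaries in which free or invalid cells are simply absent (the "undefined" case).

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Dict, Union


@dataclass(frozen=True)
class Std:   # standard cell holding a value v
    value: int
    perm: Fraction


@dataclass(frozen=True)
class Proc:  # process cell holding a value v
    value: int
    perm: Fraction


@dataclass(frozen=True)
class Act:   # action cell holding a concrete value v1 and a snapshot value v2
    concrete: int
    snapshot: int
    perm: Fraction


Cell = Union[Std, Proc, Act]
PermHeap = Dict[int, Cell]
Heap = Dict[int, int]


def concretise(ph: PermHeap) -> Heap:
    """Keep the value relevant for regular program execution of every cell."""
    heap: Heap = {}
    for loc, cell in ph.items():
        if isinstance(cell, (Std, Proc)):
            heap[loc] = cell.value
        elif isinstance(cell, Act):
            heap[loc] = cell.concrete
    return heap


def snapshot(ph: PermHeap) -> Heap:
    """Keep only process and action cells, preferring snapshot values."""
    heap: Heap = {}
    for loc, cell in ph.items():
        if isinstance(cell, Proc):
            heap[loc] = cell.value
        elif isinstance(cell, Act):
            heap[loc] = cell.snapshot
    return heap
```

An action cell with concrete value -2 and snapshot value 5 mirrors the situation of Technicality 1: the intermediate write is visible in the concretisation, whereas the snapshot heap still holds the value from the start of the action block.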

We now have all the ingredients for defining adequacy. Program execution safety is defined in terms of a predicate ${safe}_{n} (C, ph, pm, s, g, R, Q)$, stating that C is safe for n reduction steps with respect to a permission heap $ph$, process map $pm$, two stores s and g, a resource invariant $R$ and postcondition $Q$.

Definition 35

(Program execution safety). The ${safe}_{0} (C, ph, pm, s, g, R, Q)$ predicate always holds, whereas ${safe}_{n + 1} (C, ph, pm, s, g, R, Q)$ holds if and only if the following five conditions hold.

1.: If $C = skip$ , then $ph, pm, s, g ⊨ Q$ .
2.: For every ${ph}_{F}$ and ${pm}_{F}$ such that $ph ⊥_{hc} {ph}_{F}$ and $pm ⊥_{mc} {pm}_{F}$ , it holds that $\neg ↯_{ghost} (C, {⌊ ph ⊎_{ph} {ph}_{F} ⌋}_{concr}, pm ⊎_{pm} {pm}_{F}, s, g)$ .
3.: For any $v \in acc (C, s)$ it holds that $ph (v) \notin \{free, inv\}$ .
4.: For any $v \in writes (C, s)$ it holds that ${full}_{hc} (ph (v))$ .
5.: For any ${ph}_{J}$ , ${ph}_{F}$ , ${pm}_{J}$ , ${pm}_{F}$ , ${pm}_{C}$ , $h^{'}$ , $s^{'}$ , and $C^{'}$ , if:

5a.: $ph ⊥_{ph} {ph}_{J}$ and $(ph ⊎_{ph} {ph}_{J}) ⊥_{ph} {ph}_{F}$ , and
5b.: $pm ⊥_{pm} {pm}_{J}$ and $(pm ⊎_{pm} {pm}_{J}) ⊥_{pm} {pm}_{F}$ , and
5c.: $\neg locked (C)$ implies ${ph}_{J}, {pm}_{J}, s, g ⊨ R$ , and
5d.: $(pm ⊎_{pm} {pm}_{J} ⊎_{pm} {pm}_{F}) ≅_{pm} {pm}_{C}$ , and
5e.: ${⌊ ph ⊎_{ph} {ph}_{J} ⊎_{ph} {ph}_{F} ⌋}_{snapshot} ⊨_{pm} {pm}_{C}$ , and
5f.: $(C, {⌊ ph ⊎_{ph} {ph}_{J} ⊎_{ph} {ph}_{F} ⌋}_{concr}, s) ⇝ (C^{'}, h^{'}, s^{'})$ ;

then there exist ${ph}^{'}$ , ${ph}_{J}^{'}$ , ${pm}^{'}$ , ${pm}_{J}^{'}$ , ${pm}_{C}^{'}$ , and $g^{'}$ such that

5g.: ${ph}^{'} ⊥_{ph} {ph}_{J}^{'}$ and $({ph}^{'} ⊎_{ph} {ph}_{J}^{'}) ⊥_{ph} {ph}_{F}$ , and
5h.: ${pm}^{'} ⊥_{pm} {pm}_{J}^{'}$ and $({pm}^{'} ⊎_{pm} {pm}_{J}^{'}) ⊥_{pm} {pm}_{F}$ , and
5i.: ${⌊ {ph}^{'} ⊎_{ph} {ph}_{J}^{'} ⊎_{ph} {ph}_{F} ⌋}_{concr} = h^{'}$ , and
5j.: $({pm}^{'} ⊎_{pm} {pm}_{J}^{'} ⊎_{pm} {pm}_{F}) ≅_{pm} {pm}_{C}^{'}$ , and
5k.: ${⌊ {ph}^{'} ⊎_{ph} {ph}_{J}^{'} ⊎_{ph} {ph}_{F} ⌋}_{snapshot} ⊨_{pm} {pm}_{C}^{'}$ , and
5l.: $\neg locked (C^{'})$ implies ${ph}_{J}^{'}, {pm}_{J}^{'}, s^{'}, g^{'} ⊨ R$ , and
5m.: $(C, {⌊ ph ⊎_{ph} {ph}_{J} ⊎_{ph} {ph}_{F} ⌋}_{concr}, {pm}_{C}, s, g) ⇝_{ghost} (C^{'}, h^{'}, {pm}_{C}^{'}, s^{'}, g^{'})$ , and
5n.: ${safe}_{n} (C^{'}, {ph}^{'}, {pm}^{'}, s^{'}, g^{'}, R, Q)$ .

Clarifying the above definition, any configuration is safe for $n + 1$ steps intuitively if: the postcondition $Q$ is satisfied if C has terminated (1); the program C does not fault (2); C only accesses heap entries that are allocated (3); C only writes to heap locations for which full permission is available (4); and finally, after making a computation step the program remains safe for another n steps (5). (The predicate ${full}_{hc} (hc)$ is $true$ whenever $hc$ is an occupied heap cell with an associated fractional permission $π$ equal to 1.) Condition 2 implies race freedom, while conditions 3 and 4 account for memory safety.
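Conditions 3 and 4 can be read as simple checks on a permission heap. The following Python sketch illustrates this reading under simplifying assumptions: all occupied cells are represented by one hypothetical `Cell` class, free or invalid cells are absent from the dictionary, and the sets of accessed and written locations are given explicitly rather than computed from a command.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Dict, Iterable


@dataclass(frozen=True)
class Cell:
    value: int
    perm: Fraction  # fractional permission π with 0 < π <= 1


PermHeap = Dict[int, Cell]  # absent locations model free or invalid cells


def full(ph: PermHeap, loc: int) -> bool:
    """True if `loc` is occupied and carries the full permission 1."""
    cell = ph.get(loc)
    return cell is not None and cell.perm == 1


def memory_safe(ph: PermHeap, accessed: Iterable[int],
                written: Iterable[int]) -> bool:
    """Check the analogue of conditions 3 and 4 for given location sets."""
    accesses_ok = all(loc in ph for loc in accessed)    # condition 3
    writes_ok = all(full(ph, loc) for loc in written)   # condition 4
    return accesses_ok and writes_ok
```

A location held with permission 1/2 may thus be accessed but not written, which is the usual reading of fractional permissions.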

Condition 5 is particularly involved. In particular it encodes the backward simulation: if the program can do a $⇝$ step (5f), then it must be able to make a matching $⇝_{ghost}$ step (by 5m). Moreover, the resource invariant $R$ must remain satisfied (due to 5c and 5l) after making a computation step, whenever the current program is not locked. In addition, the process maps invariably remain safe with respect to the snapshot heap due to 5e and 5k, as discussed in Section 3.5.3.

Lemma 10.

Program execution safety satisfies the following properties.

1.: If ${safe}_{n} (C, ph, pm, s, g, R, Q)$ and $m \leq n$ , then ${safe}_{m} (C, ph, pm, s, g, R, Q)$ .
2.: If ${safe}_{n} (C, ph, {pm}_{1}, s, g, R, Q)$ and ${pm}_{1} ≅_{pm} {pm}_{2}$ , then ${safe}_{n} (C, ph, {pm}_{2}, s, g, R, Q)$ .
3.: If ${safe}_{n} (C, ph, pm, s, g, R, Q)$ and $Q ⊨ Q^{'}$ , then ${safe}_{n} (C, ph, pm, s, g, R, Q^{'})$ .

Item 1 of Lemma 10 states monotonicity, in the sense that being safe for n reduction steps implies safety for any smaller number of steps. Item 2 states that process maps can always be replaced by bisimilar ones in safe configurations. Finally, item 3 states that postconditions may always be weakened.

3.5.5. Semantics of Program Judgments

The semantics of program judgments is defined in terms of a quintuple $Γ; R ⊨ \{P\} C \{Q\}$, expressing that C is safe for any number of reduction steps starting from any state satisfying $P$.

Definition 36

(Semantics of program judgments). $Γ; R ⊨ \{P\} C \{Q\}$ holds if and only if:

(1): $user (C)$ , and
(2): If $⊨_{env} Γ$ and $wf (C)$ , then for any $ph, pm, s, g$ such that ${valid}_{ph} (ph)$ and ${valid}_{pm} (pm)$ and ${⌊ ph ⌋}_{snapshot} ⊨_{pm} pm$ and $ph, pm, s, g ⊨ P$ hold, it holds that:

$\forall n . {safe}_{n} (C, ph, pm, s, g, R, Q)$

The underlying idea of the above definition, i.e., having a continuation-passing style definition for program judgments, was first applied in [40] and was further generalised in [41,42]. Moreover, the idea of defining execution safety in terms of an inductive predicate originates from [43]. These two concepts were reconciled in [34] into a formalisation of the classical CSL of Brookes [23], which has been encoded and mechanically proven in both Isabelle and Coq. Our definition builds on the latter, by having a refinement between programs and abstractions encoded in $safe$.

Observe that only judgments of user programs (i.e., commands free of runtime constructs like $inatom$ and $inact$) have a semantic meaning. Also observe that the semantics of program judgments is conditional on the safety of $Γ$. It states that C executes safely for any number n of computation steps with respect to any state satisfying $P$, only if $Γ$ is safe, that is, only if all process-algebraic models for C are (assumed to be externally) verified. From the above definition it trivially follows that $Γ; R ⊨ \{false\} C \{P\}$ for any $Γ$, $R$, $P$ and user program C. Notice however that $Γ; R ⊨ \{P\} C \{true\}$ does not hold in general, since C might be able to fault, for example by having data races.

The following main soundness theorem states that verified programs (i.e., programs for which a proof can be derived according to the proof rules given earlier in Figure 4 and Figure 5) are semantically valid (that is, are fault-free, memory-safe, and refine their process-algebraic models).

Theorem 4

(Soundness). $Γ; R ⊢ \{P\} C \{Q\} \Rightarrow Γ; R ⊨ \{P\} C \{Q\}$.

The soundness proofs of all proof rules have been mechanised using the Coq proof assistant and can be found on the Git repository accompanying this article [19].

The HT-PROC-UPDATE and HT-PROC-QUERY proof rules were the most difficult to prove sound, as their proofs require, among other things, (1) showing that the abstract model can always match the program with a simulating execution step, as well as (2) maintaining the invariant that any process-algebraic abstraction inside the process map is safe with respect to the reified program state. On top of that, the combination of (1) and (2) requires some extra bookkeeping to ensure that the snapshot heaps stored in ghost metadata agree with the snapshot values stored in permission heaps. This additional bookkeeping has been left out of the formalisation presented so far, but the details of this can be studied in the Coq formalisation.

4. Implementation

The presented verification approach has been implemented in the VerCors concurrency verifier, which specialises in automated verification of parallel and concurrent programs written in high-level languages like (subsets of) Java and C [8]. VerCors can reason about programs with heterogeneous concurrency features as in Java, homogeneous concurrency as in OpenCL, and compiler directives as in OpenMP. VerCors allows specifying (concurrent) programs with annotations from a separation logic with permission accounting. VerCors supports reasoning about data-race freedom, memory safety and functional program behaviour, i.e., compliance with the program annotations.

4.1. Tool Support

Tool support for our technique has been implemented in VerCors for languages with fork/join concurrency and statically-scoped parallel constructs [14]. Our technique has been implemented by defining an axiomatic domain for process types in Viper, consisting of constructors for the process-algebraic connectives and standard process-algebraic axioms to support these. The three different ownership types ${\overset{π}{↪}}_{t}$ are encoded in Viper by defining extra fields that maintain the ownership status t for each global reference. The ${Proc}_{π}$ assertions are encoded as predicates over process types.

Note however that VerCors does not yet support writing and reasoning about assertional processes $? (\cdot)$. Instead, the properties to verify on a process-algebraic level are specified as postconditions of the models. That is, VerCors currently only allows reasoning about postcondition properties of process-algebraic models, while the formalisation as presented in this article allows reasoning about properties also at intermediate points of process execution. But since the formalisation is more general than the VerCors implementation (as was already indicated in Section 3.4.2), soundness is retained.

Recall that process-algebraic abstractions are to be verified externally with our approach, for example using an interactive theorem prover or a model checker like mCRL2. Nevertheless, VerCors itself also has capabilities to reason about process-algebraic models. This is done by first linearising all specified processes, then encoding these linearised processes together with their contracts into the Viper language, and delegating further reasoning to Viper. Any process is said to be linear if it does not use the ‖ and ⌊⌊ (left-merge) connectives. Linearisation is a mechanical (automated) procedure based on a rewrite system that uses a subset of the bisimulation equivalences of Figure 2 as rewrite rules [44] (but in one direction only), to try to eliminate parallel connectives. For example, a process term

$(a_{1} \cdot a_{2}) ‖ a_{3}$ can automatically be linearised to the bisimilar process $a_{1} \cdot a_{2} \cdot a_{3} + a_{1} \cdot a_{3} \cdot a_{2} + a_{3} \cdot a_{1} \cdot a_{2}$. Note that linearisation may not always succeed. VerCors outputs a verification error in case linearisation fails.
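For the special case of two purely sequential processes, the result of linearisation coincides with the sum of all interleavings of their actions. The Python sketch below computes this sum by brute force. It is not the rewrite system used by VerCors, which also handles the other process-algebraic connectives; the function names are hypothetical.

```python
from typing import List, Tuple

Trace = Tuple[str, ...]  # a sequential process, given as its sequence of actions


def interleavings(left: Trace, right: Trace) -> List[Trace]:
    """Enumerate all interleavings of two action sequences."""
    if not left:
        return [right]
    if not right:
        return [left]
    # Either the left process moves first, or the right process does.
    left_first = [(left[0],) + rest for rest in interleavings(left[1:], right)]
    right_first = [(right[0],) + rest for rest in interleavings(left, right[1:])]
    return left_first + right_first


def linearise(left: Trace, right: Trace) -> str:
    """Render the parallel composition of two sequences as a sum of sequences."""
    return " + ".join(" · ".join(trace) for trace in interleavings(left, right))
```

Calling `linearise(("a1", "a2"), ("a3",))` yields the three summands of the example above, in the same order.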

Process-algebraic models may also algorithmically be analysed, for example using a model checker. We are currently actively investigating the use of the mCRL2 toolset [21] and the Ivy verifier [45] to reason about (different forms of) processes, and in different use cases; see for example [17], in which we use process-algebraic abstractions to reason about distributed message passing programs.

Finally, we would like to remark that the VerCors implementation of the abstraction technique is much richer than the simple language of Section 3.2 that is used to formalise the approach. For example, the abstraction language in VerCors supports general recursion instead of Kleene iteration. VerCors also has support for several axiomatic data types that enrich the expressivity of reasoning with program abstractions, like (multi)sets, sequences and option types.

4.2. Coq Formalisation

The formalisation and soundness proof (Section 3.1, Section 3.2, Section 3.3, Section 3.4 and Section 3.5) of the program logic have been fully mechanised using Coq, as a deep embedding inspired by [34]. The overall implementation comprises over 23,000 lines of code. The Coq development and its documentation are available online [19].

5. Case Study

Finally, we demonstrate our verification approach on a well-known version of the leader election protocol [46] that is based on shared memory. Most importantly, this case study shows how our approach bridges the typical abstraction gap between process algebraic models and program implementations. In particular, it shows how a high-level process algebraic model of a leader election protocol, together with a contract for this model (checked with mCRL2 for various inputs), is formally connected to an actual program implementation of the protocol, using VerCors.

The protocol is performed by N concurrent workers that are organised in a ring, so that worker i only sends to worker $i + 1$ and only receives from worker $i - 1$, modulo N. The goal is to determine a leader among these workers. To find a leader, the election procedure assumes that each worker i receives a unique integer value to start with, and then operates in N rounds. In every round, (1) each worker sends the highest value it encountered so far to its right neighbour, (2) receives a value from its left neighbour, and (3) remembers the highest of the two. The result after N rounds is that all workers know the highest unique value in the network, allowing its original owner to announce itself as leader.
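The round structure described above can be simulated sequentially. The following Python sketch is only an illustration: it assumes that all workers proceed in lock-step rounds, whereas the verified implementation is concurrent, and the function name `elect` is chosen here for convenience.

```python
from typing import List


def elect(values: List[int]) -> int:
    """Simulate N synchronous rounds on a ring and return the leader's rank."""
    n = len(values)
    if n == 0 or len(set(values)) != n:
        raise ValueError("initial values must be non-empty and unique")
    highest = list(values)  # highest[i] is the highest value seen by worker i
    for _ in range(n):
        # (1) and (2): worker i receives what its left neighbour i - 1 sent.
        received = [highest[(i - 1) % n] for i in range(n)]
        # (3): every worker remembers the highest of the two values.
        highest = [max(highest[i], received[i]) for i in range(n)]
    # Only the original owner of the overall highest value recognises it.
    leaders = [i for i in range(n) if highest[i] == values[i]]
    assert len(leaders) == 1
    return leaders[0]
```

For the initial values `[3, 9, 4]`, every worker ends up knowing 9, and the worker with rank 1 announces itself as leader.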

The case study has been verified with VerCors using the presented approach. All workers communicate via two standard non-blocking operations for message passing: mp_send r, $msg$ for sending a message $msg$ to the worker with rank r, and $msg$ := mp_recv r for receiving a message from worker r. (The identifiers of workers are typically called ranks in message passing terminology. Ranks are simply natural numbers.) The election protocol is implemented on top of this message passing system.

The main challenge of this case study is to define a message passing system on the process algebra level that matches this implementation. To design such a system we follow the ideas of [46] by defining two actions, $send (r, msg)$ and $recv (r, msg)$, that abstractly describe the behaviour of the concrete implementations in mp_send and mp_recv, respectively. Process algebraic summation $Σ_{x} \tilde{P}$ is used to quantify over the possible messages that mp_recv might receive.

The following two rules illustrate how the abstract send and recv actions are connected to the mp_send and mp_recv procedures in the program, respectively. The latter rule uses a summation (shorthand) of the form $Σ_{x \in Msg} \tilde{P}$ that can be considered equivalent to the process $Σ_{x} (x \in Msg : \tilde{P})$.

\begin{matrix} \{send (r, msg) \cdot \tilde{P}\} & mp\_send\ r, msg & \{\tilde{P}\} \\ \{Σ_{x \in Msg} recv (r, x) \cdot \tilde{P}\} & msg := mp\_recv\ r & \{\tilde{P} [x / msg]\} \end{matrix}

Finally, we construct a process-algebraic model of the election protocol using send and recv, and verify that the implementation adheres to this model. This model has been analysed with mCRL2 for various inputs (since mCRL2 is essentially finite-state) to establish the global property of announcing the correct leader. The deductive proof of the program can then rely on this property.

5.1. Behavioural Specification

The main goal is proving that the implementation determines the correct leader upon termination. To prove this we first define a behavioural specification of the election protocol that hides all irrelevant implementation details, and prove the correctness property on this specification. Process algebra provides a proper abstraction language that suits our needs well, as the behaviour of leader election can concisely be specified in terms of sequences of sends and receives.

Figure 8 presents the process algebraic specification. In particular, ParElect specifies the global behaviour of the program whereas Elect specifies its thread-local behaviour. The ParElect process encodes the parallel composition of all eligible participants. ParElect takes a sequence $vs$ of initial values as argument, whose length equals the total number of workers by its precondition. ParElect’s postcondition (i.e., trailing assertion) states that $lead$ must be a valid rank after termination and that $vs [lead]$ is the highest initial worker value. It follows that worker $lead$ is the correctly chosen leader.

The Elect process takes four arguments, which are: the rank of the worker, the initial unique value $v_{0}$ of that worker, the current highest value v encountered by that worker, and finally the number n of remaining rounds. The rounds are implemented via general recursion. In each round all workers send their current highest value v to their right neighbour (on line 24), receive a value $v^{'}$ from their left neighbour (line 25), and continue with the highest of the two. The announce action is declared and used to announce the leader after n rounds. The effect of announce is that $lead$ stores the leader’s rank.

The contracts of send and recv describe the behaviour of standard non-blocking message passing. Communication on the specification level is implemented via message queues. Message queues are defined as sequences of messages taken from a domain $Msg$. Since workers are organised in a ring it suffices to have only a single queue for every worker, meaning that the global communication channel architecture can be defined as a sequence of message queues: $chan$ in the figure. The action contract of $send (r, msg)$ expresses enqueuing $msg$ onto the message queue $chan [r]$ of the worker with rank r. The effect of send is that $msg$ has been enqueued onto $chan [r]$ and that the queues $chan [r^{'}]$ for any $r^{'} \neq r$ have not been altered. Likewise, $recv (r, msg)$’s contract expresses dequeuing $msg$ from $chan [r]$. The expression $\old (e)$ indicates that e is to be evaluated with respect to the pre-state of computation.
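The effect of these two action contracts on the channel structure can be sketched in Python as follows. The sketch makes several assumptions that are not taken from Figure 8: messages are integers, channels are mutated in place, the pre-state referred to by the old-expression is modelled by an explicit copy, and receiving from an empty queue raises an exception.

```python
from typing import List

Channels = List[List[int]]  # chan: one message queue per worker rank


def send(chan: Channels, r: int, msg: int) -> None:
    """Enqueue `msg` onto chan[r] and check the stated effect."""
    old = [list(queue) for queue in chan]  # pre-state copy of chan
    chan[r].append(msg)
    # Effect: msg has been enqueued onto chan[r] ...
    assert chan[r] == old[r] + [msg]
    # ... and all other queues are unaltered.
    assert all(chan[i] == old[i] for i in range(len(chan)) if i != r)


def recv(chan: Channels, r: int) -> int:
    """Dequeue and return the first message of chan[r]."""
    if not chan[r]:
        raise LookupError("no message available")  # assumption of this sketch
    old = [list(queue) for queue in chan]
    msg = chan[r].pop(0)
    # Effect: msg was the head of the old queue; other queues are unaltered.
    assert old[r] == [msg] + chan[r]
    assert all(chan[i] == old[i] for i in range(len(chan)) if i != r)
    return msg
```

Sending 42 and then 7 to rank 1 in a three-worker ring leaves `[[], [42, 7], []]`, after which a receive on rank 1 returns 42.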

5.2. Protocol Implementation

Figure 9 presents the annotated implementation of the election protocol. (It should be noted here that the presentation is slightly different from the version that is verified with VerCors, to better connect the theory discussed in the earlier sections to the case study. This is because VerCors uses Implicit Dynamic Frames [47] as its underlying logical framework, which is equivalent to separation logic [48] but handles ownership slightly differently. The details of this are deferred to [8,49].) The elect method contains the code that is executed by every worker. The contract of elect $rank, v_{0}, v$ states that the method body adheres to the behavioural specification $Elect (rank, v_{0}, v, N)$ of the election protocol. Each worker performing elect enters a for-loop that iterates N times, whose loop invariant states that, at iteration i, the remaining program behaves as prescribed by the process $Elect (rank, v_{0}, v, i - 1)$. The invocations to mp_send and mp_recv on lines 32 and 36 are annotated with $with$ clauses that resolve the assignments required by the $given$ clauses in the contracts of mp_send and mp_recv. The $given\ \bar{η}$ annotation expresses that the parameters in the list $\bar{η}$ are extra ghost arguments for the sake of specification. Stated differently, $\bar{η}$ is a sequence of logical variables which are universally quantified at the (outer) level of the method contract (the types of these are left implicit for ease of presentation). After N rounds all workers with $v = v_{0}$ announce themselves as leader. However, since the initial values are chosen to be unique there can only be one such worker. Finally, we can verify that at the post-state of elect the abstract model has been fully executed and thus reduced to $ε$ in the logic.

The mp_send $rank, msg$ method implements the operation of enqueuing $msg$ onto the message queue of worker $rank$. Its implementation has been omitted for brevity. The contract of mp_send expresses that the enqueuing operation is abstracted as a $send (rank, msg)$ action that is prescribed by an abstract model identified by X. The mp_recv $X, rank$ method implements the operation of dequeuing and returns the first message of the message queue of worker $rank$. The receive is prescribed as an abstract $recv$ action, where the received message is ranged over by the summation on line 17.

Figure 10 presents bootstrapping code for the implementation of message passing. The main procedure initialises the communication channels whereas parelect spawns all workers. main $vs$ additionally initialises and finalises the abstraction $ParElect (vs)$ on the specification level (on lines 76 and 83, resp.) whose analysis allows establishing main’s postconditions. The procedure parelect $vs$ implements the abstract model $ParElect (vs)$ by spawning N workers that all execute the elect program. The contract associated to the parallel block (lines 55–59) is called an iteration contract and assigns pre- and postconditions to every parallel instance. For more details on iteration contracts we refer to [50]. Most importantly, the iteration contract of each parallel worker states (on line 58) that it behaves as specified by Elect. Thus, we deductively verify in a thread-modular way that the program implements its behavioural specification. Observe that all the required ownership for the global fields and the ${Proc}_{1}$ predicate is split and distributed among the individual workers via the iteration contract and the $with$ clause on lines 62–64. Finally, the main correctness property is conveyed from the process level to the program level by the $query\ X$ annotation on line 82, which “queries” for ParElect’s postconditions.

5.3. Specification and Verification Details

The VerCors encoding of the presented leader election protocol comprises 433 lines of code, of which 275 are specification annotations (63.5% of the total) and 27 lines are comments (6.2% of the total). Out of the 275 lines of specification code, 62 are used for specifying the process-algebraic model (22.5%), and the remaining 213 lines (77.5%) for formally linking this model to the program code.

The average verification time with VerCors is 19.75 s (average of 30 runs), measured on a MacBook with a 2.9 GHz Intel Core i5 CPU and 8 GB of memory. All verification files are available online [19].

5.4. Industrial Applicability

Apart from the presented leader election case study, our approach has been applied in a larger, industrial case study covering the formal verification of a traffic tunnel emergency control system [15]. In this case study, we successfully verified a safety-critical component of an emergency control system of an actual traffic tunnel that is currently in use in the Netherlands. This particular software component is responsible for handling any emergency situations that occur inside the traffic tunnel. For example, whenever a fire breaks out inside the tunnel or an accident occurs, it must start an emergency procedure to evacuate all people and turn on the emergency lights to help guide them out; control the fans to blow away any smoke; et cetera. Naturally the reliability demands imposed on such a software component are very high. Our research goal was to see if formal verification could help.

Our approach for this case study was to use mCRL2 to construct a formal, process-algebraic model of the software design, which was written informally as a state machine together with pseudo-code descriptions of the different system behaviours. We then analysed the state space of this model and checked whether it satisfies desirable properties, which we composed together with the company that wrote the actual code. Ultimately we found problematic behaviour: the system could, due to an unlucky combination of timing and events, reach a calamity state in which the emergency procedure is not started. However, the software company already knew of this problematic behaviour and deliberately provided us with an older version of their software. Nevertheless, we demonstrated that formal methods can indeed help to improve the quality of real-world industrial software.
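
The kind of state-space analysis described above can be sketched as an explicit reachability search. The example is a toy and rests entirely on assumptions: the states, actions and the `find_trace` helper are hypothetical, it does not use mCRL2, and it is not the model of the actual tunnel system. It only shows how exploring all interleavings of events can expose a state in which a calamity occurs but the emergency procedure is not started.

```python
from collections import deque

# Hypothetical labelled transition system: state -> [(action, successor)].
# The flaw is that a "fire" event arriving while the controller is still
# resetting is dropped, so the emergency procedure never starts.
TRANSITIONS = {
    "normal": [("fire", "calamity_detected")],
    "calamity_detected": [("start_procedure", "evacuating")],
    "evacuating": [("resolve", "resetting")],
    "resetting": [("done", "normal"), ("fire", "calamity_missed")],
    "calamity_missed": [],
}


def find_trace(initial, is_bad):
    """Breadth-first search for a shortest action trace into a bad state.

    Returns the list of actions leading to the first bad state found,
    or None if no bad state is reachable from the initial state.
    """
    queue = deque([(initial, [])])
    visited = {initial}
    while queue:
        state, trace = queue.popleft()
        if is_bad(state):
            return trace
        for action, successor in TRANSITIONS.get(state, []):
            if successor not in visited:
                visited.add(successor)
                queue.append((successor, trace + [action]))
    return None


counterexample = find_trace("normal", lambda s: s == "calamity_missed")
print(counterexample)
```

A returned trace plays the role of a counterexample that can be discussed with the developers; a result of `None` means the undesired state is unreachable in the model.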

In addition to modelling and analysing the software design with mCRL2, we also used VerCors to prove that the actual code implementation is soundly abstracted by the process-algebraic model, using the techniques presented in this article. We did this to increase the value of our formal model, and to ensure that its analysis is meaningful. Moreover, our verification also (indirectly) proved that the code implementation adheres to the pseudo-code specifications in the original software design.

Overall this case study highlights how the presented approach is applicable to real-world projects. We were able to identify potential vulnerabilities in the software design, and could link (a formal model of) the software design to the actual code. We are currently involved in a follow-up project with the same company, and aim to apply our technique during the software development process.

6. Related Work

Significant progress has been made on the theory of concurrent software verification over the last years [1,2,3,4,5,51,52,53]. This line of research proposes advanced program logics that all provide some notion of expressing and restricting thread interference of various complexity, via protocols [54]—formal descriptions of how shared program state is allowed to evolve over time. In our approach protocols have the form of processes.

The original work on CSL [24] allows specifying simple thread interference in shared-memory concurrent programs via resource invariants and critical regions. Later, RGSep [55] merged CSL with rely-guarantee reasoning to enable describing more fine-grained inter-thread interference by identifying atomic concurrent actions. Many modern program logics build on these principles and propose even more advanced ways of verifying shared-memory concurrency. For example, TaDA [5] and CaReSL [3] express thread interference protocols through state-transition systems. iCAP [51] and Iris [56] propose a more unified approach by accepting user-defined monoids to express protocols on shared state, together with invariants restricting these protocols. Iris provides reasoning support for proving language properties in Coq, whereas our focus is on proving (concrete) programs correct.

In the distributed setting, Disel [6] allows specifying protocols for distributed systems. Disel builds on dependent type theory and is implemented as a shallow embedding in Coq. Even though their approach is more expressive than ours, it has to be used in the context of Coq and thus can be applied only semi-automatically at the moment. Villard et al. [57] present a program logic for message passing concurrency, where threads may communicate over channels using native send/receive primitives. This program logic allows specifying protocols via contracts, which are state machines in the style of Session Types [58] that describe channel behaviour. Our technique is more general, however, as the approach of Villard et al. is tailored specifically to reason about basic shared-memory message passing (Section 5, for example, demonstrates how a system of message passing can be realised using process-algebraic abstractions). Actor Services [59] is a program logic with assertions to express the consequences of asynchronous message transfers between actors, which are independent program units that communicate via message passing. The meta-theory of Actor Services has not been proven sound.

Most of the related work discussed so far is essentially theoretical and tends to focus primarily on expressivity, contributing approaches that are expressive yet not necessarily easy to implement in SMT-based program verifiers such as VerCors or Viper. In fact, for most of these approaches it is very challenging to implement such automated tool support. Instead, they have to be applied in pen-and-paper style, or in the context of an interactive theorem prover like Coq or Isabelle. Our abstraction approach is different, in the sense that its aim is not to maximise expressivity (for example by integrating into higher-order separation logics such as [60]). Instead, we aim for a verification approach that balances expressivity and usability: an approach that is expressive enough to reason about real-world concurrent programs that follow some protocol, while being implementable in automated code verifiers; in this case, VerCors and Viper.

Related concurrency verifiers are SmallfootRG [61], VeriFast [7], CIVL [62], THREADER [63] and Viper [9,10]; the latter tool is used as the main back-end of VerCors. SmallfootRG is a memory-safety verifier based on RGSep. VeriFast is a rich toolset for verifying (multi-threaded) Java and C programs using separation logic. The CIVL framework can reason about race-freedom and functional correctness of MPI programs written in C [64,65]. The reasoning is done via bounded model checking combined with symbolic execution. THREADER is an automated verifier for multi-threaded C, based on model checking and counterexample-guided abstraction refinement.

One approach that is particularly noteworthy is the one of Penninckx et al. [66], who propose a logic to specify and verify input/output (I/O) behaviour of (sequential) programs. The logic has been implemented in VeriFast. In this approach the I/O behaviour of programs is specified essentially as a Petri Net. Its assertion language has constructs to specify I/O permission tokens, and its proof system has inference rules that allow reducing a Petri Net specification alongside the structure of the program, similar to our approach. However, their specification/verification strategy is to make predictions about the behaviour of the environment (which may or may not turn out true), by specifying assumptions on what the environment will input given a particular I/O operation and output, which is in contrast to our approach. In fact, with our approach one could analyse and use process-algebraic models together with an extra process that models an environment, to achieve stronger reasoning capabilities.

Apart from the proposed technique, VerCors also allows using process-algebraic abstractions as histories [67,68]. Also related in this respect are the time-stamped histories of [69], which record atomic state changes in concurrent programs as a history and which are, as in our approach, handled as resources in the logic. However, history recording is only suitable for terminating programs.

There is also related earlier work on using process-algebraic abstractions to reason about message passing distributed programs [17]. This work introduces the assertional processes ?(·) as well as process-algebraic summation, as they are used in this article. In fact, this article merges the core ideas of [16,17] into a single logical framework, to make the original work of [16] more general.

Finally, there is a lot of general work on proving linearisability [70,71,72], which essentially allows reasoning about fine-grained concurrency by using sequential verification techniques. Our technique, as well as the history-based technique of [67], uses process-algebraic linearisation to do so.

7. Conclusions

To reason effectively about realistic concurrent and distributed software, we have presented a verification technique that performs the reasoning at a suitable level of abstraction that hides irrelevant implementation details, is scalable to realistic programs by being modular and compositional, and is practical by being supported by automated tools. The approach is expressive enough to allow reasoning about realistic software as is demonstrated by the case study as well as by [15], and can be implemented as part of an automated deductive SMT-based program verifier, viz. VerCors. The proof system underlying our technique has mechanically been proven sound using Coq. Our technique is therefore supported by a strong combination of theoretical justification and practical usability.

This article extends [16], which we considered to be the beginning of a comprehensive verification framework that aims to capture many different concurrent and distributed programming paradigms. This extended version makes a step forward towards such a framework, by unifying the core ideas of [16,17,18], thereby generalising the original framework, most importantly to include assertional processes.

We are currently further investigating the use of mCRL2 to reason algorithmically about program abstractions, e.g., [73]. We are looking in particular into distributed programs that communicate over channels, since process algebras have been used extensively to model such programs (see for example the work and uses of the π-calculus). We are also (somewhat more passively) looking into whether Ivy would be a suitable tool to reason about our process specifications, possibly with mild adaptations. Moreover, we are planning to investigate the preservation of liveness properties in addition to safety.

Author Contributions

Investigation, W.O., D.G. and M.H. All authors have read and agreed to the published version of the manuscript.

Funding

The third author is supported by the NWO VICI 639.023.710 Mercedes project.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Feng, X.; Ferreira, R.; Shao, Z. On the Relationship Between Concurrent Separation Logic and Assume-Guarantee Reasoning. In Proceedings of the European Symposium on Programming (ESOP), Braga, Portugal, 24 March–1 April 2007; De Nicola, R., Ed.; Springer: Berlin, Germany, 2007; pp. 173–188.
2. Dinsdale-Young, T.; Dodds, M.; Gardner, P.; Parkinson, M.; Vafeiadis, V. Concurrent Abstract Predicates. In Proceedings of the European Conference on Object-Oriented Programming (ECOOP), Maribor, Slovenia, 21–25 June 2010; LNCS; D’Hondt, T., Ed.; Springer: Berlin/Heidelberg, Germany, 2010; Volume 6183, pp. 504–528.
3. Turon, A.; Dreyer, D.; Birkedal, L. Unifying Refinement and Hoare-style Reasoning in a Logic for Higher-Order Concurrency. In Proceedings of the 2013 International Conference on Functional Programming (ICFP), Boston, MA, USA, 25–27 September 2013; pp. 377–390.
4. Nanevski, A.; Ley-Wild, R.; Sergey, I.; Delbianco, G. Communicating State Transition Systems for Fine-Grained Concurrent Resources. In Proceedings of the European Symposium on Programming (ESOP), Grenoble, France, 5–13 April 2014; Shao, Z., Ed.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 290–310.
5. Rocha Pinto, P.d.; Dinsdale-Young, T.; Gardner, P. TaDA: A Logic for Time and Data Abstraction. In Proceedings of the European Conference on Object-Oriented Programming (ECOOP), Uppsala, Sweden, 28 July–1 August 2014; LNCS; Jones, R., Ed.; Springer: Berlin/Heidelberg, Germany, 2014; pp. 207–231.
6. Sergey, I.; Wilcox, J.; Tatlock, Z. Programming and Proving with Distributed Protocols. Princ. Programm. Lang. 2017, 2, 1–30.
7. Jacobs, B.; Smans, J.; Philippaerts, P.; Vogels, F.; Penninckx, W.; Piessens, F. VeriFast: A powerful, sound, predictable, fast verifier for C and Java. In Proceedings of the NASA Formal Methods (NFM), Pasadena, CA, USA, 18–20 April 2011; Bobaru, M., Havelund, K., Holzmann, G., Joshi, R., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 41–55.
8. Blom, S.; Darabi, S.; Huisman, M.; Oortwijn, W. The VerCors Tool Set: Verification of Parallel and Concurrent Software. In Proceedings of the International Conference on Integrated Formal Methods (iFM), Torino, Italy, 20–22 September 2017; LNCS; Polikarpova, N., Schneider, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2017; Volume 10510, pp. 102–110.
9. Juhasz, U.; Kassios, I.; Müller, P.; Novacek, M.; Schwerhoff, M.; Summers, A. Viper: A Verification Infrastructure for Permission-Based Reasoning. Technical Report; ETH Zurich: Zurich, Switzerland, 2014.
10. Müller, P.; Schwerhoff, M.; Summers, A. Viper: A Verification Infrastructure for Permission-Based Reasoning. In Proceedings of the Verification, Model Checking, and Abstract Interpretation (VMCAI), St. Petersburg, FL, USA, 17–19 January 2016; Jobstmann, B., Leino, K., Eds.; Springer: Berlin/Heidelberg, Germany, 2016; pp. 41–62.
11. INRIA—The Coq Webpage. Available online: https://coq.inria.fr (accessed on 4 June 2020).
12. Bertot, Y.; Castéran, P. Interactive Theorem Proving and Program Development: Coq’Art the Calculus of Inductive Constructions, 1st ed.; Springer: Berlin/Heidelberg, Germany, 2010.
13. Nipkow, T.; Wenzel, M.; Paulson, L. Isabelle/HOL: A Proof Assistant for Higher-order Logic; Springer: Berlin, Germany, 2002.
14. Oortwijn, W.; Blom, S.; Gurov, D.; Huisman, M.; Zaharieva-Stojanovski, M. An Abstraction Technique for Describing Concurrent Program Behaviour. In Proceedings of the Verified Software: Theories, Tools, and Experiments (VSTTE), Heidelberg, Germany, 22–23 July 2017; LNCS; Paskevich, A., Wies, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2017; Volume 10712, pp. 191–209.
15. Oortwijn, W.; Huisman, M. Formal Verification of an Industrial Safety-Critical Traffic Tunnel Control System. In Proceedings of the Integrated Formal Methods (iFM), Bergen, Norway, 2–6 December 2019; LNCS; Ahrendt, W., Tapia Tarifa, S.L., Eds.; Springer: Berlin, Germany, 2019.
16. Oortwijn, W.; Gurov, D.; Huisman, M. Practical Abstractions for Automated Verification of Shared-Memory Concurrency. In Proceedings of the Verification, Model Checking, and Abstract Interpretation (VMCAI), New Orleans, LA, USA, 19–21 January 2020; Beyer, D., Zufferey, D., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 401–425.
17. Oortwijn, W.; Huisman, M. Practical Abstractions for Automated Verification of Message Passing Concurrency. In Proceedings of the Integrated Formal Methods (iFM), Bergen, Norway, 2–6 December 2019; LNCS; Ahrendt, W., Tapia Tarifa, S.L., Eds.; Springer: Berlin, Germany, 2019. To appear.
18. Oortwijn, W. Deductive Techniques for Model-Based Concurrency Verification. Ph.D. Thesis, University of Twente, Enschede, The Netherlands, 2019.
19. Supplementary Material for this Article. Available online: https://vercors.ewi.utwente.nl/csl-abstractions/ (accessed on 4 June 2020).
20. Aldini, A.; Bernardo, M.; Corradini, F. A Process Algebraic Approach to Software Architecture Design; Springer Science & Business Media: Berlin, Germany, 2010.
21. Groote, J.; Mousavi, M. Modeling and Analysis of Communicating Systems; MIT Press: Cambridge, MA, USA, 2014.
22. Lamport, L. Proving the Correctness of Multiprocess Programs. IEEE Trans. Softw. Eng. 1977, SE-3, 125–143.
23. Brookes, S. A Semantics for Concurrent Separation Logic. Theor. Comput. Sci. 2007, 375, 227–270.
24. O’Hearn, P. Resources, Concurrency and Local Reasoning. Theor. Comput. Sci. 2007, 375, 271–307.
25. Bunte, O.; Groote, J.; Keiren, J.; Laveaux, M.; Neele, T.; Vink, E.d.; Wesselink, W.; Wijs, A.; Willemse, T. The mCRL2 Toolset for Analysing Concurrent Systems. In Proceedings of the Tools and Algorithms for the Construction and Analysis of Systems (TACAS), Prague, Czech Republic, 6–11 April 2019; Vojnar, T., Zhang, L., Eds.; Springer: Berlin, Germany, 2019; pp. 21–39.
26. Owicki, S.; Gries, D. An Axiomatic Proof Technique for Parallel Programs. Acta Inform. 1975, 6, 319–340.
27. Rocha Pinto, P.D.; Dinsdale-Young, T.; Gardner, P. Steps in Modular Specifications for Concurrent Modules. In Proceedings of the Mathematical Foundations of Programming Semantics (MFPS), Nijmegen, The Netherlands, 22–25 June 2015.
28. Jones, C. Tentative Steps Toward a Development Method for Interfering Programs. Trans. Programm. Lang. Syst. 1983, 5, 596–619.
29. Jung, R.; Krebbers, R.; Birkedal, L.; Dreyer, D. Higher-Order Ghost State. In Proceedings of the International Conference on Functional Programming (ICFP), Nara, Japan, 18–24 September 2016; Volume 51, pp. 256–269.
30. Bergstra, J.; Klop, J. Process Algebra for Synchronous Communication. Inf. Control 1984, 60, 109–137.
31. Moller, F. The Importance of the Left Merge Operator in Process Algebras. In Proceedings of the Automata, Languages and Programming (ICALP), Warwick, UK, 16–20 July 1990; Paterson, M., Ed.; Springer: Berlin, Germany, 1990; pp. 752–764.
32. Fokkink, W.; Zantema, H. Basic Process Algebra with Iteration: Completeness of its Equational Axioms. Comput. J. 1994, 37, 259–267.
33. Baeten, J. Process Algebra with Explicit Termination; Department of Mathematics and Computing Science, Eindhoven University of Technology: Eindhoven, The Netherlands, 2000.
34. Vafeiadis, V. Concurrent Separation Logic and Operational Semantics. In Proceedings of the Mathematical Foundations of Programming Semantics (MFPS), Pittsburgh, PA, USA, 25–28 May 2011; Volume 276, pp. 335–351.
35. Reynolds, J. Separation Logic: A Logic for Shared Mutable Data Structures. In Proceedings of the Logic in Computer Science (LICS), Copenhagen, Denmark, 22–25 July 2002; pp. 55–74.
36. Boyland, J. Checking Interference with Fractional Permissions. In Proceedings of the Static Analysis (SAS), San Diego, CA, USA, 11–13 June 2003; LNCS; Cousot, R., Ed.; Springer: Berlin, Germany, 2003; Volume 2694, pp. 55–72.
37. Bornat, R.; Calcagno, C.; O’Hearn, P.; Parkinson, M. Permission Accounting in Separation Logic. In Proceedings of the Principles of Programming Languages (POPL), Long Beach, CA, USA, 12–14 January 2005; pp. 259–270.
38. O’Hearn, P.; Yang, H.; Reynolds, J. Separation and Information Hiding. In Proceedings of the Principles of Programming Languages (POPL), Venice, Italy, 14–16 January 2004; ACM: New York, NY, USA, 2004; pp. 268–280.
39. Roever, W.D.; Engelhardt, K.; Buth, K. Data Refinement: Model-Oriented Proof Methods and Their Comparison; Cambridge University Press: Cambridge, UK, 1998; Volume 47.
40. Appel, A.; Blazy, S. Separation Logic for Small-Step CMINOR. In Theorem Proving in Higher Order Logics (TPHOLs); Schneider, K., Brandt, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 5–21.
41. Hobor, A. Oracle Semantics. Ph.D. Thesis, Princeton University, Princeton, NJ, USA, 2008.
42. Hobor, A.; Appel, A.; Nardelli, F. Oracle Semantics for Concurrent Separation Logic. In Proceedings of the Programming Languages and Systems (ESOP), Budapest, Hungary, 29 March–6 April 2008; Drossopoulou, S., Ed.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 353–367.
43. Appel, A.; Melliès, P.; Richards, C.; Vouillon, J. A Very Modal Model of a Modern, Major, General Type System. In Proceedings of the Principles of Programming Languages (POPL), Nice, France, 17–19 January 2007; ACM: New York, NY, USA, 2007; pp. 109–122.
44. Usenko, Y. Linearization in μCRL; Technische Universiteit Eindhoven: Eindhoven, The Netherlands, 2002.
45. Padon, O.; McMillan, K.; Panda, A.; Sagiv, M.; Shoham, S. Ivy: Safety Verification by Interactive Generalization. In Proceedings of the Programming Language Design and Implementation (PLDI), Santa Barbara, CA, USA, 13–17 June 2016; ACM: New York, NY, USA, 2016; pp. 614–630.
46. Oortwijn, W.; Blom, S.; Huisman, M. Future-based Static Analysis of Message Passing Programs. In Programming Language Approaches to Concurrency- & Communication-cEntric Software (PLACES); Open Publishing Association: Waterloo, Australia, 2016; pp. 65–72.
47. Leino, K.; Müller, P.; Smans, J. Verification of Concurrent Programs with Chalice. In Proceedings of the Foundations of Security Analysis and Design (FOSAD), Bertinoro, Italy, 30 August–4 September 2009; Volume 5705, pp. 195–222.
48. Parkinson, M.; Summers, A. The Relationship between Separation Logic and Implicit Dynamic Frames. In Proceedings of the European Symposium on Programming (ESOP), Saarbrücken, Germany, 26 March–3 April 2011; Barthe, G., Ed.; Springer: Berlin, Germany, 2011; pp. 439–458.
49. Joosten, S.; Oortwijn, W.; Safari, M.; Huisman, M. An Exercise in Verifying Sequential Programs with VerCors. In Proceedings of the Formal Techniques for Java-like Programs (FTfJP), Amsterdam, The Netherlands, 16 July 2018; Summers, A., Ed.; ACM: New York, NY, USA, 2018.
50. Blom, S.; Darabi, S.; Huisman, M. Verification of Loop Parallelisations. In Proceedings of the Fundamental Approaches to Software Engineering (FASE), London, UK, 11–18 April 2015; LNCS; Egyed, A., Schaefer, I., Eds.; Springer: Berlin, Germany, 2015; Volume 9033, pp. 202–217.
51. Svendsen, K.; Birkedal, L. Impredicative Concurrent Abstract Predicates. In Proceedings of the European Symposium on Programming (ESOP), Grenoble, France, 5–13 April 2014; LNCS; Shao, Z., Ed.; Springer: Berlin, Germany, 2014; Volume 8410, pp. 149–168.
52. Svendsen, K.; Birkedal, L.; Parkinson, M. Modular Reasoning about Separation of Concurrent Data Structures. In Proceedings of the European Symposium on Programming (ESOP), Rome, Italy, 16–24 March 2013; Felleisen, M., Gardner, P., Eds.; Springer: Berlin, Germany, 2013; pp. 169–188.
53. Feng, X. Local Rely-Guarantee Reasoning. In Proceedings of the Principles of Programming Languages (POPL), Savannah, Georgia, USA, 21–23 January 2009; ACM: New York, NY, USA, 2009; Volume 44, pp. 315–327.
54. Jung, R.; Swasey, D.; Sieczkowski, F.; Svendsen, K.; Turon, A.; Birkedal, L.; Dreyer, D. Iris: Monoids and Invariants as an Orthogonal Basis for Concurrent Reasoning. In Proceedings of the Principles of Programming Languages (POPL), Mumbai, India, 12–18 January 2015; ACM: New York, NY, USA, 2015; pp. 637–650.
55. Vafeiadis, V.; Parkinson, M. A Marriage of Rely/Guarantee and Separation Logic. In Proceedings of the International Conference on Concurrency Theory (CONCUR), Lisbon, Portugal, 4–7 September 2007; Caires, L., Vasconcelos, V., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 256–271.
56. Krebbers, R.; Jung, R.; Bizjak, A.; Jourdan, J.; Dreyer, D.; Birkedal, L. The Essence of Higher-Order Concurrent Separation Logic. In Proceedings of the European Symposium on Programming (ESOP), Uppsala, Sweden, 22–29 April 2017; LNCS; Yang, H., Ed.; Springer: Berlin/Heidelberg, Germany, 2017; Volume 10201, pp. 696–723.
57. Villard, J.; Lozes, É.; Calcagno, C. Proving Copyless Message Passing. In Proceedings of the Asian Symposium on Programming Languages and Systems (APLAS), Seoul, Korea, 14–16 December 2009; Hu, Z., Ed.; Springer: Berlin/Heidelberg, Germany, 2009; pp. 194–209.
58. Honda, K.; Vasconcelos, V.; Kubo, M. Language Primitives and Type Discipline for Structured Communication-Based Programming. In Proceedings of the European Symposium on Programming (ESOP), Lisbon, Portugal, 28 March–4 April 1998; Hankin, C., Ed.; Springer: Berlin/Heidelberg, Germany, 1998; pp. 122–138.
59. Summers, A.; Müller, P. Actor Services—Modular Verification of Message Passing Programs. In Proceedings of the European Symposium on Programming (ESOP), Eindhoven, The Netherlands, 2–8 April 2016; Thiemann, P., Ed.; Springer: Berlin/Heidelberg, Germany, 2016; pp. 699–726.
60. Hinrichsen, J.; Bengtson, J.; Krebbers, R. Actris: Session-Type Based Reasoning in Separation Logic. Proc. ACM Programm. Lang. 2019, 4, 1–30.
61. Calcagno, C.; Parkinson, M.; Vafeiadis, V. Modular Safety Checking for Fine-Grained Concurrency. In Proceedings of the Static Analysis (SAS), Kongens Lyngby, Denmark, 22–24 August 2007; Nielson, H., Filé, G., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 233–248.
62. Siegel, S.; Zheng, M.; Luo, Z.; Zirkel, T.; Marianiello, A.; Edenhofner, J.; Dwyer, M.; Rogers, M. CIVL: The Concurrency Intermediate Verification Language. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Austin, TX, USA, 15–20 November 2015; ACM: New York, NY, USA, 2015; p. 61.
63. Gupta, A.; Popeea, C.; Rybalchenko, A. Threader: A Constraint-Based Verifier for Multi-threaded Programs. In Proceedings of the International Conference on Computer Aided Verification (CAV), Snowbird, UT, USA, 14–20 July 2011; Gopalakrishnan, G., Qadeer, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 412–417.
64. Zheng, M.; Rogers, M.; Luo, Z.; Dwyer, M.; Siegel, S. CIVL: Formal Verification of Parallel Programs. In Proceedings of the Automated Software Engineering (ASE), Lincoln, NE, USA, 9–13 November 2015; pp. 830–835.
65. Luo, Z.; Zheng, M.; Siegel, S. Verification of MPI programs using CIVL. In Proceedings of the 24th European MPI Users’ Group Meeting (EuroMPI), Chicago, IL, USA, 25–28 September 2017.
66. Penninckx, W.; Jacobs, B.; Piessens, F. Sound, Modular and Compositional Verification of the Input/Output Behavior of Programs. In Proceedings of the European Symposium on Programming (ESOP), London, UK, 11–18 April 2015; Vitek, J., Ed.; Springer: Berlin/Heidelberg, Germany, 2015; pp. 158–182.
67. Blom, S.; Huisman, M.; Zaharieva-Stojanovski, M. History-Based Verification of Functional Behaviour of Concurrent Programs. In Proceedings of the Software Engineering and Formal Methods (SEFM), York, UK, 7–11 September 2015; LNCS; Calinescu, R., Rumpe, B., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9276, pp. 84–98.
68. Zaharieva-Stojanovski, M. Closer to Reliable Software: Verifying Functional Behaviour of Concurrent Programs. Ph.D. Thesis, University of Twente, Enschede, The Netherlands, 2015.
69. Sergey, I.; Nanevski, A.; Banerjee, A. Specifying and Verifying Concurrent Algorithms with Histories and Subjectivity. In Proceedings of the European Symposium on Programming (ESOP), London, UK, 11–18 April 2015; LNCS; Vitek, J., Ed.; Springer: Berlin/Heidelberg, Germany, 2015; Volume 9032, pp. 333–358.
70. Herlihy, M.; Wing, J. Linearizability: A Correctness Condition for Concurrent Objects. Trans. Programm. Lang. Syst. 1990, 12, 463–492.
71. Vafeiadis, V. Automatically Proving Linearizability. In Proceedings of the Computer-Aided Verification (CAV), Edinburgh, UK, 15–19 July 2010; Touili, T., Cook, B., Jackson, P., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 450–464.
72. Krishna, S.; Shasha, D.; Wies, T. Go with the Flow: Compositional Abstractions for Concurrent Data Structures. Princ. Programm. Lang. 2017, 2, 1–31.
73. Neele, T.; Willemse, T.; Groote, J. Solving Parameterised Boolean Equation Systems with Infinite Data Through Quotienting. In Proceedings of the Formal Aspects of Component Software (FACS), Pohang, Korea, 10–12 October 2018; Bae, K., Ölveczky, P., Eds.; Springer: Berlin/Heidelberg, Germany, 2018; pp. 216–236.

Figure 1. The annotated Owicki–Gries example (the annotations are coloured blue).

Figure 2. Standard bisimulation equivalences of the process algebra language.

Figure 3. The entailment rules of the program logic.

Figure 4. Standard proof rules of the program logic.

Figure 5. The extended proof rules related to handling process-algebraic models.

Figure 6. An excerpt of the transition rules of the ghost operational semantics.

Figure 7. An excerpt of the fault semantics of ghost configurations.

Figure 8. The behavioural process-algebraic specification of the leader election protocol. Processes of the form if b then P else Q are a shorthand for b : P + ¬b : Q. Moreover, all ensures clauses of ParElect are translated into trailing assertions, as described earlier at the beginning of Section 3.4.2.

Figure 9. An excerpt of the annotated implementation of the leader election protocol. Annotations of the form context P are shorthand for requires P; ensures P.

Figure 10. Bootstrap procedures of the leader election protocol.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Oortwijn, W.; Gurov, D.; Huisman, M. An Abstraction Technique for Verifying Shared-Memory Concurrency. Appl. Sci. 2020, 10, 3928. https://doi.org/10.3390/app10113928
