Binary Context-Free Grammars

Turaev, Sherzod; Abdulghafor, Rawad; Amer Alwan, Ali; Abd Almisreb, Ali; Gulzar, Yonis

doi:10.3390/sym12081209

Open AccessArticle

Binary Context-Free Grammars

by

Sherzod Turaev

^1,*

,

Rawad Abdulghafor

²

,

Ali Amer Alwan

²

,

Ali Abd Almisreb

³

and

Yonis Gulzar

⁴

¹

Department of Computer Science & Software Engineering, College of Information Technology, United Arab Emirates University, Al Ain 15551, UAE

²

Department of Computer Science, Faculty of Information and Communication Technology, International Islamic University Malaysia, Gombak, Selangor 53100, Malaysia

³

Faculty of Engineering and Natural Sciences, International University of Sarajevo, 71210 Sarajevo, Bosnia and Herzegovina

⁴

Department of Management Information Systems, King Faisal University, Al-Ahsa 31982, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Symmetry 2020, 12(8), 1209; https://doi.org/10.3390/sym12081209

Submission received: 17 June 2020 / Revised: 9 July 2020 / Accepted: 21 July 2020 / Published: 24 July 2020

(This article belongs to the Section Computer)

Download

Browse Figure

Versions Notes

Abstract

:

A binary grammar is a relational grammar with two nonterminal alphabets, two terminal alphabets, a set of pairs of productions and the pair of the initial nonterminals that generates the binary relation, i.e., the set of pairs of strings over the terminal alphabets. This paper investigates the binary context-free grammars as mutually controlled grammars: two context-free grammars generate strings imposing restrictions on selecting production rules to be applied in derivations. The paper shows that binary context-free grammars can generate matrix languages whereas binary regular and linear grammars have the same power as Chomskyan regular and linear grammars.

Keywords:

formal language; binary grammar; context-free grammar; matrix grammar; relational grammar; computation power; chomsky hierarchy

1. Introduction

A “traditional” phrase-structure grammar (also known as a Chomskyan grammar) is a generative computational mechanism that produces strings (words) over some alphabet starting from the initial symbol and sequentially applying production rules that rewrite sequences of symbols [1,2,3]. According to the forms of production rules, phrase-structure grammars and their languages are divided into four families: regular, context-free, context-sensitive, and recursively enumerable [4,5].

Regular and context-free grammars, which have good computational and algorithmic properties, are widely used in modeling and studying of phenomena appearing in linguistics, computer science, artificial intelligence, biology, etc. [6,7]. However, many complex structures such as duplication (

w w

), multiple agreements (

a^{n} b^{n} c^{n}

) and crossed agreements (

a^{n} b^{m} c^{n} d^{m}

) found in natural languages, programming languages, molecular biology, and many other areas cannot be represented by context-free grammars [6]. Context-sensitive grammars that can model these and other “non-context-free" structures are too powerful in order to be used in applications [5,8]. In addition many computational problems related to context-sensitive grammars are undecidable, and known algorithms for decidable problems concerning to these grammars have exponential complexities [4,5,6,9].

An approach to overcome this problem is to define “in-between’’ grammars that are powerful than context-free grammars but have similar computational properties. Regulated grammars are such types of grammars that are defined by adding control mechanisms to underlying context-free grammars in order to select specific strings for their languages [5]. Several variants of regulated grammars, such as matrix, programmed, ordered, context-conditional, random-context, tree-controlled grammars, have been defined according to control mechanisms used with the grammars [5,6,8]. Regulated grammars are classified into two main categories: rule-based regulated grammars that generate their languages under various production-related restrictions, and context-based regulated grammars that produce their languages under different context-related restrictions [10].

Though this type of the classification of regulated grammars allows us to understand the nature of restrictions imposed on the grammars, it does not clarify the role of control mechanisms from the aspect of the computational power. If we observe the control mechanisms used in both categories of regulated grammars, we can see that they consist of two parts [6]: (1) a “regular part", which is represented by a regular language of the labels of production rules (in rule-based case) or a regular language over the nonterminal and/or terminal alphabets (in context-based case) and (2) an “irregular part", which is represented by appearance checking (in rule-based case) and forbidding context (in context-based case), provides an additional power to the regular part.

If a regular part is solely used, the regulated grammars can generate small subsets of context-sensitive languages. On the other hand, if the both of regulations are used, then most regulated grammars generate all context-sensitive languages [6,10]. At this point, we again return to the same computational problems related to context-sensitive grammars, which we discussed early. Thus, we need to consider such regulation mechanisms that can grant to extend the family of context-free languages only to required ranges that cover necessary aspects of modeled phenomena. One of the possibilities to realize this idea can be the combination of several regular control mechanisms. One can consider a matrix of matrices or a matrix conditional context as combined regulation mechanisms. For instance, paper [11] studied simple-semi-conditional versions of matrix grammars with this approach. The problem in this case is that the combinations of such control mechanisms are, firstly, not natural and, secondly, they are too complex.

We propose, as a solution, an idea of imposing multiple regulations on a context-free grammar without combining them. We realize this idea by using relational grammars. A relational grammar is an n-ary grammar, i.e., a system of n terminal alphabets, n nonterminal alphabets, a set of n-tuples of productions and the initial n-tuple of nonterminals that generates the language of relations, i.e., tuples of strings over the terminal alphabets. On the other hand, a relational grammar can be considered of a system of n grammars in which each grammar generates own language using the corresponding productions enclosed in n-tuples. Thus, we can redefine a relational grammar as a system of mutually controlled n grammars where the grammars in relation generate their languages imposing restrictions on the applications of productions of other grammars. If we specify one grammar as the main, the other grammars in the system can be considered to be the regulation mechanisms controlling the generative processes of the main grammar.

This work is a preliminary work in studying mutually controlled grammars. In this paper, we define binary context-free grammars and study their generative power. We show that even mutually controlled two grammars can be as powerful as matrix grammars or other regulated grammars without appearance checking or forbidding context.

The paper is organized as follows. Section 2 surveys on regulated, parallel and relational grammars that are related to the introduced grammars. Section 3 contains necessary notions and notations used throughout the paper. Section 4 defines binary strings, languages and grammars Section 5 introduces synchronized normal forms for binary grammars and shows that for any binary context-free (regular, linear) grammar there exists an equivalent binary grammar in synchronized form. Section 6 investigate the generative powers of binary regular, linear and context-free grammars. Section 7 discusses the results of the paper and the power of mutually controlled grammars, and indicates to possible topics for future research.

2. Regulated, Parallel and Relational Grammars

In this section, we briefly survey some variants of regulated grammars with respect to the control mechanisms associated with them, parallel grammars as well as relational grammars, which are related to the introduced mutually controlled grammars.

The purpose of regulation is to restrict the use of the productions in a context-free grammar to select only specific terminal derivations successful hence to obtain a subset of the context-free language generated in usual way. Various regulation mechanisms used in regulated grammars can be classified into general types by their common features.

Control by prescribed sequences of production rules where the sequence of productions applied in a derivation belong to a regular language associated with the grammar:

matrix grammars [12]—the set of production rules is divided into matrices and if the application of a matrix is started, a second matrix can be started after finishing the application of the first one, as well as the rules have to been applied in the order given a matrix;
vector grammars [13]—in which a new matrix can be started before finishing those which have been started earlier;
regularly controlled grammars [14]—the sequence of production rules applied in a derivation belong to a given regular language associated with the grammar.

Control by computed sequences of production rules where a derivation is accompanied by a computation, which selects the allowed derivations:

programmed grammars [15]—after applying a production rule, the next production rule has to be chosen from its success field, and if the left hand side of the rule does not occur in the sentential form, a rule from its failure field has to be chosen;
valence grammars [16]—where with each sentential form an element of a monoid is associated, which is computed during the derivation and derivations where the element associated with the terminal word is the neutral element of the monoid are accepted.

Control by context conditions where the applicability of a rule depends on the current sentential form and with any rule some restrictions are associated for sentential forms which have to be satisfied in order to apply the rule:

random context grammars [17]—the restriction is the belonging to a regular language associated with the rule;
conditional grammars [18]—the restriction to special regular sets;
semi-conditional grammars [19]—the restriction to words of length one in the permitting and forbidden contexts;
ordered grammars [18]—a production rule can be applied if there is no greater applicable production rule.

Control by memory where with any nonterminal in a sentential form, its derivation is associated:

indexed grammars [20]—the application of production rules gives sentential forms where the nonterminal symbols are followed by sequences of indexes (stack of special symbols), and indexes can be erased only by rules contained in these indexes but erasing of the indexes is done in reverse order of their appearance.

Control by external mechanism where a mechanism used to select derivations does not belong to the grammar:

graph-controlled grammars [21,22]—the sequence of productions applied in a derivation to obtain a string corresponds to a path, whose nodes represent the production rules, in an associated bicolored digraph;
Petri net controlled grammars [23,24]—the sequence of productions used to obtain a string of the language of a grammar corresponds to a firing sequence of transitions, which are labeled by the productions, from the initial marking to a final marking.

Parallelism is another nontraditional approach used with grammars where, instead of rewriting a single symbol in each derivation step, several symbols can be rewritten simultaneously. There are two main variants of parallel mechanisms associated with grammars. The first is total parallelism, which is used in the broad varieties of (Deterministic Extended Tabled Zero-Sided) Lindenmayer systems [5,25] where all symbols of strings including terminals are in each step rewritten by productions. The second is partial parallelism where all or some nonterminal symbols (not terminal symbols) are written in each step of the derivations:

absolutely parallel grammars [26]—all nonterminals of the sentential form are rewritten in one derivation step;
Indian parallel grammars [27]—all occurrences of one letter are replaced (according to one rule);
Russian parallel grammars [28]—which combines the context-free and Indian parallel feature;
scattered context grammars [29]—in which only a fixed number of symbols can be replaced in a step but the symbols can be different;
concurrently controlled grammars [30]—the control over a parallel application of the productions is realized by a Petri net with different parallel firing strategies.

Another perspective in using the notion of parallelism with grammars is a grammar system, which is a system of several phrase-structure grammars with own axioms, symbols and rewriting productions that can work simultaneously and generate own strings. One of such grammar systems is a parallel communicating grammar system [31,32], where the grammars start from separate axioms, work parallelly rewriting their own sentential forms, and also communicate with each other by request. The language of one distinguished grammar in the system is considered the language of the system.

A relational grammar (an n-ary grammar) can be considered to be another type of grammar systems consisting of several grammars that work by applying productions synchronously or asynchronously [33,34]. More precisely, an n-ary grammar (where n is a positive integer) is a system of n terminal alphabets, n nonterminal alphabets, a set of productions and an initial n-tuple of nonterminals. Each production is an n-tuple of common productions or empty places. An n-ary grammar generates the language of relations, i.e., n-tuples of strings over the terminal alphabets. Work [33] showed that classes of languages generated by relational grammars forms a hierarchy between the family of context-free languages and the family of context-sensitive languages. Paper [34] studied closure, projective and other properties of relational grammars, and generalized the Chomsky’s classification for n-ary grammars. Several other papers [35,36,37,38,39,40] also investigated the properties of relational grammars and applied in solving problems appeared in natural and visual language processing.

3. Notions and Notations

Throughout the paper, we assume that the reader is familiar with the basic concepts and results of the theory of formal languages, Petri nets and relations; for details we refer to [4,9,41] (formal languages, automata, computation), [6,10] (regulated rewriting systems), [42,43] (Petri nets), [23,24,44] (Petri net controlled grammars), and [33,34,45,46,47,48] (finitary relations). Though, in this section, we recall all necessary notions and notations that are important for understanding this paper.

Basic conventions: the inclusion is denoted by ⊆ and the strict (proper) inclusion is denoted by ⊂. The symbol ∅ denotes the empty set. The powerset of a set X is denoted by

P (X)

, while its cardinality is denoted by

| X |

. An ordered sequence of elements

a, b

is called a pair and denoted by

(a, b)

. Two pairs

(a_{1}, a_{2})

and

(b_{1}, b_{2})

are equal iff

a_{1} = b_{1}

and

a_{2} = b_{2}

. Let

X, Y

be sets. The set of all pairs

(a, b)

, where

a \in X

and

b \in Y

, is called the Cartesian product of X and Y, and denoted by

X \times Y

. Then,

X \times X = X^{2}

. A binary relation on sets

X, Y

is a subset of the Cartesian product

X \times Y

.

Strings, Languages and Grammars

We first recall the fundamental concepts of the formal language theory such as an alphabet, a string and a language from [41]:

Definition 1.

An alphabet is a nonempty set of abstract symbols.

Definition 2.

A string (or a word) over an alphabet Σ is a finite sequence of symbols from Σ. The sequence of zero symbols is called the empty string, and denoted by λ. The set of all strings over Σ is denoted by

Σ^{*}

. The set

Σ^{*} - {λ}

is denoted by

Σ^{+}

.

Definition 3.

A subset of

Σ^{*}

is called a language.

Definition 4.

The number of the occurrences of symbols in

w \in Σ^{*}

is called its length and denoted by

| w |

. The number of occurrences of a symbol x in a string w is denoted by

{| w |}_{x}

.

Example 1.

Let

Σ = {a, b, c}

be an alphabet. Then,

w = a a a b b b c c

is a string over Σ where

| w | = 8

and

{| w |}_{a} = {| w |}_{b} = 3

,

{| w |}_{c} = 2

. We can notice that w belongs to

L = {a^{n} b^{n} c^{m} ∣ n \geq 1, m \geq 1}

, which is a language over Σ.

Next, we cite the definitions of context-free, matrix grammars and related notations which are more detailly given in [4,6].

Definition 5.

A context-free grammar is a quadruple

G = (V, Σ, S, R)

where V and Σ are disjoint alphabets of nonterminal and terminal symbols, respectively,

S \in V

is the start symbol and

R \subseteq V \times {(V \cup Σ)}^{*}

is a finite set of (production) rules. Usually, a rule

(A, x)

is written as

A \to x

. A rule of the form

A \to λ

is called an erasing rule, and a rule of the form

A \to x

, where

x \in Σ^{*}

, is called terminal.

Definition 6.

Let

G = (V, Σ, S, R)

be a context-free grammar. If

R \subseteq V \times Σ^{*} (V \cup {λ})

, then G is called regular, and if

R \subseteq V \times Σ^{*} (V \cup {λ}) Σ^{*}

, then it is called linear.

The families of regular, linear and context-free languages are denoted by

L (REG)

,

L (LIN)

and

L (CF)

, respectively.

Definition 7.

Let

G = (V, Σ, S, R)

be a context-free grammar.

The string $x \in {(V \cup Σ)}^{+}$ directly derives $y \in {(V \cup Σ)}^{*}$ , written as $x \to y$ , if and only if there is a rule $r = A \to α \in R$ such that $x = x_{1} A x_{2}$ and $y = x_{1} α x_{2}$ .
The reflexive and transitive closure of the relation ⇒ is denoted by $\Rightarrow^{*}$ .
A derivation using the sequence of rules $τ = r_{1} r_{2} \dots r_{n}$ is denoted by $\overset{τ}{\Rightarrow}$ or $\overset{r_{1} r_{2} \dots r_{n}}{\Rightarrow}$ .
The language generated by a grammar G is defined by $L (G) = {w \in Σ^{*} ∣ S \Rightarrow^{*} w}$ .

Example 2.

G_{1} = ({S, A, B}, {a, b, c}, S, R)

where R contains the productions:

r_{0} : S \to A B, r_{1} : A \to a A b, r_{2} : A \to a b, r_{3} : B \to c B, r_{4} : B \to c

is a context-free grammar, and it generates the language L in Example 1.

Definition 8.

A matrix grammar is a quadruple

G = (V

, Σ, S,

M)

where

V, Σ, S

are defined as for a context-free grammar, M is a finite set of matrices which are finite strings over a set of context-free rules (or finite sequences of context-free rules). The language generated by a matrix grammar G is defined by

L (G) = {w \in Σ^{*} ∣ S \overset{π}{\Rightarrow} w and π \in M^{*}}

.

Example 3.

G^{'} = ({S, A, B}, {a, b, c}, S, M)

where M contains the matrices:

m_{0} : (S \to A B), m_{1} : (A \to a A b, B \to c B), m_{2} : (A \to a b, B \to c)

is a matrix grammar, and it generates the language

{a^{n} b^{n} c^{n} ∣ n \geq 1}

.

The family of languages generated by matrix grammars is denoted by

L (MAT)

.

Lastly, we retrieve the notions of a Petri net, a context-free Petri net and a Petri net controlled grammar from [23,24,44].

Definition 9.

A Petri net is a construct

N = (P, T, F, ϕ)

where P and T are disjoint finite sets of places and transitions, respectively,

F \subseteq (P \times T) \cup (T \times P)

is a set of directed arcs,

ϕ : (P \times T) \cup (T \times P) \to {0, 1, 2, \dots}

is a weight function, where

ϕ (x, y) = 0

for all

(x, y) \in ((P \times T) \cup (T \times P)) - F

.

A Petri net can be represented by a bipartite directed graph with the node set

P \cup T

where places are drawn as circles, transitions as boxes and arcs as arrows with labels

ϕ (p, t)

or

ϕ (t, p)

. If

ϕ (p, t) = 1

or

ϕ (t, p) = 1

, the label is omitted. A mapping

μ : P \to {0, 1, 2, \dots}

is called a marking. For each place

p \in P

,

μ (p)

gives the number of tokens in p.

Definition 10.

A context-free Petri net (in short, a cf Petri net) with respect to a context-free grammar

G = (V, Σ, S, R)

is a tuple

N = (P, T, F, ϕ, β, γ, ι)

where

(1): $(P, T, F, ϕ)$ is a Petri net;
(2): labeling functions $β : P \to V$ and $γ : T \to R$ are bijections;
(3): there is an arc from place p to transition t if and only if $γ (t) = A \to α$ and $β (p) = A$ . The weight of the arc $(p, t)$ is 1;
(4): there is an arc from transition t to place p if and only if $γ (t) = A \to α$ and $β (p) = x$ where $x \in V$ and ${| α |}_{x} > 0$ . The weight of the arc $(t, p)$ is ${| α |}_{x}$ ;
(5): the initial marking ι is defined by $ι (β^{- 1} (S)) = 1$ and $ι (p) = 0$ for all $p \in P - {β^{- 1} (S)}$ .

The following example ([23]) explains the construction of a cf Petri net.

Example 4.

Let

G_{1}

be a context-free grammar defined in Example 2. Figure 1 illustrates a cf Petri net N with respect to the grammar

G_{1}

.

Definition 11.

A Petri net controlled grammar is a tuple

G = (V, Σ, S, R, N, γ, M)

where V, Σ, S, R are defined as for a context-free grammar and the construct

N = (P, T, F, ϕ, ι)

is a Petri net,

γ : T \to R \cup {λ}

is a labeling function and M is a set of final markings. The language generated by a Petri net controlled grammar G, denoted by

L (G)

, consists of all strings

w \in Σ^{*}

such that there is a derivation

S \overset{r_{1} r_{2} \dots r_{k}}{\Rightarrow} w

and an occurrence sequence

ν = t_{1} t_{2} \dots t_{s}

which is successful for M such that

r_{1} r_{2} \dots r_{k} = γ (t_{1} t_{2} \dots t_{s})

.

The family of languages generated by Petri net controlled grammars is denoted by

L (PN)

.

The hierarchical relationships of the language families defined above are summarized as follows.

Theorem 1.

L (REG) \subset L (LIN) \subset L (CF) \subset L (MAT) = L (PN) .

The correctness of inclusions

L (REG) \subset L (LIN) \subset L (CF)

was first shown in [1,3]. The proof of the strict inclusion

L (CF) \subset L (MAT)

can be found in [41]. The equality

L (MAT) = L (PN)

was established in [23].

4. Binary Strings, Languages and Grammars

In this section, we define binary strings, languages and grammars by modifying the n-ary counterparts initially studied in [33,34].

Definition 12.

A pair of strings

u_{1}, u_{2}

, is called an binary string over V and denoted by

u = (u_{1}, u_{2})

. The binary empty string is denoted by

(λ, λ)

.

Definition 13.

A subset L of

{(V^{*})}^{2}

is called a binary language.

Definition 14.

The concatenation of binary strings

u = (u_{1}, u_{2}) \in {(V^{*})}^{2}

and

v = (v_{1}, v_{2}) \in {(V^{*})}^{2}

is defined as

u v = (u_{1} v_{1}, u_{2} v_{2})

.

Definition 15.

For two binary languages

L_{1}, L_{2} \in {(V^{*})}^{2}

, their

union is defined as $L_{1} \cup L_{2} = {w ∣ w \in L_{1} or w \in L_{2}},$
concatenation is defined as $L_{1} L_{2} = {u v ∣ u \in L_{1} and v \in L_{2}} .$

Definition 16.

For

L \in {(V^{*})}^{2}

, its Kleene star is defined as

L^{*} = L^{0} \cup L^{1} \cup L^{2} \cup \dots

where

L^{0} = {(λ, λ)}

and

L^{i} = L^{i - 1} L

,

i \geq 1

.

Definition 17.

A binary context-free grammar is a quadruple

G = (V_{1} \times V_{2}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R)

where

(1): $V_{i}$ , $i = 1, 2$ , are sets of nonterminal symbols,
(2): $Σ_{i}$ , with $V_{i} \cap Σ_{i} = \emptyset$ , $i = 1, 2$ , are sets of terminal symbols,
(3): $(S_{1}, S_{2}) \in V_{1} \times V_{2}$ is the start (initial) pair, and
(4): R is a finite set nonempty set of binary productions (rules).
A binary production is a pair $(r_{1}, r_{2})$ where $r_{i}$ , $i = 1, 2$ , is either empty or it is a context-free production, i.e., $r_{i} \in V_{i} \times {(V_{i} \cup Σ_{i})}^{*}$ .

Remark 1.

A binary production

(r_{1}, r_{2})

can be written as

u \to v

where

u = (u_{1}, u_{2})

,

v = (v_{1}, v_{2})

and

u_{i} \to v_{i}

if

r_{i} \neq \emptyset

,

u_{i} = v_{i} = λ

, if

r_{i} = \emptyset

. For production

r_{i} = u_{i} \to v_{i}

, to indicate its left-hand side and right-hand side, we also use notations

l h s (r_{i})

and

r h s (r_{i})

, i.e.,

l h s ((r_{i}) = u_{i}

and

r h s ((r_{i}) = v_{i}

.

Remark 2.

In Definition 17, if each nonempty

r_{i}

is regular production, then the grammar G is called regular and if each nonempty

r_{i}

is linear production, then it is called linear.

Remark 3.

We say

G_{l} = (V_{1}, Σ_{1}, S_{1}, R_{1})

and

G_{r} = (V_{2}, Σ_{2}, S_{2}, R_{2})

are left and right grammars with respect to the binary grammar G, respectively, where

R_{1} = {r_{1} ∣ (r_{1}, r_{2}) \in R and r_{1} \neq \emptyset} and R_{2} = {r_{2} ∣ (r_{1}, r_{2}) \in R and r_{2} \neq \emptyset} .

Definition 18.

Let G be a binary context-free grammar. Let

u, v

be pairs in

{(V_{1} \cup Σ_{1})}^{*} \times {(V_{2} \cup Σ_{2})}^{*}

. We say that u directly derives v, written as

u \to v

, if there exist pairs

x, z \in {(V_{1} \cup Σ_{1})}^{*} \times {(V_{2} \cup Σ_{2})}^{*}

and a production

y \to w \in R

such that

u = x y z

and

v = x w z

. The reflexive and transitive closure of → is denoted by

\to^{*}

.

Definition 19.

The binary language generated by a binary context-free grammar G is defined as

L (G) = {(w_{1}, w_{2}) \in (Σ_{1}^{*} \times Σ_{2}^{*}) ∣ (S_{1}, S_{2}) \to^{*} (w_{1}, w_{2})} .

Definition 20.

The left and right languages are defined as

L_{l} (G) = {w_{1} ∣ (w_{1}, w_{2}) \in L (G)} and L_{r} (G) = {w_{2} ∣ (w_{1}, w_{2}) \in L (G)},

i.e., the sets of left and right strings in all binary strings of

L (G)

, respectively.

Example 5.

Consider the binary grammar

G_{2} = ({S_{1}} \times {S_{2}}, {a, b} \times {a, b}, (S_{1}, S_{2}), R)

where R consists of the following productions

(S_{1}, S_{2}) \to (S_{1} S_{1}, a S_{2}), (S_{1}, S_{2}) \to (λ, b S_{2}), (S_{1}, S_{2}) \to (λ, λ) .

It is not difficult to see that, after n steps, the first production produces the pair

(S_{1}^{n + 1}, a^{n} S_{1})

. Then in order to eliminate all

S_{1}^{'}

s, we apply the second production n times, and terminate derivation with applying the third production, which generates the pair string

(λ, a^{n} b^{n})

. Thus,

L (G_{2}) = {(λ, a^{n} b^{n}) ∣ n > 0}

, and

L_{r} (G_{2}) = {a^{n} b^{n} ∣ n > 0} \in L (CF)

.

Example 6.

The grammar

G_{3} = ({S_{1}, A, B, C} \times {S_{2}, X, Y, Z}, {a, b, c}^{2}, (S_{1}, S_{2}), R)

where R consists of the productions

\begin{matrix} (S_{1} \to A B C, S_{2} \to X), \\ (A \to a A, X \to Y), & (B \to b B, Y \to Z), & (C \to c C, Z \to X), \\ (A \to a, X \to Z), & (B \to b, Z \to Y), & (C \to c, Y \to λ), \end{matrix}

generates the language

L (G_{3}) = {(λ, a^{n} b^{n} c^{n}) ∣ n > 0}

where the right language

L_{r} (G_{3}) = {a^{n} b^{n} c^{n} ∣ n > 0} \notin L (CF)

.

Example 6 illustrates that binary context-free grammars can generate non-context-free languages, which implies that binary context-free grammars are more powerful than the Chomskyan context-free grammars. Thus, binary context-free grammars can be used in studying non-context-free structures such as cross-serial dependencies appearing in natural and programming languages using “context-free” tools such as parsing (derivation) trees.

We denote the families of binary languages generated by binary grammars with left and right grammars of type

X, Y \in {REG, LIN, CF}

by

B (X, Y)

. We also denote the the families of the left and right languages generated by binary grammars by

B (X)

,

X \in {REG, LIN, CF}

.

5. Synchronized Forms for Binary Grammars

In a binary grammar, the derivation of a string in the left or right grammar can pause or stop while the other still continues because of the rules of the forms

(\emptyset, r_{2})

and

(r_{1}, \emptyset)

. However, we show that the both first and second derivations by binary context-free grammars can be synchronized, i.e., in each derivation step, some pair of nonempty productions is applied and both derivations stop at the same time.

Definition 21.

A binary context-free (regular, linear) grammar G is called synchronized if it does not have any production of the form

(\emptyset, r_{2})

or

(r_{1}, \emptyset)

.

Lemma 1.

For every binary context-free grammar, there exist an equivalent synchronized binary context-free grammar.

Proof.

Let

G = (V_{1} \times V_{2}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R)

be a binary context-free grammar. Let

R_{\emptyset} = {(r_{1}, r_{2}) \in R ∣ r_{1} = \emptyset or r_{2} = \emptyset} .

We define the binary cf grammar

G^{'} = (V_{1}^{'} \times V_{2}^{'}, Σ_{1} \times Σ_{2}, (S_{1}^{'}, S_{2}^{'}), R^{'})

where

V_{i}^{'} = V_{i} \cup {X, S_{1}^{'}, S_{2}^{'}}

,

i = 1, 2

, where X,

S_{1}^{'}

and

S_{2}^{'}

are new nonterminals, and

\begin{matrix} R^{'} = & (R - R_{\emptyset}) \\ \cup {(X \to X, r_{2}) ∣ (\emptyset, r_{2}) \in R} \\ \cup {(r_{1}, X \to X) ∣ (r_{1}, \emptyset) \in R} \\ \cup {(S_{1}^{'} \to S_{1} X, S_{2}^{'} \to S_{2} X), (X \to λ, X \to λ)} . \end{matrix}

Then, the equality

L (G) = L (G^{'})

is obvious. □

The proof of Lemma 1 cannot be used for showing that there is also a synchronized form for a binary regular or linear grammar. The next lemma illustrates the existence of an equivalent synchronized form for any binary regular grammar too.

Lemma 2.

For every binary regular grammar, there exist an equivalent synchronized binary regular grammar.

Proof.

Let

G = (V_{1} \times V_{2}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R)

be a binary regular grammar. Let

R_{\emptyset} = {(r_{1}, r_{2}) \in R ∣ r_{1} = \emptyset or r_{2} = \emptyset}

and

\begin{matrix} R_{1} = & {(r_{1}, r_{2}) \in R ∣ rhs (r_{1}) \in T_{1}^{*}, rhs (r_{2}) \in T_{2}^{*}}, \\ R_{2} = & {(r_{1}, r_{2}) \in R ∣ rhs (r_{1}) \in T_{1}^{*} N_{1}, rhs (r_{2}) \in T_{2}^{*} N_{2}} . \end{matrix}

The proof of the lemma consists of two parts. First, we replace all binary productions of the form

(A_{1} \to x_{1}, r_{2})

and

(r_{1}, A_{2} \to x_{2})

with the productions

(A_{1} \to x_{1} E, r_{2})

and

(r_{1}, A_{2} \to x_{2} E)

where E is a new nonterminal,

x_{i} \in T_{1}^{*} \cup T_{2}^{*}

and

rhs (r_{i}) \notin T_{1}^{*} \cup T_{2}^{*}

,

i = 1, 2

. By this change, the early stop of one of derivations by the left and right grammars is prevented. Let

\begin{matrix} R_{3} = & {(r_{1}, r_{2}) \in R ∣ rhs (r_{1}) \in T_{1}^{*}, rhs (r_{2}) \notin T_{2}^{*}}, \\ R_{4} = & {(r_{1}, r_{2}) \in R ∣ rhs (r_{1}) \notin T_{1}^{*}, rhs (r_{2}) \in T_{2}^{*}} . \end{matrix}

We set

V_{1}^{'} = V_{1} \cup {E}

,

V_{2}^{'} = V_{2} \cup {E}

and

\begin{matrix} R_{3}^{'} = & {(A_{1} \to x_{1} E, r_{2}) ∣ (A_{1} \to x_{1}, r_{2}) \in R_{3}}, \\ R_{4}^{'} = & {(r_{1}, A_{2} \to x_{2} E) ∣ (r_{1}, A_{2} \to x_{2}) \in R_{4}} \end{matrix}

and

R^{'} = (R - (R_{3} \cup R_{4})) \cup R_{3}^{'} \cup R_{4}^{'}

.

Second, in order to eliminate empty productions, we replace all binary production rules of the form

(A_{1} \to x_{1} B_{1}, \emptyset)

and

(\emptyset, A_{2} \to x_{2} B_{2})

with the pairs of production rules

(A_{1} \to A_{r}, A_{2} \to A_{r})

,

(A_{r} \to x_{1} B_{1}, A_{r} \to A_{2})

and

(A_{1} \to A_{r}, A_{2} \to A_{r})

,

(A_{r} \to A_{1}, A_{r} \to x_{2} B_{2})

, respectively, where

A_{r}

s are new nonterminals. Thus, we define the following sets of new productions:

\begin{matrix} R_{5} = { & (A_{1} \to A_{r}, A_{2} \to A_{r}), (A_{r} \to A_{1}, A_{r} \to x B_{2}) ∣ \\ r = (\emptyset, A_{2} \to x B_{2}) \in R^{'} and (A_{1}, A_{2}) \in V_{1}^{'} \times V_{2}^{'}}, \\ R_{6} = { & (A_{1} \to A_{r}, A_{2} \to A_{r}), (A_{r} \to x B_{1}, A_{r} \to A_{2}) ∣ \\ r = (A_{1} \to x B_{1}, \emptyset) \in R^{'} and (A_{1}, A_{2}) \in V_{1}^{'} \times V_{2}^{'}} \end{matrix}

where

A_{r}

s are new nonterminal symbols introduced for each production

r = (A_{1} \to α_{1}, \emptyset)

or

r = (\emptyset, A_{2} \to α_{2})

in

R^{'}

. Let

V_{R, 1} = {A_{r} ∣ r = (\emptyset, A_{2} \to x B_{2}) \in R^{'}}

and

V_{R, 2} = {A_{r} ∣ r = (A_{1} \to x B_{1}, \emptyset) \in R^{'}} .

We define the binary regular grammar

G^{″} = (V_{1}^{″} \times V_{2}^{″}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R^{″})

as follows:

\begin{matrix} V_{1}^{″} = & V_{1}^{'} \cup V_{R, 1}, \\ V_{2}^{″} = & V_{2}^{'} \cup V_{R, 2}, \\ R^{″} = & R_{1} \cup R_{2} \cup R_{5} \cup R_{6} \cup {(E \to λ, E \to λ)} . \end{matrix}

First, we show that

L (G) \subseteq L (G^{″})

. Consider a derivation

(S_{1}, S_{2}) = (w_{1, 0}, w_{2, 0}) \Rightarrow (w_{1, 1}, w_{2, 1}) \Rightarrow \dots \Rightarrow (w_{1, n}, w_{2, n}) = (w_{1}, w_{2})

in G where

(w_{1}, w_{2}) \in Σ_{1}^{*} \times Σ_{2}^{*} .

For a derivation step

(w_{1, i - 1}, w_{2, i - 1}) \Rightarrow (w_{1, i}, w_{2, i})

,

1 \leq i \leq n

, the following cases are possible:

Case 1: the right-hand side is obtained by applying a production from

R_{1} \cup R_{2} \cup {(E \to \emptyset, E \to \emptyset)}

. Then, the same rule is also applied in the corresponding step in the simulating derivation of

G^{″}

.

Case 2: the right-hand side is obtained by applying a production of the form

r = (\emptyset, A_{2} \to α_{2})

or

r = (A_{1} \to α_{1}, \emptyset)

. Then, the production sequence

(A_{1} \to A_{r}, A_{2} \to A_{r}), (A_{r} \to A_{1}, A_{r} \to x B_{2}) \in R_{5}

or the production sequence

(A_{1} \to A_{r}, A_{2} \to A_{r}), (A_{r} \to x B_{1}, A_{r} \to A_{2}) \in R_{6}

is applied in the corresponding step in the simulating derivation of

G^{″}

.

Case 3: the right-hand side is obtained by applying a rule of the form

(A_{1} \to x_{1}, r_{2})

,

rhs (r_{2}) \neq \emptyset

or

(r_{1}, A_{2} \to x_{2})

,

rhs (r_{1}) \neq \emptyset

. Then, the rule

(A_{1} \to x_{1} E, r_{2})

or

(r_{1}, A_{2} \to x_{2} E)

is applied in the corresponding step in the simulating derivation in

G^{″}

, and the derivation terminates with applying

(E \to \emptyset, E \to \emptyset)

.

The inclusion

L (G^{″}) \subseteq L (G)

is obvious:

(1) the application of a rule of the form

(A_{1} \to x_{1} E, r_{2})

,

rhs (r_{2}) \neq \emptyset

, or

(r_{1}, A_{2} \to x_{2} E)

,

rhs (r_{1}) \neq \emptyset

, can be immediately replaced with the pair

(A_{1} \to x_{1}, r_{2})

or

(r_{1}, A_{2} \to x_{2})

, respectively;

(2) if a rule of the form

(A_{1} \to A_{r}, A_{2} \to A_{r})

or

(A_{1} \to A_{r}, A_{2} \to A_{r})

is applied at some derivation step, the only applicable pair of productions then is

(A_{r} \to A_{1}, A_{r} \to x B_{2})

or

(A_{r} \to x B_{1}, A_{r} \to A_{2})

, respectively, since

A_{r}

is the unique for each pair of productions. Thus, the sequence of pairs of productions

(A_{1} \to A_{r}, A_{2} \to A_{r})

and

(A_{r} \to A_{1}, A_{r} \to x B_{2})

or

(A_{1} \to A_{r}, A_{2} \to A_{r})

and

(A_{r} \to x B_{1}, A_{r} \to A_{2})

are replaced with

r = (\emptyset, A_{2} \to α_{2})

or

r = (A_{1} \to α_{1}, \emptyset)

, respectively. □

Using the same arguments of the proof of Lemma 2, one can show that the similar fact also holds for binary linear grammars.

Lemma 3.

For every binary linear grammar, there exist an equivalent synchronized binary linear grammar.

6. Generative Capacities of Binary Grammars

In this section, we discuss the generative capacities of binary regular, linear and context free grammars.

The following two lemmas immediately follows from the definitions of binary languages.

Lemma 4.

B (X, Y) = B (Y, X)

,

X \in {REG, LIN, CF}

.

Lemma 5.

B (REG) \subseteq B (LIN) \subseteq B (CF) .

Lemma 6.

L (REG) \subseteq B (REG), L (LIN) \subseteq B (LIN), L (CF) \subseteq B (CF) .

Proof.

Let

G = (V, Σ, S, R)

be a context-free (regular, linear) grammar. Then, we define the binary context-free (regular, linear) grammar

G^{'} = (V \times V, Σ \times Σ, R^{'}, (S, S))

by setting, for each production

r = A \to α \in R

, the production

r^{'} = (\emptyset, A \to α)

in

R^{'}

. Then, it is not difficult to see that

L (G) = L_{r} (G^{'})

. In the same way, we can also show that

L (G) = L_{l} (G^{'})

. □

Now we show that binary regular and linear grammars generate regular and linear languages, respectively.

Lemma 7.

B (LIN) \subseteq L (LIN) .

Proof.

Let

G = (V_{1} \times V_{2}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R)

be a binary linear grammar. Without loss of generality, we can assume that the grammar G is synchronized. We set

V = {[A_{1}, A_{2}] ∣ (A_{1}, A_{2}) \in V_{1} \times V_{2}}

where

[A_{1}, A_{2}]

,

(A_{1}, A_{2}) \in V_{1} \times V_{2}

, are new nonterminals, and we define

\begin{matrix} R^{'} = & {[A_{1}, A_{2}] \to x_{2} [B_{1}, B_{2}] y_{2} ∣ (A_{1} \to x_{1} B_{1} y_{1}, A_{2} \to x_{2} B_{2} y_{2}) \in R} \\ \cup {[A_{1}, A_{2}] \to x_{2} ∣ (A_{1} \to x_{1}, A_{2} \to x_{2}) \in R} . \end{matrix}

Then,

G^{'} = (V, Σ_{2}, [S_{1}, S_{2}], R^{'})

is a linear grammar, and

L_{r} (G) = L (G^{'})

is obvious. Hence,

L_{r} (G)

is linear, i.e.,

B (LIN) \subseteq L (LIN)

. □

Corollary 1.

B (REG) \subseteq L (REG) .

Next, we show that binary context-free grammars are more powerful than Chomskyan context-free grammars.

Lemma 8.

L (CF) \subset B (CF)

.

Proof.

By Lemma 6,

L (CF) \subseteq B (CF)

. Let us consider a binary context-free grammar

G_{4} = ({S, C, D, E} \times {S, A, B}, {a, b, c}^{2}, (S, S), R)

where R consists of the following productions:

\begin{matrix} (1) (S \to S, S \to A B), \\ (2) (S \to C, A \to a A), & (C \to S, B \to a B), \\ (3) (S \to D, A \to b A), & (D \to S, B \to b B), \\ (4) (S \to E, A \to λ), & (E \to a, B \to λ), \end{matrix}

Any successful derivation starts with production

(1)

. For each production sequence

(i)

,

2 \leq i \leq 4

, if the first production is applied, then, the only applicable production in the derivation is the second one. If after

(1)

, production sequence

(4)

is applied, then the derivation generates the binary string

(a, λ)

. Else, after some steps, the derivation results in

(S, w A w B)

,

w \in {a, b}^{+}

, by applying production sequences

(2)

or/and

(3)

, and then, it terminates by applying production sequence

(4)

. Thus,

L (G_{4}) = {(a, w w) ∣ w \in {a, b}^{*}} .

Since

L_{r} (G_{4}) \notin L (CF)

, we have the strict inclusion

L (CF) \subset B (CF)

. □

Next, we show that binary context-free grammars are at least as powerful as matrix grammars.

Lemma 9.

L (MAT) \subseteq B (CF) .

Proof.

Let

G = (V, Σ, S, M)

be a matrix grammar where M consists of matrices

m_{1}, m_{2}, \dots, m_{n}

with

m_{i} : (r_{i, 1}, r_{i, 2}, \dots, r_{i, k (i)})

,

1 \leq i \leq n

. We set the following sets of new nonterminals

\begin{matrix} V_{1} = & {Y_{i} ∣ 1 \leq i \leq n} \cup {Z_{i, j} ∣ 1 \leq i \leq n, 1 \leq j \leq k (i)} \cup {S_{1}, X}, \\ V_{2} = & V \cup {S_{2}}, \end{matrix}

We construct the following binary productions

(1): the start production: $(S_{1} \to X, S_{2} \to S),$
(2): the matrix entry productions: $(X \to Y_{i}, \emptyset), 1 \leq i \leq n,$
(3): the matrix processing productions:

$(Y_{i} \to Z_{i, 1}, r_{i, 1}), (Z_{i, 1} \to Z_{i, 2}, r_{i, 2}), \dots, (Z_{i, k (i) - 1} \to Z_{i, k (i)}, r_{i, k (i)})$

where $1 \leq i \leq n$ ,
(4): the matrix exit productions: $(Z_{i, k (i)} \to X, \emptyset), 1 \leq i \leq n .$
(5): the terminating production: $(X \to λ, \emptyset) .$

We define the binary context-free grammar

G^{'} = (V_{1} \times V_{2}, Σ \times Σ, (S_{1}, S_{2}), R)

where R consists of all productions (1)–(5) constructed above.

Claim 1:

L (G) \subseteq L (G^{'})

. Let

D : S = w_{0} \overset{m_{i_{1}}}{\Rightarrow} w_{1} \overset{m_{i_{2}}}{\Rightarrow} w_{2} \dots \overset{m_{i_{k}}}{\Rightarrow} w_{k} = w \in Σ^{*}

be a derivation in G. We construct the derivation

D^{'}

in

G^{'}

that simulates D. The derivation starts with the step:

(S_{1}, S_{2}) \Rightarrow (X, S) .

Since,

m_{i_{1}}

is the first matrix applied in derivation D, the next step in

D^{'}

is

(X, S) \Rightarrow (Y_{i_{1}}, S) .

Furthermore,

(Y_{i_{1}}, S) \overset{(Y_{i_{1}} \to Z_{i_{1}}, r_{i_{1}})}{\Rightarrow} \dots \overset{(Z_{i_{1}, k (i_{1}) - 1} \to Z_{i_{1}, k (i_{1})}, r_{i_{1}, k (i_{1})})}{\Rightarrow} w_{1} .

By applying

(Z_{i_{1}, k (i_{1})} \to X, \emptyset)

,

w_{1} \overset{(Z_{i_{1}, k (i_{1})} \to X, \emptyset)}{\Rightarrow} w_{1},

we return X to the derivation, and then, we can continue the simulation of the application of the matrix

m_{i_{2}}

in the same manner. Thus,

D^{'}

simulates D, and

L (G) \subseteq L (G^{'})

.

Claim 2:

L (G) \subseteq L (G^{'})

. Any successfully terminating derivation in

G^{'}

starts by applying production

(S_{1} \to X, S_{2} \to S)

, followed by applying

(X \to Y_{i}, \emptyset)

for some

1 \leq i \leq n

. Then only possible productions to be applied are matrix processing productions of the form (3). When the application of the productions in the currently active sequence starts, the productions of another sequence of the form (3) cannot be applied. In order to switch to another sequence of productions of the form (3), the corresponding production of the form (4) must be applied after finishing the application of all productions in the current sequence in the given order. To successfully terminate the derivation, the productions of the forms (4) and (5) must be applied. By construction, each sequence of productions of the form (3) simulates some matrix from G, any successful derivation in

G^{'}

can be simulated by a successful derivation in G. Thus,

L (G^{'}) \subseteq L (G)

. □

The lemma above shows that any matrix language can be generated by a binary grammar where one of its grammars is regular and the other is context-free. Here, the natural question arises whether there is a binary grammar with both grammars are context-free that generates a non-matrix language or not. Next lemma shows that binary context-free grammars can only generate matrix languages even if their both grammars are context-free.

Lemma 10.

B (CF) \subseteq L (MAT) .

Proof.

Let

G = (V_{1} \times V_{2}, Σ_{1} \times Σ_{2}, (S_{1}, S_{2}), R)

be a binary context-free grammar. Without loss of generality, we assume that G is in a synchronized form. Let

R_{2} = {r_{2} ∣ (r_{1}, r_{2}) \in R}

. The proof idea is as follows: First, we will construct a context-free Petri net N with respect to G. Second, we define a Petri net controlled grammar

G^{'}

where the underlying right grammar

G_{r} = (V_{2}, Σ_{2}, S_{2}, R_{2})

is controlled by the Petri net N. Then we show that

L_{r} (G) = L (G^{'})

, i.e.,

B (CF) \subseteq L (PN)

.

Part 1: We construct the cf Petri net

N = (P, T, F, ϕ, β, γ, ι)

with respect to the nonterminals of the left grammar and productions of the grammar G by setting its components in the following way:

$(P, T, F, ϕ)$ is a Petri net;
the labeling functions $β : P \to V_{1}$ and $γ : T \to R$ are bijections;
there is an arc from place p to transition t if and only if $γ (t) = (r_{1}, r_{2})$ and $β (p) = lhs (r_{1})$ . The weight of the arc $(p, t)$ is 1;
there is an arc from transition t to place p if and only if $γ (t) = (r_{1}, r_{2})$ and $β (p) = X$ where $X \in V_{1}$ and $| rhs (r_{1}) |_{X} > 0$ . The weight of the arc $(t, p)$ is $| rhs (r_{1}) |_{X}$ ;
the initial marking $ι$ is defined by $ι (β^{- 1} (S_{1})) = 1$ and $ι (p) = 0$ for all $p \in P - {β^{- 1} (S_{1})}$ .

Part 2: Using the right grammar

(V_{2}, Σ_{2}, R_{2}, S_{2})

, we define the PN controlled grammar

G^{'} = (V_{2}, Σ_{2}, R_{2}, S_{2}, N, η, M)

where

N = (P, T, F, ϕ, β, γ, ι)

is the cf Petri net defined above,

η : T \to R_{2}

is a labeling function and M is a set of final markings. We set

M = \emptyset

and

η (t) = r_{2}

if and only if

γ (t) = (r_{1}, r_{2})

.

Part 3: Now we show that

L_{r} (G) = L (G^{'})

. Let

D : (S_{1}, S_{2}) \overset{(r_{1, 1}, r_{2, 1})}{\Rightarrow} (w_{1, 1}, w_{2, 1}) \overset{(r_{1, 2}, r_{2, 2})}{\Rightarrow} \dots \overset{(r_{1, k}, r_{2, k})}{\Rightarrow} (w_{1, k}, w_{2, k})

be a derivation in G where

(w_{1, k}, w_{2, k}) \in Σ_{1}^{*} \times Σ_{2}^{*}

. We show that the derivation D can be simulated by the derivation

D^{'}

in the grammar

G^{'}

constructed as follows.

D^{'}

starts with

S_{2}

, and by definition of N,

ι (β^{- 1} (S_{1})) = 1

, thus, transition

t_{1} = γ^{- 1} ((r_{1, 1}, r_{2, 1}))

is enabled. In the first step, we obtain

D^{'} : S_{2} \overset{r_{2, 1}}{\Rightarrow} w_{2, 1} where η (t_{1}) = r_{2, 1} .

When transition

t_{1}

occurs the place in N corresponding to each nonterminal in

rhs (r_{1, 1}) = w_{1, 1}

receive the tokens whose number is equal to the number of the occurrence of the nonterminal in

rhs (r_{1, 1})

.

Suppose that for some

1 \leq i < k

, we constructed the first i steps of the derivation

D^{'}

:

D^{'} : S_{2} \overset{r_{2, 1}}{\Rightarrow} w_{2, 1} \overset{r_{2, 2}}{\Rightarrow} \dots \overset{r_{2, i}}{\Rightarrow} w_{2, i}

with

r_{2, 1} r_{2, 2} \dots r_{2, i} = η (t_{1} t_{2} \dots t_{i})

where

(r_{1, 1}, r_{2, 1}) (r_{1, 2}, r_{2, 2}) \dots (r_{1, i}, r_{2, i}) = γ (t_{1} t_{2} \dots t_{i})

, which corresponds to the first i steps of D. By definition,

γ (t_{i}) = (r_{1, i}, r_{2, i})

and

η (t_{i}) = r_{2, i}

. When transition

t_{i}

fires, a token moves from the input place of

t_{i}

, that is labeled by the left-hand side of

r_{1, i}

, to its output places, that are labeled by the nonterminals occurring in the right-hand side of

r_{1, i}

. Thus, these nonterminals are also occur in

w_{1, i}

. The next step in D occurs by applying the pair

(r_{1, i + 1}, r_{2, i + 1})

:

(w_{1, i}, w_{2, i}) \overset{(r_{1, i + 1}, r_{2, i + 1})}{\Rightarrow} (w_{1, i + 1}, w_{2, i + 1}) .

Since production

r_{1, i + 1}

is applicable in the current step, its left-hand side occurs in

w_{1, i}

. It follows that the transition

t_{i + 1}

,

γ (t_{i + 1}) = (r_{1, i + 1}, r_{2, i + 1})

, can fire. Consequently, we choose the production

r_{2, i + 1}

with

η (t_{i + 1}) = r_{2, i + 1}

, in

D^{'}

, and obtain

w_{2, i} \overset{r_{2, i + 1}}{\Rightarrow} w_{2, i + 1} .

The last step in D results in

(w_{1, k}, w_{2, k}) \in Σ_{1}^{*} \times Σ_{2}^{*}

that is obtained by applying the pair

(r_{1, k}, r_{2, k})

. Then, we choose

r_{2, k}

with

η (t_{k}) = r_{2, k}

in

D^{'}

. Since

w_{1, k} \in Σ_{1}^{*}

, i.e., it does not contain nonterminals, all places of N have no tokens. Thus,

M = \emptyset

. It shows that

L_{r} (G) \subseteq L (G^{'})

.

Let

D^{'} : S_{2} \overset{r_{2, 1}}{\Rightarrow} w_{2, 1} \overset{r_{2, 2}}{\Rightarrow} \dots \overset{r_{2, k}}{\Rightarrow} w_{2, k}

with

r_{2, 1} r_{2, 2} \dots r_{2, k} = η (t_{1} t_{2} \dots t_{k})

. By definition, we immediately obtain

γ (t_{1} t_{2} \dots t_{k}) = (r_{1, 1}, r_{2, 1}) (r_{2, 2}, r_{2, 2}) \dots (r_{1, k}, r_{2, k})

for some

(r_{1, i}, r_{2, i}) \in R

,

1 \leq i \leq k

. Then, we can construct the derivation D in the grammar G

D : (S_{1}, S_{2}) \overset{(r_{1, 1}, r_{2, 1})}{\Rightarrow} (w_{1, 1}, w_{2, 1}) \overset{(r_{1, 2}, r_{2, 2})}{\Rightarrow} \dots \overset{(r_{1, k}, r_{2, k})}{\Rightarrow} (w_{1, k}, w_{2, k}),

which shows that

L (G^{'}) \subseteq L_{r} (G)

. Thus,

B (CF) \subseteq L (PN)

. By Theorem 1,

B (CF) \subseteq L (MAT)

□.

We summarize the results obtained above in the following theorem.

Theorem 2.

B (REG) = L (REG) \subset B (LIN) = L (LIN) \subset L (CF) \subset L (MAT) = B (CF) .

7. Conclusions

In this paper, we redefined binary grammars as mutually controlled grammars where either grammar in a relation generates own language imposing restriction to the other.

Though binary grammars are asynchronous systems by their definitions, we showed that they can also work in synchronized mode (Lemmas 1–3), i.e., the both grammars in a binary relation generate strings with derivations where the grammars apply some productions in each step, and stop at the same time. This feature of binary grammars allows using one grammar in a relation as a regulation mechanism for the other.

We have studied the generative capacity of binary context-free grammars, and showed that binary regular and linear grammars have the same power as their Chomskyan alternatives, i.e., traditional regular and linear grammars, respectively (Lemmas 6 and 7 and Corollary 1). On the other hand, we have proved that binary context-free grammars are much more powerful than traditional context-free grammars (Lemma 8), that is, they generate all matrix languages even if binary grammars consist of regular and context-free pairs (Lemma 9). Moreover, we established that using context-free grammars as the components of relations does not increase the computational power of binary context-free grammars, i.e., they remain equivalent to matrix grammars (Lemma 10). Using the inclusion hierarchies in Theorem 1 and the results of the paper, we obtained the comparative hierarchy for binary regular, linear and context-free grammars (Theorem 2). We have also illustrated that binary grammars have practical significance: Example 6, and Lemma 8 show that cross-serial dependencies such as duplication and multiple agreements—non-context-free syntactical structures appearing in natural and programming languages—can be expressed with binary grammars.

Here, we emphasize that ternary or higher degree relational context-free grammars are more powerful than binary ones, and can be used in modeling “nested” cross-serial dependencies. Let us assume that a ternary context-free grammar is defined similarly to binary grammars. Then, the reader can convince himself that the following language

\{{(a_{1}^{n_{1}} b_{1}^{n_{1}} c_{1}^{n_{1}})}^{n} {(a_{2}^{n_{2}} b_{2}^{n_{2}} c_{2}^{n_{2}})}^{n} {(a_{3}^{n_{3}} b_{3}^{n_{3}} c_{3}^{n_{3}})}^{n} ∣ n, n_{1}, n_{2}, n_{3} \geq 1\},

the language of nested mutual agreements, can be generated by a ternary context-free grammar

G = ({S_{1}, S_{2}, S_{3}}, {a_{i}, b_{i}, c_{i} ∣ i = 1, 2, 3}, (S_{1}, S_{2}, S_{3}), R)

where R consists of the following tuples of productions:

\begin{matrix} (S_{1} \to X_{1}, S_{2} \to X_{2}, S_{3} \to A_{1} A_{2} B_{1} B_{2} C_{1} C_{2}), & (X_{1} \to X_{1}, X_{2} \to X_{2}^{'}, A_{1} \to a_{1} A_{1} b_{1}), \\ (X_{1} \to X_{1}, X_{2}^{'} \to X_{2}, A_{2} \to c_{1} A_{2}), & (X_{1} \to Y_{1}, X_{2}^{'} \to Y_{2}, A_{2} \to c_{1} A_{2}), \\ (Y_{1} \to Y_{1}, Y_{2} \to Y_{2}^{'}, B_{1} \to a_{2} B_{1} b_{2}), & (Y_{1} \to Y_{1}, Y_{2}^{'} \to Y_{2}, B_{2} \to c_{2} B_{2}), \\ (Y_{1} \to Z_{1}, Y_{2}^{'} \to Z_{2}, B_{2} \to c_{2} B_{2}), & (Z_{1} \to Z_{1}, Z_{2} \to Z_{2}^{'}, C_{1} \to a_{3} C_{1} b_{3}), \\ (Z_{1} \to Z_{1}, Z_{2}^{'} \to Z_{2}, C_{2} \to c_{3} C_{2}), & (Z_{1} \to X_{1}, Z_{2}^{'} \to X_{2}, C_{2} \to c_{3} C_{2}), \\ (X_{1} \to X_{1}, X_{2} \to X_{2}^{'}, A_{1} \to a_{1} b_{1}), & (X_{1} \to Y_{1}, X_{2}^{'} \to Y_{2}, A_{2} \to c_{1}), \\ (Y_{1} \to Y_{1}, Y_{2} \to Y_{2}^{'}, B_{1} \to a_{2} b_{2}), & (Y_{1} \to Z_{1}, Y_{2}^{'} \to Z_{2}, B_{2} \to c_{2}), \\ (Z_{1} \to Z_{1}, Z_{2} \to Z_{2}^{'}, C_{1} \to a_{3} b_{3}), & (Z_{1} \to λ, Z_{2}^{'} \to λ, C_{2} \to c_{3}) . \end{matrix}

The detailed study of higher degree relational grammars as mutually controlled grammars will be the topic of our next investigation.

Author Contributions

Conceptualization, S.T. and A.A.A. (Ali Amer Alwan); methodology, S.T. and R.A.; validation, A.A.A. (Ali Abd Almisreb) and Y.G.; formal analysis, A.A.A. (Ali Abd Almisreb) and Y.G.; investigation, S.T. and R.A.; writing—original draft preparation, S.T. and R.A.; writing—review and editing, S.T. and A.A.A. (Ali Amer Alwan); supervision, S.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported by the United Arab Emirates University Start-Up Grant 31T137.

Acknowledgments

We would like to thank the anonymous reviewers for their valuable comments and useful remarks about this paper.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study and in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

CF	context-free
REG	regular
LIN	linear
MAT	matrix

References

Chomsky, N. Three models for the description of languages. IRE Trans. Inf. Theory 1956, 2, 113–124. [Google Scholar] [CrossRef] [Green Version]
Chomsky, N. Syntactic Structure; Mouton: Gravenhage, The Netherland, 1957. [Google Scholar]
Chomsky, N. On certain formal properties of grammars. Inf. Control 1959, 2, 137–167. [Google Scholar] [CrossRef] [Green Version]
Hopcroft, J.; Motwani, R.; Ullman, J. Introduction to Automata Theory, Languages, and Computation; Pearson: London, UK, 2007. [Google Scholar]
Rozenberg, G.; Salomaa, A. (Eds.) Handbook of Formal Languages; Springer: Berlin/Heidelberg, Germany, 1997; Volume 1–3. [Google Scholar]
Dassow, J.; Păun, G. Regulated Rewriting in Formal Language Theory; Springer: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
Pǎun, G.; Rozenberg, G.; Salomaa, A. DNA Computing. New Computing Paradigms; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar]
Meduna, A.; Soukup, O. Modern Language Models and Computation. Theory with Applications; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Sipser, M. Introduction to the Theory of Computation; Cengage Learning: Boston, MA, USA, 2013. [Google Scholar]
Meduna, A.; Zemek, P. Regulated Grammars and Automata; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
Meduna, A.; Kopeček, T. Simple-Semi-Conditional Versions of Matrix Grammars with a Reduced Regulated Mechanism. Comput. Inform. 2004, 23, 287–302. [Google Scholar]
Abraham, A. Some questions of phrase-structure grammars. Comput. Linguist. 1965, 4, 61–70. [Google Scholar] [CrossRef]
Cremers, A.; Mayer, O. On vector languages. J. Comp. Syst. Sci. 1974, 8, 158–166. [Google Scholar] [CrossRef] [Green Version]
Ginsburg, S.; Spanier, E. Control sets on grammars. Math. Syst. Theory 1968, 2, 159–177. [Google Scholar] [CrossRef]
Rozenkrantz, D. Programmed grammars and classes of formal languages. J. ACM 1969, 16, 107–131. [Google Scholar] [CrossRef]
Pǎun, G. A new generative device: Valence grammars. Rev. Roum. Math. Pures Appl. 1980, 25, 911–924. [Google Scholar]
Cremers, A.; Maurer, H.; Mayer, O. A note on leftmost restricted random context grammars. Inform. Proc. Lett. 1973, 2, 31–33. [Google Scholar] [CrossRef]
Fris, I. Grammars with partial ordering of the rules. Inform. Control 1968, 12, 415–425. [Google Scholar] [CrossRef] [Green Version]
Kelemen, J. Conditional grammars: Motivations, definitions and some properties. In Proc. Conf. Automata, Languages and Mathematical Sciences; Peak, I., Szep, J., Eds.; Salgótarján, Hungary, 1984; pp. 110–123. [Google Scholar]
Aho, A. Indexed grammars. An extension of context-free grammars. J. ACM 1968, 15, 647–671. [Google Scholar] [CrossRef]
Wood, D. Bicolored Digraph Grammar Systems. RAIRO Inform. Thérique et Appl./Theor. Inform. Appl. 1973, 1, 145–150. [Google Scholar] [CrossRef] [Green Version]
Wood, D. A Note on Bicolored Digraph Grammar Systems. IJCM 1973, 3, 301–308. [Google Scholar]
Dassow, J.; Turaev, S. Petri net controlled grammars: The power of labeling and final markings. Rom. J. Inf. Sci. Technol. 2009, 12, 191–207. [Google Scholar]
Dassow, J.; Turaev, S. Petri net controlled grammars: The case of special Petri nets. J. Univers. Comput. Sci. 2009, 15, 2808–2835. [Google Scholar]
Prusinkiewicz, P.; Hanan, J. Lindenmayer Systems, Fractals, and Plants; Lecture Notes in Biomathematics; Springer: Berlin, Germany, 1980; Volume 79. [Google Scholar]
Rajlich, V. Absolutely parallel grammars and two-way deterministic finite state transducers. J. Comput. Syst. Sci. 1972, 6, 324–342. [Google Scholar] [CrossRef] [Green Version]
Siromoney, R.; Krithivasan, K. Parallel context-free languages. Inform. Control 1974, 24, 155–162. [Google Scholar] [CrossRef] [Green Version]
Levitina, M. On some grammars with global productions. NTI Ser. 1972, 2, 32–36. [Google Scholar]
Greibach, S.; Hopcroft, J. Scattered context grammars. J. Comput. Syst. Sci. 1969, 3, 232–247. [Google Scholar] [CrossRef] [Green Version]
Mavlankulov, G.; Othman, M.; Turaev, S.; Selamat, M.; Zhumabayeva, L.; Zhukabayeva, T. Concurrently Controlled Grammars. Kybernetika 2018, 54, 748–764. [Google Scholar] [CrossRef]
Păun, G.; Santean, L. Parallel communicating grammar systems: The regular case. Ann. Univ. Buc. Ser. Mat.-Inform. 1989, 37, 55–63. [Google Scholar]
Csuhaj-Varjú, E.; Dassow, J.; Kelemen, J.; Păun, G. Grammar Systems: A Grammatical Approach to Distribution and Cooperation; Gordon and Beach Science Publishers: New York, NY, USA, 1994. [Google Scholar]
Král, J. On Multiple Grammars. Kybernetika 1969, 5, 60–85. [Google Scholar]
Čulík II, K. n-ary Grammars and the Description of Mapping of Languages. Kybernetika 1970, 6, 99–117. [Google Scholar]
Bellert, I. Relational Phrase Structure Grammar and Its Tentative Applications. Inf. Control 1965, 8, 503–530. [Google Scholar] [CrossRef]
Bellert, I. Relational Phrase Structure Grammar Applied to Mohawk Constructions. Kybernetika 1966, 3, 264–273. [Google Scholar]
Crimi, C.; Guercio, A.; Nota, G.; Pacini, G.; Tortora, G.; Tucci, M. Relation Grammars and their Application to Multidimensional Languages. J. Vis. Lang. Comput. 1991, 4, 333–346. [Google Scholar] [CrossRef]
Wittenburg, K. Earley-Style Parsing for Relational Grammars. In Proceedings of the IEEE Workshop on Visual Languages, Seattle, WA, USA, 15–18 September 1992; pp. 192–199. [Google Scholar]
Wittenburg, K.; Weitzman, L. Relational Grammars: Theory and Practice in a Visual Language Interface for Process Modeling. In Visual Language Theory; Marriott, K., Meyer, B., Eds.; Springer Science & Business Media: New York, NY, USA, 1998; pp. 193–217. [Google Scholar]
Johnson, D. On Relational Constraints on Grammars. In Grammatical Relations; Cole, P., Sadock, J., Eds.; BRILL: Leiden, The Netherlands, 2020; pp. 151–178. [Google Scholar]
Martín-Vide, C.; Mitrana, V.; Păun, G. (Eds.) Formal Languages and Applications; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
Baumgarten, B. Petri-Netze. Grundlagen und Anwendungen; Wissensschaftverlag: Mannheim, Germany, 1990. [Google Scholar]
Reisig, W.; Rozenberg, G. (Eds.) Lectures on Petri Nets I: Basic Models; Springer: Berlin, Germany, 1997; Volume 1441. [Google Scholar]
Dassow, J.; Turaev, S. Petri net controlled grammars with a bounded number of additional places. Acta Cybernetica 2009, 19, 609–634. [Google Scholar]
Novák, V.; Novotný, M. Binary and Ternary Relations. Math. Bohem. 1992, 117, 283–292. [Google Scholar]
Novák, V.; Novotný, M. Pseudodimension of Relational Structures. Czechoslov. Math. J. 1999, 49, 541–560. [Google Scholar]
Cristea, I.; Ştefănescu, M. Hypergroups and n-ary Relations. Eur. J. Comb. 2010, 31, 780–789. [Google Scholar] [CrossRef] [Green Version]
Chaisansuk, N.; Leeratanavalee, S. Some Properties on the Powers of n-ary Relational Systems. Novi Sad J. Math. 2013, 43, 191–199. [Google Scholar]

Figure 1. A cf Petri net N associated with the grammar

G_{1}

, where the places are labeled with the nonterminals and the transitions are labeled with the productions in one-to-one manner. Moreover, the input place of each transition corresponds to the left-hand side of the associated production and its output places correspond to the nonterminals in the right-hand side of the production. If a transition does not have output places, then the associated production is terminal.

Figure 1. A cf Petri net N associated with the grammar

G_{1}

, where the places are labeled with the nonterminals and the transitions are labeled with the productions in one-to-one manner. Moreover, the input place of each transition corresponds to the left-hand side of the associated production and its output places correspond to the nonterminals in the right-hand side of the production. If a transition does not have output places, then the associated production is terminal.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Turaev, S.; Abdulghafor, R.; Amer Alwan, A.; Abd Almisreb, A.; Gulzar, Y. Binary Context-Free Grammars. Symmetry 2020, 12, 1209. https://doi.org/10.3390/sym12081209

AMA Style

Turaev S, Abdulghafor R, Amer Alwan A, Abd Almisreb A, Gulzar Y. Binary Context-Free Grammars. Symmetry. 2020; 12(8):1209. https://doi.org/10.3390/sym12081209

Chicago/Turabian Style

Turaev, Sherzod, Rawad Abdulghafor, Ali Amer Alwan, Ali Abd Almisreb, and Yonis Gulzar. 2020. "Binary Context-Free Grammars" Symmetry 12, no. 8: 1209. https://doi.org/10.3390/sym12081209

APA Style

Turaev, S., Abdulghafor, R., Amer Alwan, A., Abd Almisreb, A., & Gulzar, Y. (2020). Binary Context-Free Grammars. Symmetry, 12(8), 1209. https://doi.org/10.3390/sym12081209

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Binary Context-Free Grammars

Abstract

1. Introduction

2. Regulated, Parallel and Relational Grammars

3. Notions and Notations

Strings, Languages and Grammars

4. Binary Strings, Languages and Grammars

5. Synchronized Forms for Binary Grammars

6. Generative Capacities of Binary Grammars

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI