Entry Aggregation and Early Match Using Hidden Markov Model of Flow Table in SDN

Cheng Wang; Hee Yong Youn

doi:10.3390/s19102341

and

¹

Electrical and Computer Engineering, Sungkyunkwan University, Suwon 16419, Korea

²

College of Software, Sungkyunkwan University, Suwon 16419, Korea

^*

Author to whom correspondence should be addressed.

Sensors2019, 19(10), 2341;https://doi.org/10.3390/s19102341

This article belongs to the Special Issue Software-Defined Networking for Sensor Networks and Internet of Things

Version Notes

Order Reprints

Abstract

The usage of multiple flow tables (MFT) has significantly extended the flexibility and applicability of software-defined networking (SDN). However, the size of MFT is usually limited due to the use of expensive ternary content addressable memory (TCAM). Moreover, the pipeline mechanism of MFT causes long flow processing time. In this paper a novel approach called Agg-ExTable is proposed to efficiently manage the MFT. Here the flow entries in MFT are periodically aggregated by applying pruning and the Quine–Mccluskey algorithm. Utilizing the memory space saved by the aggregation, a front-end ExTable is constructed, keeping popular flow entries for early match. Popular entries are decided by the Hidden Markov model based on the match frequency and match probability. Computer simulation reveals that the proposed scheme is able to save about 45% of space of MFT, and efficiently decrease the flow processing time compared to the existing schemes.

Keywords:

SDN; entry aggregation; Quine–McCluskey Algorithm; match frequency and probability; Hidden Markov Model

1. Introduction

In software-defined networking (SDN) [1,2] efficient network management is achieved by separating the control plane from the data plane, and providing programmable interfaces to the application layer to flexibly enable the customization of the network management functions. The traditional legacy network services often require various processing of data packets such as routing, monitoring, access control, and load balancing. If all the services are provided in a single network infrastructure, the development of the network will be very complicated and adapting to a variety of functional requirements will be difficult. In this regard, SDN provides a new direction of networking services and drastically changes the traditional network with distinctive features including the separation of controller and forwarding unit, virtualization, programmable service, dynamic reconfiguration, and centralized management [3].

SDN is an innovative movement in the field of computer networks. It stems from a series of projects on networks such as active networks, software switches, 4D (Decision, Dissemination, Discovery, and Data) networks, and traditional telephone networks [4,5]. The network infrastructure has been evolving from the approach of vertical integration to horizontal one to be more intelligent. Intelligent control of data forwarding units (switches, routers, etc.) has been unified into the control plane [2]. With SDN the network devices of the data plane only forward the data, while they are directly managed by the control plane. As a result, the control plane can get the global information of the network such as connection topology, link state, etc., and thus proper decisions on forwarding the flows can be made. The network is programmable through a unified interface provided to control the behavior of the whole network [6]. The programmable interface greatly improves the flexibility of network management and data forwarding, and enables the network to dynamically change the path of the flow. In SDN, the physical characteristics of the networking devices are concealed so that the control plane can provide a unified management and services for high level applications [7]. At the same time, the data forwarding devices are abstracted from the control layer to ensure the mobility, and reduce future investment of the devices.

Among the main issues with SDN such as switch designs [8,9], distributed controller platforms [10,11], resilient communication [12,13], and security [14,15], flow table management is one of the primary tasks directly influencing the performance. From OpenFlow 1.3 [16,17], there were several improvements including multiple flow tables (MFT). In SDN, the OpenFlow switches just forward the flows through the MFT which are composed of data packets, and the operation is managed by OpenFlow controllers. Even though MFT plays an important role in SDN, two challenging issues exist which are memory overhead and flow forwarding delay. The memory overhead is high due to the need of ternary content addressable memory (TCAM) and fast forwarding [18]. If the number of entries in the table is limited too much for reducing the overhead, the flow match rate will be very low. Meanwhile, the pipeline mechanism employed with MFT causes forwarding delay as each flow has to go through the flow table one after another, and the action corresponding to the matched entry is executed at the end of the pipeline. There exist various studies dealing with such issues of MFT. For example, the Flow Table Reduction Scheme [18] (FTRS) aggregates the flow entries of the same action and destination addresses. In [19], the Multi-Stage OF (MsOF) switch model is proposed to save memory space and reduce forwarding time by deploying the tables each requiring less memory space. However, the FTRS works well only in local area network (LAN) environment and the implementation of MsOF is relatively complicated.

In this paper we advocate a novel approach called Agg-ExTable which efficiently reduces the memory overhead and flow forwarding time of MFT. Here pruning and the Quine–McCluskey (QM) algorithm are periodically applied to aggregate the flow entries in each flow table, and a table called ExTable is put in front of the pipeline which contains the flow entries of high expected match probability. Its flow entry is dynamically determined using a Hidden Markov model constructed based on the number of transitions and match probability of the entries. The QM algorithm allows an efficient usage of TCAM by reducing the size and power consumption, while ExTable allows quick match and execution of the incoming flows. Computer simulation reveals that the proposed scheme substantially reduces the memory size and flow processing time compared to the TCAM-based size reduction scheme [20] and the scheme merging and migrating the flow entries based on directed acyclic graph (DAG) [21]. The improvement gets more significant as the flow arrival rate rises. The main contributions of the paper are summarized as follows:

The flow entry aggregation problem is simplified by transforming it to a logic minimization problem, which is effectively solved by the QM algorithm.
The ExTable scheme substantially reduces the flow processing time by placing a table containing the entries of high match probability up front.
The match frequency and match probability of the entries are handled with the Hidden Markov model to decide the likelihood of the match.

The remainder of the paper is organized as follows. Section 2 provides an overview of TCAM used for the implementation of MFT, Hidden Markov model, and MFT-based switching. In Section 3 the proposed Agg-ExTable approach is presented, along with the analytical model of the flow processing time based on queueing theory. Section 4 is for the performance evaluation of the proposed schemes. Finally, Section 5 concludes the paper and outlines the future research direction.

2. Related Work

2.1. TCAM

TCAM [20,22] is usually employed to speed up the table look up operation. However, the size is always limited due to high cost. TCAM allows a third matching state of ‘X’ or ‘don’t care’ for one or more bits in the stored entry, allowing the flexibility in the search. For example, a TCAM may have an entry of ‘101XX’ which will match to ‘10100’, ‘10101’, ‘10110’, or ‘10111’. The additional state is typically implemented by adding a mask bit to the corresponding memory cell. If a bit of the mask is ‘0’, the bit of corresponding entry is ‘don’t care’. In Table 1, for example, if the first five bits of a flow are ‘10011’ or ‘10111’, it matches E₁.

Table 1. An example of flow table with TCAM.

Another important feature of TCAM is that ‘0’ and ‘1’ in the mask is not required to be continuous. For E₁ in Table 1, the third bit of the entry is ‘don’t care’, and thus the flow whose first three bits are ‘100’ or ‘101’ matches this entry. The proposed aggregation method takes advantage of this feature of TCAM.

In reference [20], two techniques, pruning and ESPRESSO-II based mask extension, are proposed to compact traditional routing table stored in TCAM. They allow a smaller TCAM reducing the size and power consumption.

2.2. Quine–McCluskey (QM) Algorithm

The Quine–McCluskey (QM) algorithm [23,24,25] was developed for simplifying logical functions. The QM algorithm is effective for implementation since it has tabular form, and it also provides a deterministic method checking if the logical function is minimal. There exist mainly four definitions in using the QM algorithm:

Minterm: an expression in which all variables of the logical function appears once.
Implicant: an aggregation of minterms in the logical function.
Prime implicant: an implicant that cannot be covered by a more simplified implicant.
Essential prime implicant: prime implicants that cover an output of the logical function for which no combination of other prime implicants is able to cover.

The procedure of the QM algorithm consists of two steps:

Step 1: Find prime implicants from the minterm table.
Step 2: Find the essential prime implicants and other necessary prime implicants to cover the logical function.

In the proposed scheme the QM algorithm is employed for mask extension which is transformed to be a logic minimization problem.

2.3. Hidden Markov Model (HMM)

HMM [26,27,28,29] is used in various fields such as language recognition, reinforcement learning, and bioinformatics. An HMM is a finite discrete time Markov model in which the system is assumed to be a Markov process with hidden state. It can be defined as a triple λ = (π,A,B), where π is the initial probability distribution, A is transition probability matrix, and B is a sequence of observation likelihoods (emission probabilities). Specifically, an HMM is defined by the following components [30]:

Q = {q₁, q₂, …, q_N} is a set of states where N is the number of hidden states, and q_t is the hidden state at time t.
O = {o₁, o₂, …, o_T} is a set of observations where T is the number of observations. o_t is the observable state at time t. Each observation is drawn from a vocabulary V = {v₁, v₂, …, v_M}, where M is the number of observation values.
A = {a₁₁, a₁₂, …, a_N₁, …, a_NN} is an N × N transition probability matrix. a_ij (1 ≤ i, j ≤ N) represents the probability of changing from state_i to state_j. Here,

$a_{i j} = P (q_{t + 1} = s_{j} | q_{t} = s_{i}),$

(1)

while,

$a_{i j} \geq 0, \forall i, j,$

(2)

$\sum_{j = 1}^{N} a_{i j} = 1, \forall i .$

(3)
B = {b_j(k)}(1 ≤ j ≤ N, 1 ≤ k ≤ M) is an N × M emission probability matrix. b_j(k) represents the probability of an observation o_t being generated from state_j. Here,

$b_{j} (k) = P (o_{t} = v_{k} | q_{t} = s_{j}),$

(4)

while,

$\sum_{k = 1}^{M} b_{j} (k) = 1, \forall j .$

(5)
π = {π₁, π₂, …, π_N} is the initial probability distribution over hidden states. π_i is the probability that the Markov chain will start in state_i. Here,

$π_{i} = P (q_{1} = s_{i}),$

(6)

while,

$\sum_{i = 1}^{N} π_{i} = 1, \forall i .$

(7)

A generic HMM is illustrated in Figure 1, where H_i(i = 1, 2, 3, …, T) is the set of hidden states and O_i(i = 1, 2, 3, …, T) is the set of observations. The Markov process, located above the observable states, is determined by the current state and A. Only O_i is able to be observed, which is determined by the hidden states of the Markov process and B.

Figure 1. The state diagram of the Hidden Markov model.

2.4. MFT

The pipeline processing of OpenFlow [16,17] is depicted in Figure 2. An OpenFlow switch is required to have one or more flow tables, while a single flow table is used only for relatively simple network.

Figure 2. The pipeline processing with MFT.

The flow tables of an OpenFlow switch are sequentially numbered, starting from 0. The pipeline processing always starts from the first flow table where the packet is matched against its flow entries. Other flow tables may be used depending on the outcome of the match. The pipeline processing occurs only in the forward direction. If a flow entry matches, the instruction set included in that entry is executed at the Action Execution Unit (AEU) located at the end of the pipeline.

The existing flow entry aggregation techniques can be classified into two types according to the location of aggregation, in packet classifier or in OpenFlow switch flow table. The TCAM Razor [31] is a systematic flow aggregation algorithm which uses the decision diagram to minimize the TCAM rules required for packet classification. However, the execution time grows rapidly when the number of flow entries increases. The Fast Flow Table Aggregation [32] (FFTA) and FTRS also support entry aggregation in SDN. FFTA is an offline aggregation scheme based on bit weaving, which applies ORTC (Optimal Routing Table Constructor) after cutting the entries using a binary search tree. The FTRS aggregates the flow entries according to the destination IP address, instead of the match field. It achieves a good compression ratio with different topologies. However, FFTA causes coarse traffic statistics due to the mixture of all the entries, while FRTS has a high probability of flow table overflow.

The schemes of [19,33] aim to implement effective forwarding with MFT. In reference [21], an algorithm called migrating flow rules (MILE) was proposed which merges and migrates the flow entries to reduce the number of flow table lookup operations by employing directed acyclic graph (DAG). The dependencies of the flow entries are handled using DAG, where interdependent entries are grouped and migrated as a whole. Using the 2Q LRU replacement algorithm, the recently accessed entries are replaced at Flow Table_0 to be matched early. With the Multi-Stage OF (MsOF) [19] switch model, more tables each requiring less memory space are deployed. Here a processor is implemented in each flow table, and multiple pipeline operations occur in parallel so that several flows can be matched at the same time. The spatial and temporal complexity were examined using queuing theory. The implementation of MsOF is relatively complicated and needs large networking resources.

In order to improve the efficiency of the MFT, both the TCAM implementation and flow match probability are focused in this paper. The proposed scheme aggregate the entries using the pruning and QM algorithm to minimize the space of TCAM, while the saved memory space is used to store popular entries. With the HMM constructed based on the match frequency and match probability of the entries, the presumably popular flow entries are selected and put in the ExTable located in front of the flow table. The pruning and QM algorithm aim to maximize the efficiency of the TCAM by reducing the size and power consumption. Maintaining high match rate with ExTable substantially decreases the overall flow processing time, via early match and execution of the incoming flows. The proposed scheme is presented in the next section.

3. The Proposed Scheme

3.1. The Structure

The proposed Agg-ExTable scheme allows entry aggregation and fast pipeline operation using ExTable holding popular flows. It works in two phases. In Phase 1, the proposed entry aggregation algorithm is executed periodically to reduce the size of TCAM. Then in Phase 2, the saved memory space is utilized to set up ExTable in front of the pipeline, keeping popular entries. Here the HHM is used to decide the popularity of the flow entry. The structure of the proposed MFT pipeline of an OF switch is illustrated in Figure 3. Observe that there exist two paths; express path for popular flow and regular path for nonpopular flow. When a flow arrives at a switch, it is first parsed for the matching with the ExTable. Upon a match, the actions of the flow entry are sent to AEU. Otherwise, the match operation is continued with the other flow tables.

Figure 3. The structure of the proposed scheme.

3.2. Aggregation of Entry

The number of possible flow paths in a flow table is typically small because only a limited number of interface cards can fit into the switch chassis. In contrast, the number of forwarding entries is quite large, in the range of several thousands. Considering this disparity, a scheme reducing the size of flow table is developed which involves two techniques presented below.

3.2.1. Pruning of Redundant Entries

Pruning is a technique eliminating some redundant entries [20]. To facilitate the discussion, some terms are defined as follows. Notice that the match fields of a flow entry may have different lengths.

Assume that entry_P is the parent of entry_Q, Lp is the length of entry_P, and P(i) is the ith bit of entry_P. Then the following three conditions hold: (a) L_P < L_Q; (b) For all i(1 < i < L_P), P(i) = Q(i); (c) There is no Q’ such that L_P < L_Q’ < L_Q; and Q’(i) = Q(i) for all i(1 < i < L_P).
Entry_P is identical to entry_Q if same actions are executed for the matched packet.

If P is identical to Q, Q is a redundant flow entry. Assume that Q matches a flow. Then the flow will match P as well by the definition. If Q is removed from the flow table, P becomes the matched entry. As P and Q have the same actions, removing Q makes no difference. Note that the technique is general enough that it can be used with any entry lookup algorithm regardless of the type of the flow table.

3.2.2. QM-Based Mask Extension

The second technique exploits the flexibility offered by the TCAM hardware. TCAM allows arbitrary mask, in other words, the bits of 1s or 0s do not require to be continuous.

Table 2 shows an example of mask extension. E₁ and E₂ both have the same action of ‘Forward to 1’. It is possible to combine E₁ and E₂ into one single entry with the prefix of 1100 and mask of 1101. The 0 at bit 3 in the mask allows combining E₁ and E₂ into a same entry. The aggregated version of the original flow table with the mask extension technique is shown in the right-hand side. The table size has been reduced to 3 from 5.

Table 2. An example of mask extension.

Note that the mask extension is equivalent to the logic minimization problem [20]. The problem is that ‘given a set of entries with the same action, find a set of minimal covers.’ Such logic minimization problem [33] is a non-deterministic polynomial (NP) complete problem, and there exist mainly three kinds of methods used for its solution.

Karnaugh mapping [34]: It is simple but when the number of variables is larger than six, it becomes very complex.
Quine–McCluskey (QM) algorithm [23,24,25]: It is functionally identical to Karnaugh mapping, but the tabular form makes it more efficient to be used with a computer algorithm, supporting any number of variables. It also provides a deterministic method checking if the logical function is minimal.
Espresso logic minimizer [35,36]: It can produce a solution fast but cannot guarantee optimal result.

Here the QM algorithm is employed for mask extension. Algorithm 1 shows the proposed entry aggregation scheme with the QM algorithm-based mask extension. Here, E(l,a) is the set of original entries having the same length of l and action of a. A(l,a) is the result of QM algorithm.

Algorithm 1. Entry aggregation with mask extension

//n is the number of original entries

//m is the number of entries having the same length and action

1: Begin

2: Input entry[i]

3: for i from 0 to n

4: if have same entry[i].l and entry[i].a

5: move to E(l,a)

6: End for

7: for i from 0 to m

8: A(l,a) = QMminimize(E(l,a))

9: end for

10: remove E(l,a)

11: install A(l,a)

12: end

An example of the proposed mask extension scheme is shown below. In Table 3, there are 11 entries with different actions. After selecting the entries having the same action, the entry aggregation problem is simplified into the following minimization function:

F (A, B, C, D) = \sum E (0, 1, 3, 4, 5, 6, 7, 8, 9) .

(8)

Table 3. An example table.

Step 1: find prime implicants (P_i). Here all minterms are placed in the minterm table as shown in Table 4, and Stage I is to combine the minterms. If two terms vary by only a single bit, that bit is replaced with a dash (-). Stage II is the result of Stage I, and Stage III is the result of combining the minterms in Stage II. ‘/’ indicates if the entry is combined in the next stage.

Table 4. Minterm table.

According to Table 4, all the prime implicants are shown as follows:

P_{1} = \sum E (0, 3, 9, 5) = B D, P_{2} = \sum E (8, 5) = A C D, P_{3} = \sum E (6, 8) = A B^{'} C, P_{4} = \sum E (7, 6) = A B^{'} D^{'}, P_{5} = \sum E (4, 0) = A^{'} B C^{'}, P_{6} = \sum E (1, 7) = B^{'} C^{'} D^{'}, P_{7} = \sum E (1, 4) = A ’ C ’ D ’ .

(9)

Step 2: find essential prime implicants (P_e).

From Table 4, none of the minterms can be combined any further. At this point, the table of essential prime implicant is constructed as in Table 5.

Table 5. The essential prime implicant table.

In order to find the essential prime implicants, each column needs to be checked whether there exists only one ‘*’. If a column has only one ‘*’, the minterm can be covered by only one prime implicant. Then this prime implicant is essential. According to Table 5, P₁ is the only essential prime implicant.

As P₂ can be covered by P₁ and P₃, same as P₃, P₄, P₅, P₆, and P₇, they are not essential. In this example, the essential prime implicants cannot handle all the minterms (only E₀, E₃, E₅, and E₉ are covered). Therefore, other prime implicants are combined with P₁ to get the final result as:

F (A, B, C, D) = P_{1} + P_{2} + P_{4} + P_{5} = B D + A C D + A B^{'} D^{'} + A^{'} B C^{'} .

(10)

At last the final Table 6 is obtained as follows:

Table 6. The final table.

The number of entries is aggregated from 11 to 6, and the compression ratio is 6/11 = 0.545.

3.3. Hidden Markov Model-Based Prediction

The goal of the proposed scheme based on HMM is to dynamically predict the popularity of the flow entries as accurately as possible and update the ExTable accordingly. After the flow entries are aggregated, the popularity of the flow entries are estimated periodically and the entries deemed to be popular are moved to ExTable. Note that the size of ExTable, n_p, is smaller than (1 − C)∙N_MFT where C and N_MFT are the TCAM compression ratio and the size of entire MFT, respectively.

The match frequency of an entry indicates the popularity. For this, a counter, M, is associated to each flow entry, which is activated when the entry is installed in the flow table. M is the number of matches before the prediction occurs. Note that M may not be the only indicator of the popularity, and thus HMM is employed to estimate the probability of the flow entries to be matched in the near future.

The interarrival time of the flow is assumed to follow exponential distribution, and therefore the number of arrivals can be modeled using Poisson distribution. The probability of a flow arriving in a given interval of time of Δt is predicted as follows. Assume that flow arrival occurs at any time. The probability of k arrivals in Δt is given by:

P (k, Δ t) = \frac{e^{- λ Δ t} {(λ Δ t)}^{k}}{k!} .

(11)

The probability of at least one arrival in the interval Δt is given as:

P (k \geq 1, Δ t) = 1 - e^{- λ Δ t} .

(12)

The probability of a flow arriving in the next interval is computed as the mean value of the entire period from the beginning. It is:

P (k = 1, Δ t) = e^{- λ Δ t} \cdot (λ Δ t) .

(13)

HMM is effective to predict the probability of an observed sequence with the given triple λ = (π,A,B). Let H = {H₀, H₁, …, H_n} be a set of hidden states, where H_i is defined as the number of time segments (Δt seconds per segment) a flow entry has not been matched from the initial state. For example, if there was no match for last 3Δt seconds, H₃ = 3Δt. If there was a match, H₃ = 0. Note that the hard timeout period (T_H) is preset, and an entry is forced to be evicted if no match occurs during T_H. Therefore, there will be n (=T_H/Δ_t) segments before an entry is finally evicted, and H_n is the last state. Since O = {O₀, O₁, …, O_n} is a set of observable states of any entry, O_i indicates if the entry is matched in ith segment. It has two values; ‘1’ for a successful match, ‘0’ no match. Figure 4 shows the structure of the HMM of the proposed scheme.

Figure 4. The state diagram of the proposed HMM.

With the HMM the probability of an observed sequence is found with the given parameters, A, B, and π. For a flow entry, there exist N (=n + 1) hidden states in its life time. Assume that there exist m time segments before the prediction occurs. Then the (n − m) × (n − m) hidden state transition probability matrix, A, and the (n − m) × 2 emission probability matrix, B, are obtained as follows:

A = \begin{matrix} \begin{matrix} [\begin{matrix} \begin{matrix} 1 - e^{- λ Δ t} & e^{- λ Δ t} \\ \begin{matrix} 1 - e^{- λ Δ t} \\ \begin{matrix} ⋮ \\ \begin{matrix} 1 - e^{- λ Δ t} \end{matrix} \end{matrix} \end{matrix} & \begin{matrix} 0 \\ \begin{matrix} ⋮ \\ \begin{matrix} 0 \end{matrix} \end{matrix} \end{matrix} \end{matrix} & \begin{matrix} \begin{matrix} \begin{matrix} \begin{matrix} 0 \\ e^{- λ Δ t} \end{matrix} \\ ⋮ \end{matrix} \\ 0 \end{matrix} & \begin{matrix} \begin{matrix} \begin{matrix} \begin{matrix} \dots \end{matrix} \\ ⋮ \end{matrix} \\ ⋮ \\ \dots \end{matrix} & \begin{matrix} \begin{matrix} \begin{matrix} \begin{matrix} 0 \\ 0 \end{matrix} \\ ⋮ \end{matrix} \\ e^{- λ Δ t} \end{matrix} \end{matrix} \end{matrix} \end{matrix} \end{matrix}] \end{matrix} \end{matrix},

(14)

B = \begin{matrix} \begin{matrix} [\begin{matrix} 0 & 1 \\ 1 & 0 \\ \begin{matrix} ⋮ \\ 1 \end{matrix} & \begin{matrix} ⋮ \\ 0 \end{matrix} \end{matrix}] \end{matrix} \end{matrix} .

(15)

In order to compute the likelihood probability, P(O|λ) of O = {O₀, O₁, …, O_(n−m)}, the forward algorithm is adopted. Note that the probability of the observation sequence is obtained in which the value of O_i is 1 and the predicted last observable state is O_(n−m). Then P(O|λ) is calculated as follows.

Initialization. Each cell of the forward algorithm, α_t(j), represents the probability of hidden state H_j after checking the first t observations with the given λ. It expresses the probability as:

$α_{t} (j) = P (o_{0}, o_{1}, \dots, o_{t}, H_{t} = q_{j} | λ) .$

(16)

Here H_t = q_j denotes that the tth hidden state in the sequence is q_j. Then the initial probability is calculated as follows:

$α_{0} (j) = π_{j} b_{j} (o_{0}), 1 \leq j \leq (n - m) .$

(17)

If one or more matches occur during m segments, the following is obtained according to Equation (12) and the sum of π is 1.

$π_{0} = 1 - e^{- λ Δ t}, π_{j} = \frac{e^{- λ Δ t}}{n - m}$

(18)

If no match occurs, then:

$π_{0} = \frac{1 - e^{- λ Δ t}}{(m + 1)}, π_{j} = \frac{e^{- λ Δ t}}{(m + 1) (n - m)} .$

(19)
Recursion. For the hidden state sequence, H, and the observation sequence, O, the likelihood of O is estimated as:

$P (O | H, λ) = \prod_{i = 0}^{n - m} P (o_{i} | q_{i}) = \prod_{i = 0}^{n - m} b_{i} (o_{i}) .$

(20)

While introducing π and A, it follows:

$P (H | λ) = \prod_{i = 0}^{n - m} P (q_{i} | q_{i - 1}) = π_{q_{0}} \prod_{i = 1}^{n - m} a_{q_{i - 1} q_{i}},$

(21)

where q₀ is the initial state.
Then the joint probability of H and O is:

$P (O, H | λ) = P (O | H, λ) P (H | λ) = \prod_{i = 0}^{n - m} b_{i} (o_{i}) \times π_{q_{0}} \prod_{i = 1}^{n - m} a_{q_{i - 1} q_{i}} .$

(22)

Therefore, the total probability of the observations can be calculated by summing up all possible hidden state sequences:

$P (O | λ) = \sum_{H} P (O, H | λ) = \sum_{H} [\prod_{i = 0}^{n - m} b_{i} (o_{i}) \cdot π_{q_{0}} \prod_{j = 1}^{n - m} a_{q_{j - 1} q_{j}}] .$

(23)

For each given state q_j at time t, the probability α_t(j) is estimated as:

$α_{t} (j) = b_{j} (O_{t}) \sum_{i = 0}^{n - m} α_{t - 1} (i) a_{i j},$

(24)

where 0 ≤ j ≤ (n − m) and 1 ≤ t ≤ n.
Termination. According to Equations (23) and (24), the probability of O is estimated as:

$P (O | λ) = \sum_{j = 0}^{n - m} α_{t} (j) .$

(25)

Through the forward algorithm, the P(O|λ) of each observed state can be computed. In order to decide the popularity of each entry, the value of O_i is set to 1 to record the probability of successful match.

Large P(O|λ) means that the entry has high match probability. The number of popular flow entries selected in each flow table is denoted as k. Periodic ExTable update occurs in every ΔT. Here the popularity, ω, is decided based on the match frequency, M, and match probability, P(O|λ), as follows:

ω = \frac{M}{m} + P (O | λ) .

(26)

After calculating the popularity, the flow entries of k largest popularity are moved to the ExTable. The number of flow entries in ExTable is n_t∙k if there exist n_t flow tables (n_t∙k ≤ n_p). If there exist several flow entries of the same value, the one of the longest remaining life time is selected. The proposed periodic entry selection scheme is depicted in Algorithm 2.

Algorithm 2. Selection operation of popular entry

1: Begin

2: create ExTable using the saved memory

3: set ΔT, k

4: flowEntry.M[i][j] = 0

5: flowEntry.P[i][j] = 0

6: flowEntry.ω[i][j] = 0

7: while 0 < Δt < ΔT

8: for i from 0 to n_t −1

9: for j from 0 to n_f −1

10: input O, λ = (π,A,B)

11: initial α₀(j) by Equation (17), π by Equations (18) and (19)

12: compute α_t(j) by Equation (24)

13: calculate flowEntry.P[i][j] by Equation (25)

14: count flowEntry.M[i][j]

15: calculate flowEntry.ω[i][j] by Equation (26)

16: end for

17: end for

18:end while

19: if (Δt == ΔT)

20: fori from 0 to n_t − 1

21: sort flowEntry. ω[i][j] from large to small

22: forj from 0 to k

23: select flowEntry[i][j]

24: end for

25: end for

26: end if

27: update flowEntry[i][j] into ExTable

28: end

3.4. Flow Processing Time

The queuing model [19,37] of the proposed ExTable scheme is shown in Figure 5 where each of the nodes is considered as an M/M/1 queue. Table 7 is the list of variables used in the model.

Figure 5. The queuing model of the proposed scheme.

Table 7. The variables used in the queueing model.

According to Little’s Law [37], the flow processing time in the system can be calculated as T = N/λ, where N is the average number of flows in the system and λ is the arrival rate of the flows. For obtaining the flow processing time of the proposed scheme, T_F, firstly the average number of flows in the system, N_F, needs to be obtained. In the following formula R_m is the match rate of the ExTable, and ρ_p, ρ_f, and ρ_e are the utilization of ExTable, flow table, and AEU, respectively. Note that there exists a direct path from each flow table to AEU. The rate of sending packets directly to AEU from the flow tables are {Rd₀, Rd₁, …, Rd_{n_t}₋₂}, where n_t ≥ 2. In order to calculate N_F, the average number of flows in ExTable, flow table 0, flow table 1, flow table (n_t−1) and AEU, N_p, N_f₀, N_f₁, N_f_{(n_t−1)}, and N_e, should be estimated. They are calculated as follows:

N_{p} = \frac{ρ_{p}}{1 - ρ_{p}} = \frac{λ n_{p} t_{f}}{1 - λ n_{p} t_{f}},

(27)

N_{f 0} = \frac{ρ_{f 0}}{1 - ρ_{f 0}} = \frac{(1 - R_{m}) λ n_{f} t_{f}}{1 - (1 - R_{m}) λ n_{f} t_{f}},

(28)

N_{f 1} = \frac{ρ_{f 1}}{1 - ρ_{f 1}} = \frac{(1 - R_{m} - R d_{0}) λ n_{f} t_{f}}{1 - (1 - R_{m} - R d_{0}) λ n_{f} t_{f}},

(29)

N_{f (n_{t} - 1)} = \frac{ρ_{f (n_{t} - 1)}}{1 - ρ_{f (n_{t} - 1)}} = \frac{[(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] λ n_{f} t_{f}}{1 - [(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] λ n_{f} t_{f}}, n_{t} \geq 2,

(30)

N_{e} = \frac{ρ_{e}}{1 - ρ_{e}} = \frac{λ t_{e}}{1 - λ t_{e}} .

(31)

The average number of flows in the system, N_F, is as follows:

\begin{array}{l} N_{F} = N_{p} + N_{f 0} + & N_{f 1} + \dots + N_{f (n_{t} - 1)} + N_{e} \\ = λ (\frac{n_{p} t_{f}}{1 - λ n_{p} t_{f}} + \frac{(1 - R_{m}) n_{f} t_{f}}{1 - (1 - R_{m}) λ n_{f} t_{f}} + \frac{(1 - R_{m} - R d_{0}) n_{f} t_{f}}{1 - (1 - R_{m} - R d_{0}) λ n_{f} t_{f}} + \dots \\ + \frac{[(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] n_{f} t_{f}}{1 - [(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] λ n_{f} t_{f}} + \frac{t_{e}}{1 - λ t_{e}}) . \end{array}

(32)

Note that there exist one ExTable, n_t flow tables and one AEU. As a result, the flow processing time, T_F, is obtained as follows:

\begin{array}{l} T_{F} = \frac{n_{p} t_{f}}{1 - λ n_{p} t_{f}} + & \frac{(1 - R_{m}) n_{f} t_{f}}{1 - (1 - R_{m}) λ n_{f} t_{f}} + \frac{(1 - R_{m} - R d_{0}) n_{f} t_{f}}{1 - (1 - R_{m} - R d_{0}) λ n_{f} t_{f}} + \dots \\ + \frac{[(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] n_{f} t_{f}}{1 - [(1 - R_{m}) - \sum_{i = 0}^{n_{t} - 2} R d_{i}] λ n_{f} t_{f}} + \frac{t_{e}}{1 - λ t_{e}} . \end{array}

(33)

The queueing model of the flow processing time for ExTable is used later in computer simulation. The proposed scheme is simulated and compared with the existing schemes in the following section.

4. Performance Evaluation

In this section computer simulation is conducted to evaluate the TCAM compression ratio, prediction accuracy, and match rate of the proposed approach.

4.1. Simulation Environment

The simulation is conducted on Intel Core i5 process, 3.2 GHz PC with 8GB RAM, and Matlab R2014a. The flows used in the simulation are generated following exponential distribution with λ = 1. A virtual SDN environment is built with Floodlight controller, Open vSwitch, and end nodes emulated by Mininet. Here the Floodlight controller is linked to an Open vSwitch, and two end nodes are connected to the switch. The performance of the proposed scheme is compared with the original MFT approach and existing schemes [20,21].

For testing the proposed entry aggregation scheme, eight different numbers of flow entries are generated randomly. The number of entries is varied from 100 to 800. For obtaining the flow processing time for MFT and ExTable, the queueing models of them are used in the simulation. The number of flow entries in a flow table is set to 20, while the size of ExTable is n_t∙k. The service rate of the ExTable, flow table, and AEU are set to be 1.67, 1.67, and 10 [19,21], respectively. Table 8 and Table 9 list the parameter values and the factors used in the simulation, respectively.

Table 8. The parameter setting of the simulation.

Table 9. The factors in the simulation.

The match rate and prediction accuracy of the proposed scheme are examined with various values of the operation parameters, and the flow processing time is compared with the existing schemes. Note that small processing time implies higher match rate and prediction accuracy.

The simulation is run 1000 times to achieve dependable result. The accuracy of the proposed HMM-based prediction is then calculated as:

A c c u r a c y = \frac{N_S u c c e s s}{N_E x p e r i m e n t} \times 100 % .

(34)

Here, N_Success indicates the number of selected flow entries actually having the largest number of matches, and N_ Experiment is the whole number of simulations. The match rate of the proposed ExTable scheme is obtained as follows:

M a t c h r a t e = \frac{N_M a t c h}{N_F l o w s} \times 100 % .

(35)

Here, N_Match is the number of incoming flows matching the ExTable, while N_Flows is the total number of incoming flows.

4.2. Simulation Results

Figure 6 shows the average compression ratios with eight different numbers of flow entries. In order to achieve dependable result, the simulation is run 1000 times for each number of flow entries generated randomly. Here compression ratio is the number of entries after reduction to that of original entries [8,32]. Therefore, lower compression ratio means more entries are aggregated. The pruning alone reduces the table size by almost 25%. The proposed scheme (pruning + QM) shows the best compression ratio among the four schemes compared.

Figure 6. The compression ratios of four different schemes.

Figure 7 shows the prediction accuracy of the proposed HMM-based scheme with different sizes of flow table. The number of flows is 500 and ΔT (ExTable update period) is set to 20. In order to see the effect of T_H, it is set to 50, 70, and 90. Observe that, as the number of entries of the flow table increases, the prediction accuracy increases. Also, the accuracy slightly decreases as T_H increases because many obsolete flow entries may remain in the flow tables as time goes by.

Figure 7. The prediction accuracy vs size of flow table.

Figure 8 compares the prediction accuracy of the proposed HMM-based scheme with the scheme based on only match frequency. Here the number of flows is 500 and (ΔT,T_H) are set to (20,70). Notice from the figure that the proposed scheme shows consistently higher accuracy, while the accuracy of both the schemes increases as the number of flow entries grows. Note that high match frequency may not be the only indicator of the popularity of an entry. Even though an entry has low match frequency, it could be a popular one if its match probability is high.

Figure 8. The comparison of prediction accuracies vs size of flow table.

In Figure 9, the match rate of popular flow entries is obtained with different ΔT of 20, 40, and 60. According to the result of Figure 6, the space saving due to the compression is about 45%. Since n_t∙k ≤ 0.45∙20n_t, n_t and k are set to 5 and 9, respectively. Observe from the figure that the match rate decreases as ΔT increases, which demonstrates that ExTable needs to be updated as quickly as possible. The proposed scheme achieves nearly 68% match rate when ΔT is 20. As ΔT becomes large, some entries may not be popular any more. There exists a trade-off between the match rate and update cost.

Figure 9. The match rates of ExTable with different ΔT.

Figure 10 evaluates the match rate of ExTable with different number of popular entries per flow table (k). Note that k can be varied from 1 to 9 each allowing different compression ratio. Here n_t is set to 5 and ΔT to 40, while k is varied from 3 to 9. Notice from Figure 10 that the match rate increases as k grows as expected. A proper number of popular entries needs to be selected which allows high match rate while requiring reasonable operation cost.

Figure 10. The match rates with different k.

In Figure 11, the match rate is obtained with different number of flow tables, n_t, of 3, 5, and 7. Here ΔT is 40 and k is 7. It is clear that the match rate of ExTable decreases as n_t grows since the number of total flow entries becomes larger. Therefore, k needs to be set properly considering various design parameters.

Figure 11. The match rates with different n_t.

Figure 12 compares the flow processing time of the three schemes, original MFT, MILE, and the proposed Agg-ExTable. Here (ΔT,R_m,n_p,n_f,k,n_t) of the proposed scheme are set to (20,0.68,35,20,7,5) using the data obtained from the simulation. The rate of using the direct path, Rd_n, is decided randomly between 0 and 0.1. Observe that the proposed Agg-ExTable scheme achieves much smaller flow processing time than the other schemes as the ExTable provides an express forwarding path for a large portion of incoming flows. The difference gets more significant as the flow arrival rate increases. Note that high arrival rate means high network load, and the flow processing time of all the schemes grows as the network load increases. Another important merit of the proposed scheme is that the processing time is almost constant regardless of the load unlike the other schemes. Figure 13 shows the flow processing times with different settings of (40,0.6,35,20,5,7). The proposed scheme consistently outperforms the other schemes.

Figure 12. The comparison of flow processing times with varied arrival rate.

Figure 13. The comparison of flow processing times with varied arrival rate.

5. Conclusions

In this paper a novel flow management scheme has been proposed for MFTs of SDN switches. In the proposed Agg-ExTable scheme, the flow entries in the MFT are periodically aggregated by applying the pruning and Quine–Mccluskey algorithm. Utilizing the memory space saved by the aggregation, ExTable is constructed, keeping popular flow entries and allowing early match with the incoming flows. The proposed scheme is able to save about 45% TCAM space of MFT and efficiently decrease the flow processing time through the express forwarding path provided by the front-end ExTable. Popular flow entries are selected from the flow tables using the HMM, where popularity is decided based on the match frequency and match probability. Computer simulation revealed that the proposed scheme significantly outperforms the existing schemes in terms of flow processing time.

In the future, we plan to investigate the approach further reducing the memory space used for ExTable. Various parameters are involved in the proposed scheme. A formal model will be developed with which proper parameter values can be decided for the given condition. In addition, the match rate of the ExTable will be further improved with more sophisticated techniques such as machine learning and fuzzy theory in the selection of popular entries. The proposed approach will also be tested with a real test bed for various operational conditions and SDN environments.

Author Contributions

Conceptualization, C.W.; Methodology, C.W. and H.Y.Y.; Software, C.W.; Validation, C.W., and H.Y.Y.; Formal Analysis, C.W.; Investigation, H.Y.Y.; Resources, H.Y.Y.; Data Curation, C.W.; Writing–Original Draft Preparation, C.W.; Writing–Review & Editing, H.Y.Y.; Visualization, C.W.; Supervision, H.Y.Y.; Project Administration, H.Y.Y.; Funding Acquisition, H.Y.Y.

Funding

This work was partly supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(MSIT) (No.2016-0-00133, Research on Edge computing via collective intelligence of hyper-connection IoT nodes), Korea, under the National Program for Excellence in SW supervised by the IITP(Institute for Information & communications Technology Promotion)(2015-0-00914), Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2016R1A6A3A11931385, Research of key technologies based on software defined wireless sensor network for real time public safety service, 2017R1A2B2009095, Research on SDN-based WSN Supporting Real-time Stream Data Processing and Multi-connectivity), the second Brain Korea 21 PLUS project.

Conflicts of Interest

The authors declare that there is no conflict of interest.

References

McKeown, N.; Anderson, T.; Balakrishnan, H.; Parulkar, G.; Peterson, L.; Rexford, J.; Shenker, S.; Turner, J. OpenFlow: Enabling innovation in campus networks. ACM SIGCOMM Comput. Commun. Rev. 2008, 38, 69–74. [Google Scholar] [CrossRef]
Kreutz, D.; Ramos, F.M.V.; Verissimo, P.E.; Rothenberg, C.E.; Azodolmolky, S.; Uhlig, S. Software-defined networking: A comprehensive survey. Proc. IEEE 2015, 103, 14–76. [Google Scholar] [CrossRef]
Stevens, M.; Ng, B.; Streader, D.; Welch, I. Global and local knowledge in SDN. In Proceedings of the 2015 International Telecommunication Networks and Applications Conference (ITNAC), Sydney, NSW, Australia, 18–20 November 2015. [Google Scholar]
Feamster, N.; Rexford, J.; Zegura, E. The road to SDN. Queue 2013. [Google Scholar] [CrossRef]
Feamster, N.; Rexford, J.; Zegura, E. The road to SDN: an intellectual history of programmable networks. ACM SIGCOMM Comput. Commun. Rev. 2014, 44, 87–98. [Google Scholar] [CrossRef]
Open Networking Foundation, SDN in the Campus Environment. Available online: https://www.opennetworking.org/images/stories/downloads/sdn-resources/solution-briefs/sb-enterprise-campus.pdf (accessed on 3 April 2019).
Wang, S.; Li, D.; Xia, S. The problems and solutions of network update in SDN: A survey. In Proceedings of the 2015 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Hong Kong, China, 26 April–1 May 2015. [Google Scholar]
Chao, T.-Y.; Wang, K.; Wang, L.; Lee, C.-W. In-switch dynamic flow aggregation in software defined networks. In Proceedings of the 2017 IEEE International Conference on Communications (ICC), Paris, France, 21–25 May 2017. [Google Scholar]
Yu, M.; Wundsam, A.; Raju, M. NOSIX: A lightweight portability layer for the SDN OS. ACM SIGCOMM Comput. Commun. Rev. 2014, 44, 28–35. [Google Scholar] [CrossRef]
Berde, P.; Gerola, M.; Hart, J.; Higuchi, T.; Kobayashi, M.; Koide, T.; Lantz, B.; O’Connor, B.; Radoslavov, P.; Snow, W.; et al. ONOS: Towards an open, distributed SDN OS. In Proceedings of the third workshop on Hot topics in software defined networking, Chicago, IL, USA, 22 August 2014. [Google Scholar]
Bannour, F.; Souihi, S.; Mellouk, A. Distributed SDN control: Survey, taxonomy, and challenges. IEEE Commun. Surv. Tutorials 2018, 20, 333–354. [Google Scholar] [CrossRef]
Jain, S.; Kumar, A.; Mandal, S.; Ong, J.; Poutievski, L.; Singh, A.; Venkata, S.; Wanderer, J.; Zhou, J.; Zhu, M.; et al. B4: Experience with a globally-deployed software defined wan. In Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM, Hong Kong, China, 12–16 August 2013. [Google Scholar]
Kempf, J.; Bellagamba, E.; Kern, A.; Jocha, D.; Takacs, A.; Skoldstrom, P. Scalable fault management for OpenFlow. In Proceedings of the 2012 IEEE International Conference on Communications (ICC), Ottawa, ON, Canada, 10–15 June 2012. [Google Scholar]
Scott-Hayward, S.; O’Callaghan, G.; Sezer, S. SDN security: A survey. In Proceedings of the 2013 IEEE SDN For Future Networks and Services (SDN4FNS), Trento, Italy, 11–13 November 2013. [Google Scholar]
Hu, Z.; Wang, M.; Yan, X.; Yin, Y.; Luo, Z. A comprehensive security architecture for SDN. In Proceedings of the 18th International Conference on Intelligence in Next Generation Networks, Paris, France, 17–19 February 2015. [Google Scholar]
Open Networking Foundation, Openflow Switch Specification Version 1.3.0. Available online: https://www.opennetworking.org/images/stories/downloads/sdn-resources/onf-specifications/openflow/openflow-spec-v1.3.0.pdf (accessed on 3 April 2019).
Open Networking Foundation, The Benefits of Multiple Flow Tables and Ttps. Available online: https://www.opennetworking.org/images/stories/downloads/sdn-resources/technical-reports/TR_Multiple_Flow_Tables_and_TTPs.pdf (accessed on 3 April 2019).
Leng, B.; Huang, L.; Wang, X.; Xu, H.; Zhang, Y. A mechanism for reducing flow tables in software defined network. In Proceedings of the 2015 IEEE International Conference on Communications (ICC), London, UK, 8–12 June 2015. [Google Scholar]
Ozcevik, Y.; Erel, M.; Canberk, B. Spatio-Temporal Multi-Stage OpenFlow Switch Model for Software Defined Cellular Networks. In Proceedings of the Vehicular Technology Conference (VTC Fall), Boston, MA, USA, 6–9 September 2015. [Google Scholar]
Liu, H. Reducing routing table size using ternary-cam. In Proceedings of the HOT 9 Interconnects. Symposium on High Performance Interconnects, Stanford, CA, USA, 22–24 August 2001. [Google Scholar]
Wu, Z.; Jiang, Y.; Yang, S. An Efficiency Pipeline Processing Approach for OpenFlow Switch. In Proceedings of the Local Computer Networks (LCN), Dubai, United Arab Emirates, 7–10 November 2016. [Google Scholar]
Iizawa, T. Fast tracker performance using the new “variable resolution associative memory” for ATLAS. In Proceedings of the Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), Anaheim, CA, USA, 27 October–3 November 2012. [Google Scholar]
Quine, W.V. The problem of simplifying truth functions. Am. Math. Mon. 1952, 59, 521–531. [Google Scholar] [CrossRef]
Quine, W.V. A way to simplify truth functions. Am. Math. Mon. 1955, 62, 627–631. [Google Scholar] [CrossRef]
McCluskey, E.J. Minimization of Boolean functions. Bell Syst. Tech. J. 1956, 35, 1417–1444. [Google Scholar] [CrossRef]
Baum, L.E.; Petrie, T. Statistical inference for probabilistic functions of finite state Markov chains. Ann. Math. Stat. 1966, 37, 1554–1563. [Google Scholar] [CrossRef]
Baum, L.E.; Eagon, J.A. An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology. Bull. Am. Math. Soc. 1967, 73, 360–363. [Google Scholar] [CrossRef]
Baum, L.E.; Petrie, T.; Soules, G.; Weiss, N. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann. Math. Stat. 1970, 41, 164–171. [Google Scholar] [CrossRef]
Bevilacqua, V.; Daleno, D.; Cariello, L.; Mastronardi, G. Pseudo 2D Hidden Markov Models for face recognition using neural network coefficients. In Proceedings of the 2007 IEEE Workshop on Automatic Identification Advanced Technologies, Alghero, Italy, 7–8 June 2007. [Google Scholar]
Jurafsky, D.; Martin, J.H. Speech and Language Processing. Available online: http://www.cs.colorado.edu/~martin/SLP/Updates/1.pdf (accessed on 20 May 2019).
Liu, A.X.; Meiners, C.R.; Torng, E. TCAM Razor: A systematic approach towards minimizing packet classifiers in TCAMs. IEEE/ACM Trans. Networking (TON) 2010, 18, 490–500. [Google Scholar] [CrossRef]
Luo, S.; Yu, H. Fast incremental flow table aggregation in SDN. In Proceedings of the Computer Communication and Networks (ICCCN), Shanghai, China, 4–7 August 2014. [Google Scholar]
Brayton, R.K.; Hachtel, G.D.; McMullen, C.; Sangiovanni-Vincentelli, A. Logic Minimization Algorithms for VLSI Synthesis; Springer Science & Business Media: Dordrecht, The Netherlands, 1984. [Google Scholar]
Karnaugh, M. The Map Method for Synthesis of Combinational Logic Circuits. Trans. Am. Inst. Electr. Eng. Part I Commun. Elect. 1953. [Google Scholar] [CrossRef]
Rudell, R.L. Multiple-Valued Logic Minimization for PLA Synthesis. Available online: https://www2.eecs.berkeley.edu/Pubs/TechRpts/1986/ERL-86-65.pdf (accessed on 20 May 2019).
Holdsworth, B.; Woods, C. Digital Logic Design; Elsevier: Amsterdam, The Netherlands, 2002. [Google Scholar]
Adan, I.; Resing, J. Queueing Theory; Eindhoven University of Technology: Eindhoven, The Netherlands, 2002. [Google Scholar]