Instantaneous Networking Service Availability for Disaster Recovery

: Networking services may be broken down in a disaster situation while people in the disaster area(s) strongly demand networking services for both communication and information sharing among users. This requires the fast restoration of networking services to fulﬁl the demand–supply gap. Although there are a number of studies on restoring communication and networking in disasters, few studies have explicitly examined the service availability during the temporary and partial recovery process of network restoration. From the perspective of users in the disaster area, it is important to be able to communicate or share information with people whenever they want/need to do so. Therefore, partial and local recovery of the networking services also plays an important role for improving service availability in the disaster situations. To assess the restoration effectiveness of networking services with a measure of user satisfaction level, we propose to use instant networking service availability (I-NSA), a novel metric, and we examine the effectiveness of networking service restoration solutions using the metric. I-NSA allows us to clearly express the instant availability of networking services that drastically changes with the elapsed time from the disaster occurrence in disaster areas. This paper examines the effective improvement of I-NSA when Local-and-Instant Clouds (LI-Clouds) are applied to the disaster situation. LI-Cloud has been designed and practically developed to provide deployable networking services to users. We verify that LI-Cloud enables signiﬁcant improvement on the I-NSA performance in the fast restoration of networking services.


Information Sharing Systems for Disaster Resilience
In addition to communication systems, the information sharing system is also important for supporting people in emergencies, with functionalities such as warning, estimation of damage, safety confirmation, rescue guidance, etc.
Many efforts have been devoted to the development of information sharing systems for disaster situations and emergencies. Usuda et al. [20] introduced an information sharing platform, named the SIP4D system, that facilitates mutual information exchange among various information systems. The SIP4D system has an objective of realizing mutual information exchange among various systems. It employs an onsite response team to collect information and identifies the needs at the site of the disaster. The collected information is transmitted to the SIP4D information system, which manages the information and can release the information to public.
Furthermore, Uchida et al. [21] introduced a real-time information sharing system based on Twitter. The developed system provides location information and nearby information to users. Aydin et al. [22] proposed an information supporting system that allows users to actively play a role in disaster management.

Study of Statistical Communication Features, System Availability, and Reliability
Lambiotte et al. [11] investigated the statistical properties of communication networks that support phone calls and text messages. Based on the introduction of geographical home location, the study shows that the probability that two customers communicate with each other over the network decreases with the distance between the customers, following a gravity model [23].
It is observed that residents in the disaster situation tend to gather at the designated places such as shelters or evacuation centers [12]. Zhao et al. [13] examined the relationship between evacuation population size, emergency shelter capacity, and evacuation time. The proposed model was considered to facilitate decision-making on the position of emergency shelters and the allocation of evacuees to the shelters.
The availability of a communication system can be generally measured by mean time between failures (MTBF) and mean time to repair (MTTR). The availability can be computed as A = MTBF/(MTBF + MTTR). Meza et al. [9] studied the reliability of the data center. For a large system such as a data center, an entity or component instead of the whole system may have failures. The MTBF-and MTTR-based availability measurement can apply to measure the availability of a component or entirety of a system. Dominiak et al. [8] introduced the reliability of communication pairs in IEEE 802.16-based mesh networks. It is shown that selecting a reliable path by avoiding poor links is important for the network to achieve higher data rate and low bit error rate (BER).
Hazra et al. [24] introduced a resource allocation scheme in order to ensure the good communication quality in terms of latency and packet delivery ratio. The network resources include data mules and communication towers. The basic idea involves that each shelter is served by at least one network resource and involves trying to minimize the end-to-end network latency between the volunteer and control station. The proposed scheme increases the packet delivery ratio and reduces the communication latency compared with the resource-unplanned system. Nakamura [25] showed the results that the fixed telephone system did not have a good performance in terms of service connectivity under a large earthquake situation. However, social networking service (SNS) communication via mobile phone had high connectivity performance, since the control for packet communication was independent with that of voice communication.
In summary, a number of studies have addressed the communication systems and information sharing systems that support the capability of disaster resilience. However, few of the previous studies explicitly addressed the effectiveness of prompt and partial recovery of networking services and the speed of network restoration in the disaster situation according the communication features and population distribution. Moreover, it is difficult to utilize the conventional metrics on system availability and reliability to assess the effectiveness of temporary and partial recovery of networking services. Considering communication and population distribution in a disaster situation, local and partial recovery of networking services should also facilitate communication among users. This paper explicitly addresses the effectiveness of user service restoration procedure before the network is fully recovered, and examines the instant availability of networking service in a disaster situation. We evaluate the instant availability of networking service based on a local cloud system that enables the joint provision of communication support and information support for users in the disaster situation.

Conventional Restoration Process
A network or cloud service such as Facebook or Twitter basically relies on the Internet. Theses global services confront with breakdown in emergencies. The restoration process of global services is a procedure that allows servers and routers to work again or to be replaced. Hence, the restoration process may take a few days or even a few months when there is large-scale damage in a disaster.
In the conventional restoration method, both connections to the local infrastructure nodes (such as BSs) and global network are required to exist or to be recovered for communication between users.

System Overview
An LI-Cloud system that is introduced in this paper provides networking services such as messengers and voice communications. Hence, even when there are no global networking services, the launching of an LI-Cloud allows users in the cloud to communicate with each other.
Multiple LI-Clouds will be deployed at different places in a disaster region. The correlation of launching times may exist among multiple LI-Clouds according to their deployment sequence. When multiple clouds are deployed independently and in parallel, their launching times are independent of each other. When the LI-Clouds are deployed in sequence, the launching times are correlated in a time sequence. The global connection of each LI-Cloud is also carried out during the restoration. Figure 1 shows an example of a resilient network based on LI-Cloud service in disaster areas. In the example, there are three areas-A, B, and C. In conventional restoration, users can communicate with other users only when both local network and global network connections are recovered. By utilizing LI-Cloud, as shown in Figure 1, a user is able to communicate with other users in the same local area before the connection to the global network is recovered. When a disaster area is recovered, the users in the area are able to communicate with the users in other areas that are connected to the global networks.

LACS: Locally Accessible Cloud System
LACS (Locally Accessible Cloud System) is a portable LI-Cloud system that we have developed with an aim of providing conventional Internet services even when the Internet connectivity is not available. LACS attempts to deliver cloud service anytime anywhere and in various network environments, even in the devastated disaster situation [10].
As shown in Figure 2, an LACS server is comprised of a compact server, Wi-Fi access point, battery, and peripheral devices. As shown in the figure, functions offered by an LACS include important notices, a bulletin board, and messaging. These functions support information broadcasting, information sharing, and bidirectional, voice, and video communication among users in the area surrounding the LACS. A user is able to access the service using his/her network device, such as a smartphone, over Wi-Fi functionality. An LACS enables us to handle large size contents in the form of text, voice, moving picture, or still image. LACS collaborates with other systems on the Internet, such as a system for a disaster countermeasure. An LI-Cloud system involves the software and hardware development that support the networking services as a standalone server for the local user scope [26,27]. LACS offers a tool for communication in the disaster-affected area that provides various services such as: • Operation without power feed • Delivery information by local government • Collection and delivery of disaster information • Offer person-to-person bidirectional communication tool

Important Notice
This page is mainly used for delivery of important information from authorized organizations like local government and medical centers.

Bulletin board
This page is mainly used for sharing information among users of LACS

Messaging
Message function enables us to exchange message, still image and video between users like our friends, family members.

Registration
User registration is necessary for users to post articles and exchange messages on LACS.
Functions offered by LACS In addition to providing prompt services in emergencies, the LACS is also available for use as a local information system in a normal situation, such as information sharing in schools and shopping malls. The major energy consumption at the server of a LI-Cloud system involves the energy consumption at the Wi-Fi Access Point regrading communication activities, and the energy consumption at the CPU and data processing. The LI-Cloud adopts a battery power support and a backup battery. The coverage of the LI-Cloud is associated with a Wi-Fi-based communication system. There are several ways to extend the coverage of LI-Cloud such as employing multiple LI-Clouds and enabling interconnection of multiple LI-Clouds [14]. Furthermore, in emergencies, deploying the LI-Clouds to shelters facilitates the effective coverage of users when they are gathered at the shelters.

Networking Service Availability for Users
Let X and Y denote communication source and destination, and Ax and Ay denote the local areas where X and Y reside. For example, in Figure 1 the set of areas where X may reside is expressed as AxSet = {A, B, C}. The set of areas in which Y possibly resides is denoted by AySet = {A, B, C, G}, where G refers to globally connected areas. Hence, the combinational cases of X and Y location are represented by the Cartesian product set of AxSet and AySet, being denoted by AxySet = AxSet × AxySet = {AA, AB, AC, AG, BA, BB, BC, BG, CA, CB, CC, CG}. Due to the same communication requirement, we combine the cases of AB and BA, BC and CB, and AC and CA to simplify the number of cases to analyze. Hence, we simply express the cases of X and Y location combination, by Axy = {AA, AB, AC, AG, BB, BC, BG, CC, CG}. In this expression and what follows, AB refers to the fact that either X is in A and Y in B, or X is in B and Y in A. The same expression rule is applied to cases of AC and BC.
To explicitly measure the speed of network restoration procedure, we adopt the metric of the instant networking service availability (I-NSA) between X and Y at a time t as the probability that X is able to communicate with Y at the time t. We note that the I-NSA of a user X in a disaster region correlates with the integrated availability of local servers and global connectivity of the local areas that X and Y reside.
In a conventional way, the conditions that X can communicate with Y are: (1) the area where X resides is locally recovered with both local and the Internet connections, and (2) the area where Y exists is provided with both local and global network connections.
We can generally write the I-NSA of X at time t, for x in Ax, and y in Ay, as where P local (Ax, t) refers to the probability that the local server at Ax is launched, P local (Ay, t) refers to the probability that the local server at Ax is launched, P global (Ax, t) refers to the probability that the global connection at Ax is launched, and P global (Ay, t) refers to the probability that the global connection at Ay is launched.

Networking Service Availability Improvement by Utilizing LI-Clouds
We at first suppose that an LI-Cloud is immediately available for local users when the conventional networks breaks down. In later sections, we will discuss in detail about the network restoration process by taking into account the launching time of LI-Clouds. By utilizing the LI-Clouds that are immediately available for local users in an emergency, the conditions that X can communicate with Y are (1) X and Y reside in the same recovered LI-Cloud area even when there is no global network connection, or (2) the area where X resides is locally recovered with both local and global network connection, and the area where Y exists is provided with both local and Internet connections. Suppose that an LI-Cloud is immediately available for local users in an emergency. By utilizing an LI-Cloud, we have when Ax = Ay. Consequently, the improvement ratio on I-NSA by utilizing an immediately available LI-Cloud can be expressed as when X and Y are in the same area-that is, for the two users in the same area, the improvement of I-NSA is in reverse proportion to the product of the probabilities of the global connectivity in the areas where X and Y reside. We consider that the improvement of I-NSA by utilizing the LI-Cloud is mainly due to that LI-Clouds provide the alternative networking service of global networks if the LI-Cloud can be quickly deployed before the global network restoration.

Restoration of Networking Service Availability for Users in Multiple Areas
In what follows, we analyze the the temporal and spatial features of restoration process in detail regarding the networking service availability for users. We take the time consumption in launching each LI-Cloud and the global network into account. The list of symbols is shown in Table 1. The basic scenarios studied in the evaluation involve three areas, as shown in Figure 1. There can be various scenarios in a disaster situation. It may also be difficult to adopt a few scenarios to represent all others. Hence, we try to employ the scenarios that reflect the basic functionalities and fundamental operations in network restoration, involving (1) local connections and their restoration and (2) global connections and their restoration.

Conventional Restoration Process without LI-Cloud
Let PC A∼A (t) denote the probability of successful communication connection between X and Y, which are in A. Since any user needs to connect with another user through the global network, we then have In the same way, we can express and PC C∼C (t) = P C (t) · P CG (t).
Let PC A∼B (t) denote the probability of successful communication between X and Y in case either X is in A and Y in B, or X is in B and Y in A. We have To summarize the previous statement and equation, a user in A is able to communicate with B only when local servers are launched in both areas A and B, and both areas A and B are connected with the global network.
In case either X is in A and Y is in C, or X is in C and Y is in A, we have the probability of successful communication between X and Y as PC A∼C (t) = P A (t) · P AG (t) · P C (t)P CG (t).
In case X is in A and Y is a global destination, we have the probability of successful communication between X and Y as PC A∼G (t) = P A (t) · P AG (t).
In case either X is in B and Y is in C, or X is in C and Y is in C, we have the probability of successful communication between X and Y as In case X is in B and Y is a global destination, we have the probability of successful communication between X and Y as PC B∼G (t) = P B (t) · P BG (t).
In case X is in C and Y is in C or is a global destination, we have the probability of successful communication between X and Y as PC C∼G (t) = P C (t) · P CG (t).
Let CaseAA denote that one user in A attempts to communicate with one user in A, and the communication succeeds. Let PC CaseAA denote the probability for (CaseAA). We have In the same way, we can express that PC CaseCG = F CG · P C∼G (t).
Consequently, the availability probability of X ∼ Y is then expressed as

Network Restoration Based on LI-Cloud
By employing LI-Cloud, network restoration is enabled by two phases. One is the local restoration process. Another is the global restoration process. The local restoration process refers to the launching of LI-Cloud service for the local users. The procedure allows LI-Clouds to be deployed and configured in an isolated area after a disaster. The deployment of the LI-Cloud server can be carried out with cars, by a human, or just by switching on a predeployed server. The global restoration process refers to the procedure for connection to global services, such as social media tools. The analytical scenario adopted in this section assumes that there is no interconnection among LI-Clouds in different areas.
The differences of the proposed LI-Cloud method from the conventional networking service provision are as follows.
In case X is in A and Y is in A, we can write the probability of communication connection between X and Y as PL A∼A (t) = P A (t).
In case X is in B and Y is in B, we can write the probability of communication connection between X and Y as PL B∼B (t) = P B (t).
In case X is in C and Y is in C, the probability of communication connection between X and Y is Let the probability of CaseBG be PL CaseBG . We have Let the probability of CaseCC be PL CaseCC . We have

PC CaseCC
The emergence probability of CaseCC

PC CaseAB
The emergence probability of CaseAB

PC CaseAC
The emergence probability of CaseAC

PC CaseBC
The emergence probability of CaseBC

PC CaseAG
The emergence probability of CaseAG

PC CaseBG
The emergence probability of CaseBG

PC CaseCG
The emergence probability of CaseCG

F BB Probability that X is in B and Y is in B
F CC Probability that X is in C and Y is in C F AB Probability that either X is in A and Y is in B, or X is in B and Y is in A F AC Probability that either X is in A and Y is in C, or X is in C and Y is in A F AG Probability that X is in A and Y is in G F BC Probability that either X is in B and Y is in C, or X is in C and Y is in B F BG Probability that X is in B and Y is in G F CG Probability that X is in C and Y is in G Let CaseCG denote that one user in C communicates with one user over the global Internet. Let Probability of (CaseCG) be PL CaseCG . We have PL CaseCG = F CG · P C∼G (t). (21) The availability probability of X ∼ Y is then expressed as Consequently, the average improvement on the I-NSA by utilizing multiple local clouds can be expressed as To summarize the previous statement and equation, the average improvement of I-NSA for any two users, in all cases of their location, depends on the user location, and the probabilities of the local and global connectivity at the time.

Numerical Examination
Numerical examination is conducted to assess the performance of the proposed scheme in comparison with the conventional scheme. The aim is to identify the effectiveness of I-NSA in measuring networking service restoration that includes partial and/or temporal restoration phases on the way to the complete restoration. We then verify the I-NSA improvement by utilizing the restoration method based on LI-Cloud.
We attempted to allow the evaluation to involve the following three aspects. First, potential users are widely distributed in the disaster region, which comprises multiple local areas. A user in a local area may communicate with other users who are inside the local area, in the different local areas, or outside of the disaster region. Second, probability of communication occurrence among users depends on the distance between the users [23]-that is, a user communicates more with people in the nearby places rather than the people faraway. Third, partial and temporal restoration of a networking service can support the communication among users nearby.
Based on the aspects above, we set up the following evaluation scenarios. The disaster region consists of three local areas denoted by A, B, and C. A user in the disaster region tries to communicate with other users either in the same local area or outside the areas. We assume the probability of a user communication occurrence as shown in Table 2, in which the probabilities were derived based on the gravity model that a user has a higher probability to communicate with other users nearby than with users who are far away [23]. More specifically, we suppose that the probability of communication between two users is in reverse proportion to the distance between them. Let the probability of the emergence of communication between two users in A be 0.2. Suppose that the average distance between two users in A and B is about 2.8 times of the distance between two users in A. Then, we set the probability of communication between two users in A and B as 0.025. Although there can be various parameter configurations in evaluation, we employ the setup in Table 2 to represent the basic features that are involved in the evaluation aspects.
In what follows, we adopt notations of Ta, Tb, and Tc to denote the average launching time of LI-Clouds A, B, and C, respectively. T AG , T BG , and T CG denote the average launching time of global network connections at LI-Clouds A, B, and C. We consider two types of scenarios regarding the deployment of local and global networking services, as shown in Figure 3. The first setup mimics the independent deployment of networking services, allowing the average launching times of LI-Clouds A, B, and C to be independent and similar to each other. We call this deployment scenario the parallel deployment of networking services. The second scenario setup adopts sequential deployment of LI-Clouds and allows their launching time to be one after another with a certain time interval between each other. We call this deployment scenario the sequential deployment of networking services. The choice of the restoration methods depends on the disaster situation and the available resources, which include a number of available operators and vehicles to deliver and operate LI-Clouds. When there are enough resources, the parallel distribution of LI-Clouds for launching local networking services is possible. In the case of insufficient resources, however, the distribution of LI-Clouds has to be conducted in a sequential manner that takes much more time than the parallel distribution. In the evaluation, two types of launching time distributions are employed, including Poisson and Uniform distributions.    Figure 6 shows the evaluation results of utilizing LI-Clouds with uniform distribution launching time and sequential deployment. The I-NSA of using LI-Cloud achieves a 10.6% increase on average in networking service availability, and a maximum of 32.4% increase in I-NSA. Figure 7 shows the evaluation results of utilizing LI-Clouds with Poisson distribution of launching time and sequential deployment. Using LI-Cloud achieves an approximate 19% increase on average in networking service availability, and a maximum of 59.9% in I-NSA.
Better performance achieved by utilizing LI-Clouds is attributed to the following two facts. (1) The LI-Cloud enables the local networking services even though there is no global networking service.
(2) The fast launching of the LI-Cloud is enabled before the restoration of global networking services.
Networking service availability LI-Cloud scheme Conventional scheme

State Machine Diagram in Networking Service Recovery
The state transition diagrams of each LI-Cloud server and each user in a disaster area are illustrated in Figures 8 and 9, respectively. The set of states at an LI-Cloud server includes the state of no networking service, the state of local networking service launching, the state of local service area extension, and the state of global network service launching. The set of states at a user includes the state of no networking service, the state of local networking service, the state of connection to extended service area, and the state of global networking service available. For an LI-Cloud server, the state of no networking service refers to the state in which the server has not started its networking services yet. The state of local networking service launching is the state in which the LI-Cloud start to provide local networking services to users. The state of local service area extension refers to the state in which the LI-Cloud is connected to other LI-Clouds and enables the users in the LI-Cloud to communicate with users at other local clouds. The state of global network service launching refers to the state in which the LI-Cloud is connected to the Internet. For a user, the state of no networking service refers to the state in which the user has no networking services available. The state of local networking service is the state in which the user obtains the local networking services from its LI-Cloud server. The state of connection to an extended service area refers to the state in which the users are able to communicate with users at other local clouds. The state of global network service available refers to the state in which the user is able to connect to the Internet.
The conditions of state transition of an LI-Cloud server are described in Table 3, in which the number of transitions corresponds to the Figure 8. Moreover, the state transition conditions of a user are illustrated in Table 4, in which the number of transitions corresponds to the Figure 9.

Limited Communication Range of LI-Cloud
The deployable LI-Cloud, such as LACS, may adopt wireless communication based on WLANs that are widely used. The limited communication range puts a constraint to the networking service coverage. On the other hand, considering the fact that the users often gather to the emergency center in a disaster situation, the WLAN-based communication is expected to not only be easily used by users but also to promptly provide networking services with a certain coverage efficiency.

Extended System
The detailed analyses of I-NSA have been carried out for the LI-Cloud-based networks. When the disaster areas involve a much higher number of LI-Clouds, the clustering of LI-Clouds is a possible way to extend this work for coordination among many Li-Clouds to provide networking servicesthat is, two or three LI-Clouds form a cluster when these LI-Clouds enable the local service coverage and have been connected with each other. Multiple (two or three) clusters are able to further construct a larger coverage cluster. In this way, the evaluation model discussed is expected to be applied for a large number of LI-Clouds.

Conclusions
In this paper, we examined the network restoration by utilizing instant networking service availability (I-NSA). Compared with conventional methods for measuring the networking availability, I-NSA is an integrated factor that involves both network connectivity and user distribution among various areas. I-NSA has its unique features that facilitate the design of novel networking services such as Local-and-Instant Clouds for disaster resilience. First, the I-NSA measures the networking service availability from the temporary and partial recovery of the network. Second, I-NSA allows us to assess the functionality and effectiveness of the network restoration process before the network is fully recovered. We especially examined the I-NSA of using LI-Clouds in a disaster situation. The evaluation results showed the significant improvement of networking service availability with utilization of LI-Clouds in various deployment scenarios of LI-Clouds.
Funding: This research is in part supported by an SIP Program on national resilience (disaster prevention and mitigation).