The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications

Younus, Othman Isam; Hassan, Navid Bani; Ghassemlooy, Zabih; Zvanovec, Stanislav; Alves, Luis Nero; Le-Minh, Hoa

doi:10.3390/s21082826


The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications †

by Othman Isam Younus ¹,*, Navid Bani Hassan ², Zabih Ghassemlooy ¹, Stanislav Zvanovec ³, Luis Nero Alves ⁴ and Hoa Le-Minh ¹

¹ Optical Communications Research Group, Faculty of Engineering and Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, UK

² Institute of Photonics, University of Strathclyde, Glasgow G1 1XQ, UK

³ Department of Electromagnetic Field, Faculty of Electrical Engineering, Czech Technical University in Prague, 16627 Prague, Czech Republic

⁴ Instituto de Telecomunicações and Departamento de Electrónica, Telecomunicações e Informática, Universidade de Aveiro, 3810-193 Aveiro, Portugal

* Author to whom correspondence should be addressed.

† This paper is an extended version of our paper published in Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Haigh, P.A.; Minh, H.L. An Artificial Neural Network Equalizer for Constant Power 4-PAM in Optical Camera Communications. In Proceedings of the 2020 12th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP), Porto, Portugal, 20–22 July 2020.

Sensors 2021, 21(8), 2826; https://doi.org/10.3390/s21082826

Submission received: 10 March 2021 / Revised: 29 March 2021 / Accepted: 14 April 2021 / Published: 16 April 2021

(This article belongs to the Special Issue Visible Light Communication, Networking, and Sensing)


Abstract

In this paper, we propose and validate an artificial neural network-based equalizer for constant power 4-level pulse amplitude modulation in an optical camera communications system. We introduce new terminology to measure the quality of the communications link in terms of the number of row pixels per symbol N_pps, which allows a fair comparison considering the progress made in the development of current image sensors in terms of the frame rates and the resolution of each frame. Using the proposed equalizer, we experimentally demonstrate a non-flickering system using a single light-emitting diode (LED) with N_pps of 30 and 20 pixels/symbol for the unequalized and equalized systems, respectively. Potential transmission rates of up to 24.4 and 18.6 kbps are achieved with and without the equalization, respectively. The quality of the received signal is assessed using the eye-diagram opening and its linearity and the bit error rate performance. An acceptable bit error rate (below the forward error correction limit) and an improvement of ~66% in the eye linearity are achieved using a single LED and a typical commercial camera with equalization.

Keywords:

CP 4-PAM; optical camera communications; ANN equalizer

1. Introduction

Optical camera communication (OCC) systems, which are part of the optical wireless communications (OWC), leverage the use of off-the-shelf conventional, complementary metal-oxide-semiconductor (CMOS) image sensors (ISs) and standard light-emitting diodes (LEDs) as the receiver (Rx) and the transmitter (Tx), respectively.

The camera-based Rxs can capture intensity-modulated light signals from a range of LED light sources (e.g., traffic lights, advertising boards, signage, display screens, vehicle headlights and taillights, streetlights, etc.). The OCC technology, together with visible and infrared light transmission, could be used in different low data rate R_b applications, such as the Internet of Things (IoT) (e.g., as part of the fifth-generation wireless and beyond), motion capturing [1], intelligent transportation systems [2], indoor localization, security, virtual reality, and advertising [3]. The OCC Rx comprises a plurality of pixels (i.e., photodetectors (PDs)), where the signal strength of each pixel depends on the intensity of the incident light [4]. Each pixel can detect signals at different wavelengths over the visible range, e.g., red, green, and blue (RGB), hence offering parallel detection capabilities and an adaptive field of view (FoV) feature. In addition, the information transmitted from many light sources at different directions and locations via line-of-sight (LOS) [5,6], non-LOS, and/or a combination of both paths [7] can be captured using a single-pixel or a pixel-array IS-based Rx, thus resulting in a higher signal-to-noise ratio, improved mobility, and flexibility over a linkspan of up to hundreds of meters [8].

On the other hand, the IS requires a longer sampling duration and offers a lower number of quantization levels than PDs due to the light integration time (known as the exposure time T_exp) and the built-in analog-to-digital converter circuit [9]. The sequential-readout nature of the CMOS IS-based Rx allows each pixel row to capture the incident light at a different time, thus resulting in the so-called rolling shutter (RS) effect [9]. Note that the performance of visible light communication (VLC) with an IS-based Rx is limited mainly by the camera capabilities, i.e., the frame rate R_f, T_exp, and FoV. As a result, in OCC, the transmission bandwidth is rather low and limited to a few tens of kHz compared with PD-based VLC systems. A low data rate should not be seen as a problem, however, considering that there are many applications where a low R_b is not critical at all (e.g., IoT). Nevertheless, in OCC, a lower R_b may result in the flickering effect at the Tx [10,11]. In the IEEE 802.15.7m standard [12], different schemes have been proposed for OCC to mitigate flickering and to increase R_b [13]. For example, in [14], an optical orthogonal frequency division multiplexing VLC with a special IS-based Rx with a built-in PD-array was used to achieve a very high R_b of 55 Mbps. However, the fabrication process of the IS was too complex and, therefore, the IS is not commercially available. In [15,16], under-sampled frequency and phase shift on-off keying (OOK) modulation schemes were proposed to mitigate flickering in OCC with low R_b. In [17], Manchester coding was proposed to alleviate flickering in the RS mode, where it was shown that the link performance in terms of R_b deteriorated with the transmission range [8,17].

Moreover, an OCC link with under-sampled pulse amplitude modulation (PAM) with subcarriers was experimentally demonstrated, with R_b increased to 250 bps [18,19]. In addition, a multilevel-intensity modulation scheme for RS-based OCC with the frame rate R_f of 30 fps was proposed in [20] with R_b of 10 kbps over a link range of up to 2 m. Furthermore, a parallel transmission VLC system with color-shift-keying (i.e., RGB-LEDs of different colors) was reported in [21] with an overall R_b of 5.2 kbps. In [22], the concept of parallel transmission was demonstrated over a range of up to 60 m and with R_b of 150 bps. In [23], a 16 × 16 μLED array and a high-speed camera with R_f of 960 fps were used to achieve R_b of 122.88 kb/s.

In OCC systems, equalization methods can also be deployed to compensate for spatially and temporally induced dispersion. In [24], an OOK VLC link (a single LED) with a camera-based Rx and a dual equalization scheme to compensate for both spatial and temporal dispersion was reported with R_b increased up to 14.37 kb/s. The artificial neural network (ANN) architecture has also been proposed for post-equalization to combat non-linear impairments in OWC [25,26]. The use of an ANN-based equalizer is one of the remarkable solutions adopted in PD-based OWCs, wherein ANNs act as universal classifiers [27]. In [25], a 170 Mb/s OOK VLC link using an LED with a modulation bandwidth of 4.5 MHz and the ANN-based equalizer at the Tx was reported, where the superiority of ANN equalizers in mitigating intersymbol interference (ISI) was demonstrated compared with other equalization techniques. Note, in OCC with the ANN-based equalizer, the network needs to be trained only once for a range of T_exp, with the data being stored in a look-up table within the camera.

In [28], the variable transparent amplitude shape code (VTASC) scheme was experimentally evaluated for device-to-device (D2D) (i.e., smartphones) communications in the form of high-density modulation (HDM) with an ANN-assisted demodulator, and R_b of 2.66 Mbps over a 20 cm long transmission link was achieved. Note, the concept of D2D is one form of the multiple-input multiple-output system, where every pixel is transmitted and detected. Similarly, to allow transmission and reception of information under bad weather conditions, a convolutional neural network (CNN)-based OCC was proposed in [29]. The CNN was used for classification and recognition of LED patterns and to decode the transmitted data streams even under an unclear state, where the LED patterns are not visible to the camera due to blocking of the transmission path and/or weather conditions. In [30], an OCC link with an ANN-based decoder was reported to mitigate the gap-time effect between two adjacent frames, where OOK was transmitted using an RGB-LED with R_b of 47 kb/s. In [4], an OCC link using a single LED source and Manchester line coding with the non-return-to-zero format was reported with R_b of 14 kb/s.

In this work, the aim is to establish a flicker-free OCC system with improved R_b using a single LED and an ANN-based equalizer. The key contributions extended from our previous work [31] are:

Comprehensive and systematic investigation of the applicability of CP-PAM for LED- and camera-based VLC.
Development of a practical CP-PAM OCC prototype with a single Luxeon Rebel white LED (SR-01-WC310) and an IS (Thorlabs DCC1645C) as the Tx and the Rx, respectively.
Development of an efficient signal extraction algorithm for the RS-based OCC system.
Implementation of an ANN-based equalizer at the Rx to enhance the system performance.
Development of an experimental test-bed for the proposed system and its evaluation in terms of the Tx’s frequency, eye diagrams, and the bit error rate (BER) with and without the ANN-based equalizer.
Proposal of a new measurement metric for assessing the quality of the communications link in terms of the number of row pixels/symbol.

The remainder of the paper is organized as follows. Section 2 introduces the proposed CP PAM scheme, whereas Section 3 outlines the ANN equalizer model for IS-based OCC. The experimental setup is described in Section 4. Results and discussion are presented in Section 5. Finally, conclusions are given in Section 6.

2. Constant Power-PAM in RS-Based OCC System

The OCC system is mainly composed of a light source-based Tx with a normalized length (diameter) L, and a camera-based Rx, which is modeled using a single convex lens with a focal length f. The transmission speed in the RS-based OCC system is defined by the amount of information that can be captured by an image at the distance d, which depends on the acquired number of samples (i.e., pixel rows) and is given by [32]:

N_{row} = 2 f \times \tan\left(\frac{FoV}{2}\right) = 2 f \times \frac{L}{2 d},

(1)

where FoV is the angular field of view.

Note that the acquired N_row is related to the sampling frequency of the IS, known as the rolling rate of the IS, F_s (i.e., the frequency at which the row pixels are sampled at the image plane).

Therefore, the maximum frequency of the transmitted signal is limited to F_s/2 according to Nyquist’s theorem. The F_s value depends on the pixel clock and T_exp (i.e., the time that every sample (pixel) of the IS is exposed to the light). Note, T_exp acts as a moving-average filter [10,33] with the frequency resolution given by:

\Delta f = \frac{1}{T_{\exp}} = \frac{F_{s}}{N_{row}(d)},

(2)

F_s is defined in terms of the bandwidth of the transmitted signal f_Tx and the number of received pixels per symbol, N_pps, which is given by:

F_{s} = N_{pps} \cdot f_{Tx},

(3)

Note, (i) N_pps varies with the payload P_bit; and (ii) the maximum transmission distance is proportional to both Δf and the size (diameter) of the light source. A higher T_exp results in increased signal intensity levels and, therefore, a higher signal-to-noise ratio (SNR) at the cost of a reduced Rx bandwidth. With reference to Equations (1)–(3), a communications link can be established at low R_f but with flickering, which is due to the variation in the mean value of the light intensity over time periods long enough to be perceived by the human eye. This may occur when there are many consecutive symbols with the same logical state.
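As a rough numerical illustration of Equation (3) and the Nyquist limit, the following Python sketch computes N_pps for a given rolling rate and transmitter bandwidth. The function names are illustrative only, and the example values (F_s of 13.31 kHz and f_Tx of 220 to 1520 Hz) are the ones reported in Section 5.

```python
def pixels_per_symbol(rolling_rate_hz, tx_bandwidth_hz):
    """Rearranged Eq. (3): N_pps = F_s / f_Tx."""
    if tx_bandwidth_hz <= 0:
        raise ValueError("transmitter bandwidth must be positive")
    return rolling_rate_hz / tx_bandwidth_hz


def nyquist_limit_hz(rolling_rate_hz):
    """Maximum signal frequency that the row sampling can represent (F_s / 2)."""
    return rolling_rate_hz / 2.0


if __name__ == "__main__":
    F_S = 13.31e3  # rolling rate in Hz, as reported in Section 5
    for f_tx in (220, 320, 420, 520, 1120, 1520):
        print(f"f_Tx = {f_tx:5d} Hz -> N_pps = {pixels_per_symbol(F_S, f_tx):6.2f}")
    print(f"Nyquist limit = {nyquist_limit_hz(F_S):.0f} Hz")
```

For f_Tx of 1520 Hz this gives roughly 8.8 pixels per symbol, which is in line with the N_pps of about 8.7 quoted for the highest transmitter bandwidth.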

The flicker index is a relative measure of the cyclic variation in the output of various sources at given frequencies [34,35]. It considers the waveform of the light output and its amplitude, and is determined by dividing the area above the line of average light output by the total area under the light output curve for a single cycle, see Figure 1, and is given by:

Flicker index = \frac{area 1}{area 1 + area 2},

(4)

The flicker index has a range of 0 to 1.0, with 0 representing a steady light output level. Area 2 may be close to zero when the light output varies as periodic spikes, thus leading to a flicker index close to 1. Higher values indicate an increased possibility of noticeable flickering.
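Equation (4) can be evaluated numerically from a sampled waveform. The sketch below is a minimal illustration, assuming uniformly spaced, non-negative intensity samples that cover a whole number of cycles; the function name is hypothetical.

```python
def flicker_index(samples):
    """Eq. (4): area above the mean line divided by the total area under the curve.

    `samples` are uniformly spaced light-output values over whole cycles.
    """
    if not samples:
        raise ValueError("at least one sample is required")
    total_area = sum(samples)
    if total_area <= 0:
        raise ValueError("light output must have a positive total area")
    mean_level = total_area / len(samples)
    # Area 1: the part of the waveform lying above the average light output.
    area_above_mean = sum(max(s - mean_level, 0.0) for s in samples)
    return area_above_mean / total_area
```

A constant output gives an index of 0, whereas a 50% duty-cycle on-off waveform gives 0.5, which illustrates why long runs of identical OOK symbols are prone to visible flicker.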

To mitigate flickering, CP-PAM can be adopted to equalize the mean intensity value of all symbols, i.e., I_ave [18]. In CP-PAM, each PAM symbol is temporally divided into two equal chips: (i) the 1st chip carries the intensity of the PAM symbol I_S; and (ii) the 2nd chip carries the stabilization level, i.e., 2I_ave − I_S, see Figure 2. For example, a symbol with a level of “2” will be stabilized in the following chip with a level of “1” to keep the average power of every symbol equal, as clarified in Table 1.
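The two-chip construction can be sketched in a few lines of Python. This is only an illustration: the bit-pair-to-level mapping below is a hypothetical natural-binary mapping (Table 1 is not reproduced here), and I_ave of 1.5 is assumed for levels 0 to 3, consistent with the level “2” to level “1” example above.

```python
AVERAGE_LEVEL = 1.5  # I_ave for the four levels 0, 1, 2, 3

# Hypothetical mapping of bit pairs to PAM levels (not taken from Table 1).
BITS_TO_LEVEL = {(0, 0): 0, (0, 1): 1, (1, 0): 2, (1, 1): 3}


def cp_4pam_encode(bits):
    """Map bit pairs to CP 4-PAM chips: [I_S, 2*I_ave - I_S] per symbol."""
    if len(bits) % 2 != 0:
        raise ValueError("an even number of bits is required")
    chips = []
    for i in range(0, len(bits), 2):
        level = BITS_TO_LEVEL[(bits[i], bits[i + 1])]
        chips.append(float(level))                # 1st chip: symbol intensity
        chips.append(2 * AVERAGE_LEVEL - level)   # 2nd chip: stabilization level
    return chips
```

Every pair of chips sums to 2·I_ave, so the mean intensity stays at I_ave regardless of the data pattern.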

Note that, although the R_b efficiency of CP 4-PAM is reduced by half due to the stabilization level (which is also used for error detection), CP N-PAM offers a higher coding efficiency compared with Manchester coding [17].

3. ANN Equalizer

In RS-based OCC systems, the IS sampling process limits the available bandwidth and results in ISI at higher data rates, thus impacting the performance of the communications link. The ability to detect symbols with slow rise times may be impaired by the transitions between different illumination levels. Equalization is one option that can be adopted to mitigate the ISI. In a conventional equalizer, the ISI is estimated by training the filter coefficients using a training sequence. Alternatively, the ISI can be viewed as a classification problem, where class decision boundaries are created to classify symbols based on training [4]. Determining the optimal threshold boundaries in a practical channel can be seen as a nonlinear process, and consequently, the ANN-based equalizer with an adaptive algorithm can be employed to mitigate the ISI and, therefore, increase the data rate. Unlike in other communication systems, in OCC the ANN is trained only once for a specific exposure time, with the data being stored in a look-up table [25,26].

An ANN is an interconnected network of processing elements (neurons). Its use comprises two distinct phases: (i) the training phase, where the ANN estimates an input-output map between the received and training data to determine the weighted input from each neuron. The weighted values are updated in each training iteration until either the required performance is achieved or the entire training set is used; and (ii) the operation phase, where the ANN is deployed without knowledge of the dataset under test. The multilayer perceptron (MLP) is a popular ANN architecture, which has been shown to be highly effective in signal equalization [36]. It offers the ability to map any non-linear input-output sequence, provided there are sufficient neurons in the hidden layer(s) and the SNR is sufficiently high.

The MLP structure consists of at least three layers: (i) a single input layer x; (ii) (M − 1) hidden layers; and (iii) a single output layer y. The input layer (also called the observation vector) has the same structure as a conventional linear equalizer for sequential equalization, i.e., it is a tapped delay line o^{(m−1)} = [o_{1}^{(m−1)}, o_{2}^{(m−1)}, …, o_{N_{m−1}}^{(m−1)}], where N is the number of neurons, and m is the layer number. This is illustrated in Figure 3, where the weights w_{kn}^{(m)} relate the nth input to the kth neuron. Each neuron can be biased with a value C^{(m)}, which is in turn scaled by a threshold factor v_{k}^{(m)}.

The output o_{k}^{(m)} of the kth neuron is mapped via a non-linear activation function f(·) as given by [25]:

o_{k}^{(m)} = f\left(\sum_{n = 1}^{N_{m - 1}} w_{k n}^{(m)} o_{n}^{(m - 1)} + C^{(m)} v_{k}^{(m)}\right).

(5)

The output of each layer is usually connected to each of the neurons in the next layer, i.e., a fully connected mode. Therefore, using the observation vector o^{(m)} for the mth layer and the N_{m} × N_{m−1} connection matrix between layers m and m − 1, the output is given in the vector form by:

o^{(m)} = f\left(W^{(m)} o^{(m - 1)} + C^{(m)} v^{(m)}\right),

(6)

where W^{(m)} and v^{(m)} are given by:

W^{(m)} = \begin{bmatrix} w_{1}^{(m)} \\ w_{2}^{(m)} \\ \vdots \\ w_{N_{m}}^{(m)} \end{bmatrix},

(7)

v^{(m)} = {[v_{1}^{(m)} \; v_{2}^{(m)} \; \dots \; v_{N_{m}}^{(m)}]}^{T}.

(8)

Considering the N_{0} × 1 input vector and the N_{M} × 1 output vector, with o^{(0)} = x and o^{(M)} = y, the observation vectors o^{(m)} are given as follows:

x = {[x_{1} \; x_{2} \; \dots \; x_{N_{0}}]}^{T},

(9)

y = {[y_{1} \; y_{2} \; \dots \; y_{N_{M}}]}^{T}.

(10)

Therefore,

o^{(1)} = f\left(W^{(1)} x + C^{(1)} v^{(1)}\right),

(11)

o^{(2)} = f\left(W^{(2)} o^{(1)} + C^{(2)} v^{(2)}\right),

(12)

\dots

y = f\left(W^{(M)} o^{(M - 1)} + C^{(M)} v^{(M)}\right).

(13)
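The layer-by-layer recursion of Equations (11)–(13) can be written compactly in code. The following is a minimal sketch, not the network used in this work: the activation function is assumed to be tanh (the paper does not specify f), and the weights and threshold factors in any call are placeholders supplied by the caller.

```python
import math


def mlp_forward(x, weights, thresholds, activation=math.tanh, bias_constant=1.0):
    """Forward pass o^(m) = f(W^(m) o^(m-1) + C^(m) v^(m)) for m = 1..M.

    weights[m] is a list of rows (one row per neuron of layer m+1) and
    thresholds[m] holds the matching threshold factors v_k.
    bias_constant plays the role of C^(m), taken equal for all layers.
    """
    o = list(x)  # o^(0) = x
    for W, v in zip(weights, thresholds):
        if any(len(row) != len(o) for row in W) or len(W) != len(v):
            raise ValueError("layer dimensions do not match")
        o = [
            activation(sum(w_kn * o_n for w_kn, o_n in zip(row, o)) + bias_constant * v_k)
            for row, v_k in zip(W, v)
        ]
    return o  # o^(M) = y
```

Passing `activation=lambda z: z` reduces the network to a cascade of linear maps, which is a convenient way to check the matrix dimensions by hand.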

The MLP records its trained information in the weights w_{kn}^{(m)} and in the threshold factors v_{k}^{(m)}, since C^{(m)} is a constant for all layers (i.e., set as C^{(m)} = 1, m = 1, 2, …, M). Resilient back-propagation (RBP) is a supervised back-propagation (BP) training method, which updates the weights to converge more rapidly than the standard BP training technique [26]. Figure 3 depicts a single neuron for the case where the layers are interconnected with different weight coefficients. The RBP adjusts the MLP weights to reduce the error cost function E_k as given by [25]:

E_{k} = \| d_{k} - y_{k} \|^{2},

(14)

where d_k and y_k are the ideal and actual received symbols, respectively. It should be noted that, for the training sequence, d is known. Each iteration of the RBP algorithm has a dynamic step size, which is adapted based on the gradient of E_k.
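To make the idea of a dynamic step size concrete, the sketch below shows a generic sign-based resilient update for a single weight, in the style of the Rprop family of algorithms. It is not the training code used in this work; the growth and shrink factors (1.2 and 0.5), the step bounds, and the toy quadratic cost are assumptions chosen for illustration.

```python
def sign(value):
    """Return -1, 0, or 1 according to the sign of value."""
    return (value > 0) - (value < 0)


def resilient_step(weight, grad, prev_grad, step,
                   eta_plus=1.2, eta_minus=0.5, step_min=1e-6, step_max=50.0):
    """One resilient update; returns (new_weight, gradient_to_store, new_step)."""
    if grad * prev_grad > 0:
        # Same gradient sign as before: grow the step size.
        step = min(step * eta_plus, step_max)
    elif grad * prev_grad < 0:
        # Sign change (minimum overshot): shrink the step and skip this update.
        step = max(step * eta_minus, step_min)
        grad = 0.0
    weight -= sign(grad) * step
    return weight, grad, step


def minimise_quadratic(target=3.0, iterations=200):
    """Toy example: minimise E(w) = (w - target)^2 with the update above."""
    w, prev_grad, step = 0.0, 0.0, 0.1
    for _ in range(iterations):
        grad = 2.0 * (w - target)
        w, prev_grad, step = resilient_step(w, grad, prev_grad, step)
    return w
```

In a full MLP the same rule would be applied independently to every weight and threshold factor, with the gradients obtained by back-propagating the error of Equation (14).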

4. Experimental Setup

The schematic block diagram of the proposed OCC system is shown in Figure 4a. A pseudorandom binary sequence (PRBS) with a length of 2^{16} − 1 bits was generated using MATLAB, which was then up-sampled with n_samp of 50 and modulated using CP-PAM. The data were mapped to the corresponding symbols based on the output labels provided for 4-PAM, see Table 1. The PRBS s(t) was divided into sub-sequences, each containing P_bit effective symbols per packet, where P_bit depends on the transmitter bandwidth f_Tx. Each sub-sequence was encoded with a pre- and post-amble to form the Qth Tx packet, where Q represents the packet number and each packet consists of a 3-symbol pre-amble [1.5 1.5 1.5], a P_bit-symbol payload, and a 3-symbol post-amble [1.5 1.5 1.5].
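The packet structure described above can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, the payload is treated as a plain list of levels, and the up-sampling is modeled as a zero-order hold that repeats each level n_samp times, which is one possible reading of the up-sampling step.

```python
PREAMBLE = [1.5, 1.5, 1.5]
POSTAMBLE = [1.5, 1.5, 1.5]


def build_packets(payload_levels, payload_length):
    """Split the level sequence into payloads and frame each with the overhead."""
    if payload_length <= 0:
        raise ValueError("payload length must be positive")
    packets = []
    for start in range(0, len(payload_levels), payload_length):
        payload = list(payload_levels[start:start + payload_length])
        packets.append(PREAMBLE + payload + POSTAMBLE)
    return packets


def upsample(levels, n_samp=50):
    """Repeat every level n_samp times (zero-order hold)."""
    return [level for level in levels for _ in range(n_samp)]
```

Because the overhead level equals I_ave of the CP 4-PAM payload, adding the pre- and post-ambles does not change the average optical power of the packet.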

The symbols in the overhead signal (i.e., pre-amble and post-amble) were chosen to ensure a constant average optical power when compared with the payload. The signal was then sequentially uploaded onto an arbitrary waveform generator (AWG, AFG3252C, 240 MHz bandwidth), see Figure 4b. The Qth Tx packet was generated and uploaded at different f_Tx, and the AWG output was used for intensity modulation of a Luxeon Rebel LED (SR-01-WC310) with a peak wavelength of 630 nm and a linewidth of 118 nm. The modulated light was transmitted over a short LOS free-space channel (i.e., 50 cm).

At the Rx, a diffuser was used to scatter the light over the capturing area of the IS (Thorlabs DCC1645C, RS), and a standard T_exp of 2 ms was adopted in this study. The observed frames P^{Q}_{U \times V \times 3} at the output of the camera were processed off-line in MATLAB using Algorithms 1 and 2. In Algorithm 1, the data set z_{i}^{Q} was retrieved by accumulating the intensities of all pixels in each row. The received signal was then normalized to remove the DC component using 20 captured frames with no signal, see Figure 5a,b.
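The row accumulation and DC removal described above can be sketched in Python as follows. This is a simplified reading of the processing, not a transcription of the MATLAB implementation: frames are assumed to be nested lists of (R, G, B) tuples, and the DC component is assumed to be removed by subtracting the row-wise mean of the no-signal frames.

```python
def row_signal(frame):
    """Accumulate the intensities of all pixels (all color channels) in each row."""
    return [sum(sum(pixel) for pixel in row) for row in frame]


def remove_background(signal, background_frames):
    """Subtract the row-wise mean of frames captured with no signal."""
    if not background_frames:
        raise ValueError("at least one background frame is required")
    background_rows = [row_signal(frame) for frame in background_frames]
    count = len(background_rows)
    # Average each row position across all background frames.
    background = [sum(values) / count for values in zip(*background_rows)]
    return [s - b for s, b in zip(signal, background)]
```

Summing along each row exploits the rolling shutter: all pixels in a row are exposed at the same time, so the row sums form a one-dimensional time series with improved SNR.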

Algorithm 1 Signal Extraction Algorithm

Next, to ensure that a full packet was captured by the IS, Algorithm 2 was applied to select the optimum frame (i.e., one including both the pre- and post-ambles) for each Qth Tx packet per f_Tx, in which 10 frames P^{Q}_{U \times V \times 3} were captured for each Qth Tx packet to maintain the synchronization between the Tx and the Rx.

A resampling process was then applied to resize the signal length based on the packet size observed in pixels. Next, a correlation algorithm was used to maintain the synchronization between the transmitted Qth Tx packet and the received z_{cal}^{Q} signals, where a filtered version of z_{cal}^{Q} was simulated based on the encoded Qth packet using a moving average filter. Note, the window size of the filter was set to n_samp since it provided an optimal match with the observed signal. Next, z_cal in the vector form was applied to an MLP equalizer using an array of tapped-delay lines as previously described. The MLP used here included a single input layer, a single hidden layer, and a single output layer. All the key system parameters are listed in Table 2.
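Two of the steps above, the moving average filter used to simulate the received signal and the tapped delay line feeding the MLP input layer, can be sketched as follows. The function names and the handling of the first few samples are illustrative assumptions; the number of taps is left as a parameter, since it is listed with the system parameters in Table 2.

```python
def moving_average(samples, window):
    """Causal moving-average filter; shorter averages are used for the first samples."""
    if window <= 0:
        raise ValueError("window must be positive")
    filtered = []
    running_sum = 0.0
    for i, sample in enumerate(samples):
        running_sum += sample
        if i >= window:
            running_sum -= samples[i - window]  # drop the sample leaving the window
        filtered.append(running_sum / min(i + 1, window))
    return filtered


def tapped_delay_line(samples, taps):
    """Return the sliding windows of length `taps` presented to the input layer."""
    return [list(samples[i:i + taps]) for i in range(len(samples) - taps + 1)]
```

Applying `moving_average` with a window of n_samp to the up-sampled Tx packet yields the smoothed transitions that the correlation step can match against the observed row signal.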

Algorithm 2 Find the frame with full packet inclusion (i.e., includes both pre- and post-ambles)

5. Results and Discussion

The experimental work focused on deploying MLP-based equalization to mitigate the ISI due to the limited modulation bandwidth of the CMOS IS-based Rx. The measured and simulated frequency responses of the CMOS IS for T_exp of 2 ms are shown in Figure 6, which indicates that the obtained IS bandwidth (i.e., the 3 dB point) was 250 Hz. The mismatch between the measured and simulated responses was caused by aliasing due to the limited sampling frequency of the IS and the use of image compression techniques [38]. The CP 4-PAM encoded signal was then generated at different bandwidths f_Tx of up to 1520 Hz.

The captured frames at the Rx were processed with P_bit of up to 70 symbols per packet. Figure 7 illustrates examples of the captured frames and the processed signals for P_bit of 5, 10, 15, 20, 50, and 70, i.e., f_Tx of 220, 320, 420, 520, 1120, and 1520 Hz, respectively. Note, the width of the received Qth packet and the recorded F_s are 666 pixels and 13.31 kHz, respectively, based on the demodulated signal, see Figure 7. Increasing f_Tx decreases the number of received pixels for each CP 4-PAM symbol, thus reducing the quality of data transmission. The number of pixels utilized for each CP 4-PAM symbol is indicated in Table 3. For the link with the ANN-based equalizer deployed at the Rx side, the quality of the received signal was measured using the eye diagrams and the BER performance. As illustrated in the eye diagrams, see Figure 8, the eye-openings indicate the impact of the ISI on the received signal. Note, (i) the threshold levels can be differentiated for P_bit of 5 and 20 symbols, see Figure 8a,b, respectively, but not for P_bit of 50 and 70 symbols, see Figure 8c,d, respectively; (ii) five levels are shown in the eye diagrams, where one of the levels represents the packet overhead designed to maintain the same average power for CP 4-PAM; and (iii) the overhead level is removed at the Rx side using Algorithm 1.

An example of the transmitted and received signals with and without the equalizer for N_pps of 8.7 pixels per symbol is illustrated in Figure 9. The equalized signal at the Rx side shows a significant improvement in reducing the impact of the ISI on the received signal with minimal signal distortions.

The eye linearity of the received signals is measured based on the average amplitude levels and is given by [39]:

Eye linearity = \frac{\min (V_{up}, V_{mid}, V_{low})}{\max (V_{up}, V_{mid}, V_{low})},

(15)

where V_up, V_mid, and V_low are the average amplitude levels.
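Equation (15) is straightforward to evaluate once the three amplitude levels have been measured. A minimal sketch, with a hypothetical function name and made-up example values:

```python
def eye_linearity(v_up, v_mid, v_low):
    """Eq. (15): ratio of the smallest to the largest of the three amplitude levels."""
    levels = (v_up, v_mid, v_low)
    largest = max(levels)
    if largest <= 0:
        raise ValueError("amplitude levels must include a positive value")
    return min(levels) / largest
```

Three equal levels give a linearity of 1 (a perfectly linear 4-PAM eye), whereas levels of, say, 0.6, 1.0, and 0.8 give 0.6.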

Figure 10 shows the eye linearity of the received signals with respect to N_pps for the link with and without the ANN equalizer and for T_exp of 2 ms. Note, we have used N_pps, i.e., new terminology, for a fair comparison considering the progress made in the development of ISs. As shown, for the link with no equalizer, the eye linearity increases with N_pps, reaching a maximum level of 0.6 at N_pps of ~26, beyond which it drops rapidly and linearly with N_pps. However, with the ANN equalizer, the eye linearity is improved significantly for both the test and trained cases, reaching the optimal linearization of almost 1 at N_pps of 18 and remaining constant for N_pps > 18 (i.e., being independent of N_pps). Thus, the ANN equalizer shows an improvement of ~66% in the eye linearity for N_pps > 18 pixels/symbol (i.e., f_Tx < 920 Hz) for both the training and testing sets.

Next, we measured the BER as a function of N_pps for the link with and without the ANN equalizer, as illustrated in Figure 11. Also shown is the forward error correction (FEC) BER limit line of 3.8 × 10⁻³. Note, at the FEC limit, the N_pps value is reduced from 30 to 20 pixels per symbol for the links without and with the ANN equalizer, respectively, based on the test plot. Thus, the effective R_b (i.e., with no post- and pre-ambles) is estimated by:

R_{b} = 2 \frac{V}{N_{pps}} \cdot R_{f},

(16)

where V represents the number of pixel rows.
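Equation (16) can be evaluated directly, as in the short sketch below. The function name is illustrative, and the value V = 1024 used in the usage note is a hypothetical row count chosen only to show the scaling; it is not taken from Table 4.

```python
def effective_bit_rate(pixel_rows, n_pps, frame_rate):
    """Eq. (16): R_b = 2 * (V / N_pps) * R_f, in bit/s."""
    if n_pps <= 0:
        raise ValueError("N_pps must be positive")
    # V / N_pps symbols fit in one frame; each CP 4-PAM symbol carries 2 bits.
    return 2.0 * pixel_rows / n_pps * frame_rate
```

For example, `effective_bit_rate(1024, 20, 60)` evaluates to about 6.1 kbps, and the rate scales linearly with both the number of pixel rows and the frame rate, and inversely with N_pps.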

The effective R_b at the FEC limit for the cases with and without the ANN equalizer and for a range of IS resolutions is given in Table 4. It shows that, with the ANN equalizer, R_b of 24.4 and 12.2 kbps can be achieved for R_f of 60 and 30 fps, respectively, compared with R_b of 18.6 and 9.3 kbps for R_f of 60 and 30 fps, respectively, for the case of no equalizer.

6. Conclusions

We proposed an ANN-based equalization technique for a CP 4-PAM based OCC system. An experimental setup was developed to demonstrate non-flickering communications using a single light-emitting diode with a transmission rate R_b of 24.4 kbps. The quality of the received signals was measured based on the eye-diagram opening, the eye linearity, and the BER. We demonstrated the ability to mitigate the intersymbol interference and hence to transmit a signal with an acceptable BER (below the FEC limit) for N_pps of 30 and 20 for the unequalized and equalized systems, respectively. An improvement of ~66% in the eye linearity was achieved using a single LED and a typical commercial camera with the equalization technique. The limitations of the proposed system are determined by the system complexity, including the associative memories needed for the look-up table training data, as well as the IS resolution, gap-time, exposure time, and reading time.

Author Contributions

Conceptualization: O.I.Y., N.B.H. and Z.G.; investigation: O.I.Y. and N.B.H.; project administration: Z.G., S.Z., L.N.A. and H.L.-M.; software: O.I.Y. and N.B.H.; writing—original draft preparation, O.I.Y.; writing—review and editing, N.B.H., Z.G., S.Z., L.N.A. and H.L.-M. All authors have read and agreed to the published version of the manuscript.

Funding

One of the authors (Othman Younus) is funded by the Northumbria University PhD scholarship. The work is supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no 764461 (VISION), and the EU COST Action on Newfocus (CA19111).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Teli, S.R.; Zvanovec, S.; Ghassemlooy, Z. Performance evaluation of neural network assisted motion detection schemes implemented within indoor optical camera based communications. Opt. Express 2019, 27, 24082–24092.
2. Eso, E.; Ghassemlooy, Z.; Zvanovec, S.; Gholami, A.; Burton, A.; Hassan, N.B.; Younus, O.I. Experimental Demonstration of Vehicle to Road Side Infrastructure Visible Light Communications. In Proceedings of the 2019 2nd West Asian Colloquium on Optical Wireless Communications (WACOWC), Tehran, Iran, 27–28 April 2019; pp. 85–89.
3. Saeed, N.; Guo, S.; Park, K.-H.; Al-Naffouri, T.Y.; Alouini, M.-S. Optical camera communications: Survey, use cases, challenges, and future trends. Phys. Commun. 2019, 37, 100900.
4. Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Haigh, P.A.; Zvanovec, S.; Alves, L.N.; Minh, H.L. Data Rate Enhancement in Optical Camera Communications Using an Artificial Neural Network Equaliser. IEEE Access 2020, 8, 42656–42665.
5. Lee, H.-Y.; Lin, H.-M.; Wei, Y.-L.; Wu, H.-I.; Tsai, H.-M.; Lin, K.C.-J. RollingLight: Enabling Line-of-Sight Light-to-Camera Communications. In Proceedings of the 13th Annual International Conference on Mobile Systems, Applications, and Services, Florence, Italy, 18–22 May 2015; pp. 167–180.
6. Nguyen, T.; Islam, A.; Hossan, T.; Jang, Y.M. Current Status and Performance Analysis of Optical Camera Communication Technologies for 5G Networks. IEEE Access 2017, 5, 4574–4594.
7. Bani Hassan, N.; Ghassemlooy, Z.; Zvanovec, S.; Biagi, M.; Vegni, A.M.; Zhang, M.; Luo, P. Non-Line-of-Sight MIMO Space-Time Division Multiplexing Visible Light Optical Camera Communications. J. Lightwave Technol. 2019, 37, 2409–2417.
8. Eso, E.; Teli, S.; Bani Hassan, N.; Vitek, S.; Ghassemlooy, Z.; Zvanovec, S. 400 m rolling-shutter-based optical camera communications link. Opt. Lett. 2020, 45, 1059–1062.
9. Lauxtermann, S.; Lee, A.; Stevens, J.; Joshi, A. Comparison of Global Shutter Pixels for CMOS Image Sensors. In Proceedings of the 2007 International Image Sensor Workshop, Ogunquit, ME, USA, 7–10 June 2007.
10. Li, X.; Hassan, N.B.; Burton, A.; Ghassemlooy, Z.; Zvanovec, S.; Perez-Jimenez, R. A Simplified Model for the Rolling Shutter Based Camera in Optical Camera Communications. In Proceedings of the 2019 15th International Conference on Telecommunications (ConTEL), Graz, Austria, 3–5 July 2019; pp. 1–5.
11. Sturniolo, A.; Cossu, G.; Ciaramella, E.; Hassan, N.B.; Shou, Z.; Huang, Y.; Ghassemlooy, Z. ROI Assisted Digital Signal Processing for Rolling Shutter Optical Camera Communications. In Proceedings of the 2018 11th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP), Budapest, Hungary, 18–20 July 2018; pp. 1–6.
12. Nguyen, T.; Islam, A.; Yamazato, T.; Jang, Y.M. Technical Issues on IEEE 802.15.7m Image Sensor Communication Standardization. IEEE Commun. Mag. 2018, 56, 213–218.
13. Cahyadi, W.A.; Chung, Y.H.; Ghassemlooy, Z.; Hassan, N.B. Optical Camera Communications: Principles, Modulations, Potential and Challenges. Electronics 2020, 9, 1339.
14. Goto, Y.; Takai, I.; Yamazato, T.; Okada, H.; Fujii, T.; Kawahito, S.; Arai, S.; Yendo, T.; Kamakura, K. A New Automotive VLC System Using Optical Communication Image Sensor. IEEE Photonics J. 2016, 8, 1–17.
15. Roberts, R.D. Undersampled frequency shift ON-OFF keying (UFSOOK) for camera communications (CamCom). In Proceedings of the 2013 22nd Wireless and Optical Communication Conference, Chongqing, China, 16–18 May 2013; pp. 645–648.
16. Luo, P.; Ghassemlooy, Z.; Le Minh, H.; Tang, X.; Tsai, H. Undersampled phase shift ON-OFF keying for camera communication. In Proceedings of the Sixth International Conference on Wireless Communications and Signal Processing (WCSP), Hefei, China, 23–25 October 2014.
17. Nguyen, D.T.; Chae, Y.; Park, Y. Enhancement of Data Rate and Packet Size in Image Sensor Communications by Employing Constant Power 4-PAM. IEEE Access 2018, 6, 8000–8010.
18. Luo, P.; Ghassemlooy, Z.; Minh, H.L.; Tsai, H.; Tang, X. Undersampled-PAM with subcarrier modulation for camera communications. In Proceedings of the 2015 Opto-Electronics and Communications Conference (OECC), Shanghai, China, 28 June–2 July 2015; pp. 1–3.
19. Luo, P.; Zhang, M.; Ghassemlooy, Z.; Zvanovec, S.; Feng, S.; Zhang, P. Undersampled-Based Modulation Schemes for Optical Camera Communications. IEEE Commun. Mag. 2018, 56, 204–212.
20. Rachim, V.P.; Chung, W. Multilevel Intensity-Modulation for Rolling Shutter-Based Optical Camera Communication. IEEE Photonics Technol. Lett. 2018, 30, 903–906.
21. Hu, P.; Pathak, P.H.; Zhang, H.; Yang, Z.; Mohapatra, P. High Speed LED-to-Camera Communication using Color Shift Keying with Flicker Mitigation. IEEE Trans. Mobile Comput. 2020, 19, 1603–1617.
22. Luo, P.; Zhang, M.; Ghassemlooy, Z.; Minh, H.L.; Tsai, H.; Tang, X.; Png, L.C.; Han, D. Experimental Demonstration of RGB LED-Based Optical Camera Communications. IEEE Photonics J. 2015, 7, 1–12.
Griffiths, A.D.; Herrnsdorf, J.; Strain, M.J.; Dawson, M.D. Scalable visible light communications with a micro-LED array projector and high-speed smartphone camera. Opt. Express 2019, 27, 15585–15594. [Google Scholar] [CrossRef]
Liu, L.; Deng, R.; Chen, L. Spatial and Time Dispersions Compensation With Double-Equalization for Optical Camera Communications. IEEE Photonics Technol. Lett. 2019, 31, 1753–1756. [Google Scholar] [CrossRef]
Haigh, P.A.; Ghassemlooy, Z.; Rajbhandari, S.; Papakonstantinou, I.; Popoola, W. Visible Light Communications: 170 Mb/s Using an Artificial Neural Network Equalizer in a Low Bandwidth White Light Configuration. J. Lightwave Technol. 2014, 32, 1807–1813. [Google Scholar] [CrossRef]
Rajbhandari, S.; Ghassemlooy, Z.; Angelova, M. Effective Denoising and Adaptive Equalization of Indoor Optical Wireless Channel with Artificial Light Using the Discrete Wavelet Transform and Artificial Neural Network. J. Lightwave Technol. 2009, 27, 4493–4500. [Google Scholar] [CrossRef]
Rajbhandari, S.; Faith, J.; Ghassemlooy, Z.; Angelova, M. Comparative study of classifiers to mitigate intersymbol interference in diffuse indoor optical wireless communication links. Optik 2013, 124, 4192–4196. [Google Scholar] [CrossRef]
Willy Anugrah, C.; Yeon Ho, C. Smartphone camera-based device-to-device communication using neural network-assisted high-density modulation. Opt. Eng. 2018, 57, 1–9. [Google Scholar]
Islam, A.; Hossan, M.T.; Jang, Y.M. Convolutional neural network scheme–based optical camera communication system for intelligent Internet of vehicles. Int. J. Distrib. Sens. Netw. 2018, 14, 1550147718770153.
Liu, L.; Deng, R.; Chen, L.-K. 47-kbit/s RGB-LED-based optical camera communication based on 2D-CNN and XOR-based data loss compensation. Opt. Express 2019, 27, 33840–33846.
Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Haigh, P.A.; Minh, H.L. An Artificial Neural Network Equalizer for Constant Power 4-PAM in Optical Camera Communications. In Proceedings of the 2020 12th International Symposium on Communication Systems, Networks and Digital Signal Processing (CSNDSP), Porto, Portugal, 20–22 July 2020; pp. 1–6.
Nguyen, H.; Pham, T.L.; Nguyen, H.; Jang, Y.M. Trade-off Communication distance and Data rate of Rolling shutter OCC. In Proceedings of the 2019 Eleventh International Conference on Ubiquitous and Future Networks (ICUFN), Zagreb, Croatia, 2–5 July 2019; pp. 148–151.
Chau, J.C.; Little, T.D.C. Analysis of CMOS active pixel sensors as linear shift-invariant receivers. In Proceedings of the 2015 IEEE International Conference on Communication Workshop (ICCW), London, UK, 8–12 June 2015; pp. 1398–1403.
Lehman, B.; Wilkins, A.; Berman, S.; Poplawski, M.; Miller, N.J. Proposing measures of flicker in the low frequencies for lighting applications. In Proceedings of the 2011 IEEE Energy Conversion Congress and Exposition, Phoenix, AZ, USA, 16–21 September 2011; pp. 2865–2872.
DiLaura, D.L.; Houser, K.W.; Mistrick, R.; Stey, S.G. The Lighting Handbook Reference and Application, 10th ed.; Illuminating Engineering Society of North America (IES): New York, NY, USA, 2011; p. 247.
Rajbhandari, S.; Chun, H.; Faulkner, G.; Haas, H.; Xie, E.; McKendry, J.J.D.; Herrnsdorf, J.; Gu, E.; Dawson, M.D.; O’Brien, D. Neural Network-Based Joint Spatial and Temporal Equalization for MIMO-VLC System. IEEE Photonics Technol. Lett. 2019, 31, 821–824.
1.3Mp CMOS Digital Image Sensor. In ON Semiconductor; Application Note 98AON94065F; Semiconductor Components Industries, LLC: Aurora, CO, USA, 2017; Available online: https://www.onsemi.com/pdf/datasheet/mt9m131-d.pdf (accessed on 20 March 2021).
Cossu, G.; Sturniolo, A.; Ciaramella, E. Modelization and Characterization of a CMOS Camera as an Optical Real-Time Oscilloscope. IEEE Photonics J. 2020, 12, 1–13.
AN 835: PAM4 Signaling Fundamentals. Intel; Application Note AN-835; Intel Corporation: Santa Clara, CA, USA, 2019; Available online: https://www.intel.com/content/dam/www/programmable/us/en/pdfs/literature/an/an835.pdf (accessed on 10 March 2021).

Figure 1. Defining Flicker Index [34,35].

Figure 2. An example of a generated packet signal with $f_{Tx}$ of 220 Hz.

Figure 3. The structure of the kth neuron in layer m.

Figure 4. The CP 4-PAM OCC scheme: (a) System block diagram, and (b) photograph of the experimental setup.

Figure 5. An example of the received Qth Tx packet signal at a T_exp of 2 ms: (a) without DC gain normalization, and (b) with normalization.

Figure 6. Measured and estimated bandwidth for the IS with T_exp of 2 ms.

Figure 7. Examples of the frame acquisition based on CIS for CP 4-PAM and $f_{Tx}$ of: (a) 220 Hz, (b) 320 Hz, (c) 420 Hz, (d) 520 Hz, (e) 1120 Hz, and (f) 1520 Hz.

Figure 8. Examples of the captured eye diagrams of the CIS received signal for CP 4-PAM with $f_{Tx}$ of: (a) 220 Hz, (b) 320 Hz, (c) 420 Hz, (d) 520 Hz, (e) 1120 Hz, and (f) 1520 Hz.

Figure 9. An example of the transmitted and received signal with and without equalization for CP 4-PAM and $N_{pps}$ of 18.5 row pixels/symbol: (a) training sets, and (b) testing sets.

Figure 10. The eye linearity against $N_{pps}$ for the proposed system with and without equalization and for T_exp of 2 ms.

Figure 11. The BER measurements as a function of $N_{pps}$ for the proposed system with and without equalization and for T_exp of 2 ms.

Table 1. Proposed CP 4-PAM levels.

Input Data	Conventional PAM Level	CP 4-PAM: First Level ($I_{S}$)	CP 4-PAM: Stabilization Level ($2I_{ave} - I_{S}$)
11	3	3	0
10	2	2	1
01	1	1	2
00	0	0	3
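
The mapping in Table 1 can be illustrated with a short sketch. It assumes normalized intensity levels 0 to 3, so that $2I_{ave}$ = 3, and it assumes the stabilization level is sent directly after the first level. The function name `cp_4pam_encode` is hypothetical and is not taken from the paper.

```python
# Minimal sketch of the CP 4-PAM mapping in Table 1 (normalized levels 0..3).
# Assumption: each bit pair is sent as the first level I_S followed by the
# stabilization level 2*I_ave - I_S, so every pair has the same mean intensity.
FIRST_LEVEL = {"11": 3, "10": 2, "01": 1, "00": 0}
TWICE_I_AVE = 3  # 2 * I_ave for normalized levels 0, 1, 2, 3


def cp_4pam_encode(bits):
    """Map a bit string to a list of (first, stabilization) levels, flattened."""
    if len(bits) % 2 != 0:
        raise ValueError("bit string length must be even")
    levels = []
    for i in range(0, len(bits), 2):
        i_s = FIRST_LEVEL[bits[i:i + 2]]
        levels.append(i_s)                # first level I_S
        levels.append(TWICE_I_AVE - i_s)  # stabilization level
    return levels


encoded = cp_4pam_encode("11100100")
print(encoded)
```

Each consecutive pair of output levels sums to 3, which is what keeps the average transmitted power independent of the data.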

Table 2. System parameters.

Description		Value
Tx	LED type	Luxeon Rebel LED (SR-01-WC310)
	Tx signal bandwidth $f_{Tx}$	220–1520 Hz
	Tx bias current	180 mA
Camera Rx	Camera model	Thorlabs DCC1645C-HQ
	Exposure time T_exp	2 ms
	Maximum SNR of IS	44 dB [37]
	Lens type	Navitar 12 mm F/1.8 2/3” 10 MP
	Pixel clock	10 MHz
	Camera raw image resolution	1280 × 1024 pixels
	Captured symbols per frame	11–76 symbols
Packet Generator	Data format	CP-PAM
	Payload symbols per packet P_bit	5–70 symbols
	Packet generator sample rate	11.125 kHz
	Number of samples $n_{samp}$	10
Channel	Channel length	50 cm
ANN Equalizer	Activation function	Hyperbolic tangent sigmoid
	Number of neurons in input layer	200
	Number of neurons in output layer	1
	Number of neurons in hidden layer	200
	Number of hidden layers	2
	Train-to-test split (training fraction)	0.8
	Maximum epochs	1000
	Learning rate parameter η	0.01
	Network training function	Resilient back-propagation
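
As a rough illustration of the ANN equalizer dimensions listed in Table 2, the sketch below builds a feed-forward network with 200 inputs, two hidden layers of 200 tanh neurons, and one output, and runs a forward pass. The weights are random and untrained, the linear output layer is an assumption, and the resilient back-propagation training step is not shown. The names `init_params` and `forward` are hypothetical.

```python
import numpy as np

# Layer sizes from Table 2: input, two hidden layers, output.
LAYER_SIZES = [200, 200, 200, 1]


def init_params(sizes, seed=0):
    """Create random (untrained) weights and zero biases for each layer."""
    rng = np.random.default_rng(seed)
    params = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        w = rng.normal(0.0, 1.0 / np.sqrt(n_in), size=(n_in, n_out))
        b = np.zeros(n_out)
        params.append((w, b))
    return params


def forward(params, x):
    """Forward pass: tanh in the hidden layers, linear output (assumed)."""
    a = np.asarray(x, dtype=float)
    for w, b in params[:-1]:
        a = np.tanh(a @ w + b)
    w, b = params[-1]
    return a @ w + b


params = init_params(LAYER_SIZES)
n_weights = sum(w.size + b.size for w, b in params)
batch = np.zeros((4, 200))  # four windows of 200 received samples each
estimate = forward(params, batch)
print(n_weights, estimate.shape)
```

With these sizes the network has 80,601 trainable parameters, and each 200-sample input window produces a single equalized output value.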

Table 3. Results with R_f of 30 fps and CIS width of 1024 px.

Payload Symbols/Packet (P_bit)	Total Number of Symbols/Packet	Number of Row Pixels/Symbol ($N_{pps}$)	$f_{Tx}$ (Hz)
5	11	60.54	220
10	16	41.62	320
15	21	31.71	420
20	26	25.61	520
30	36	18.50	720
35	41	16.24	820
40	46	14.48	920
50	56	11.89	1120
70	76	8.76	1520
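
The rows of Table 3 follow some simple patterns that a few lines of code can check. The sketch below only inspects the tabulated values. The observations it prints (6 non-payload symbols per packet, 20 Hz of $f_{Tx}$ per symbol in the packet, and a roughly constant product of $N_{pps}$ and total symbols) are inferred from the table and are not formulas quoted from the paper. The helper name `summarize` is hypothetical.

```python
# Rows of Table 3: (payload symbols, total symbols, N_pps, f_Tx in Hz).
TABLE_3 = [
    (5, 11, 60.54, 220),
    (10, 16, 41.62, 320),
    (15, 21, 31.71, 420),
    (20, 26, 25.61, 520),
    (30, 36, 18.50, 720),
    (35, 41, 16.24, 820),
    (40, 46, 14.48, 920),
    (50, 56, 11.89, 1120),
    (70, 76, 8.76, 1520),
]


def summarize(rows):
    """Per row: (non-payload symbols, f_Tx per symbol, N_pps * total symbols)."""
    return [
        (total - payload, f_tx / total, n_pps * total)
        for payload, total, n_pps, f_tx in rows
    ]


summary = summarize(TABLE_3)
for overhead, hz_per_symbol, rows_per_packet in summary:
    print(overhead, hz_per_symbol, round(rows_per_packet, 1))
```

In every row the packet occupies close to 666 pixel rows, so halving $N_{pps}$ roughly doubles the number of symbols captured per packet.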

Table 4. Effective R_b for different IS resolutions at the FEC limit.

IS Resolution	$R_{b}$ (bps) at $N_{pps}$ = 26 (w/o equalization), R_f = 30 fps	$R_{b}$ (bps) at $N_{pps}$ = 26 (w/o equalization), R_f = 60 fps	$R_{b}$ (bps) at $N_{pps}$ = 20 (with equalization), R_f = 30 fps	$R_{b}$ (bps) at $N_{pps}$ = 20 (with equalization), R_f = 60 fps
1200 × 1800	3794	7588	5040	10,080
1500 × 2100	4486	8972	5940	11,880
1800 × 2400	5178	10,357	6840	13,680
2100 × 3000	6563	13,126	8640	17,280
2400 × 3000	6563	13,126	8640	17,280
3300 × 4200	9332	18,665	12,240	24,480
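
The entries of Table 4 appear to be reproduced by a simple relation, sketched below. It assumes 2 bits per 4-PAM payload symbol, 6 non-payload symbols per frame (the difference between the total and payload columns of Table 3), and that the number of pixel rows is the second figure of each resolution. This relation is inferred from the tabulated numbers and is not quoted from the paper. The function name `effective_bit_rate` is hypothetical.

```python
BITS_PER_SYMBOL = 2   # 4-PAM carries 2 bits per payload symbol
OVERHEAD_SYMBOLS = 6  # assumed: total minus payload symbols in Table 3


def effective_bit_rate(n_rows, n_pps, frame_rate):
    """Estimate R_b in bps from sensor rows, row pixels/symbol and frame rate."""
    symbols_per_frame = n_rows / n_pps
    payload_symbols = symbols_per_frame - OVERHEAD_SYMBOLS
    return BITS_PER_SYMBOL * frame_rate * payload_symbols


# Reproduce the first and last rows of Table 4 (1800 and 4200 pixel rows).
for n_rows in (1800, 4200):
    without_eq = [round(effective_bit_rate(n_rows, 26, rf)) for rf in (30, 60)]
    with_eq = [round(effective_bit_rate(n_rows, 20, rf)) for rf in (30, 60)]
    print(n_rows, without_eq, with_eq)
```

Under these assumptions, lowering the usable $N_{pps}$ from 26 to 20 raises the number of symbols per frame, which is where the rate gain attributed to equalization comes from.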

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Younus, O.I.; Hassan, N.B.; Ghassemlooy, Z.; Zvanovec, S.; Alves, L.N.; Le-Minh, H. The Utilization of Artificial Neural Network Equalizer in Optical Camera Communications. Sensors 2021, 21, 2826. https://doi.org/10.3390/s21082826

