SHVC Tile-Based 360-Degree Video Streaming for Mobile VR: PC Offloading Over mmWave
Abstract
:1. Introduction
2. Background
2.1. Tiled-HEVC(SHVC) Decoding on Mobile Cores
2.2. Viewport-Based 360-Degree Video Streaming
2.3. Data Rate Requirement for Mobile VR and mmWave UDP Throughput in Indoor Environments
- Pixels/degree: 40 (number of pixels per degree).
- Resolution: 11,520 (3 times the 4K vertical resolution).
- Frame rate: 90 fps (90 fps can prevent nausea through low delay).
- Environment-based or scene-based audio (360-degree surround sound and object-oriented audio).
- Maximum video and audio delay: 20 ms (time between user interaction and VR image and audio).
3. Mobile VR: Concept Architecture and Proposed Methods
3.1. Tiled-SHVC for 360-Degree Video Streaming
3.1.1. Challenge: Reference to Undecoded Tiles in the Temporal Inter Prediction (TIP)
- Proposed Step 1: The motion vector of undecoded tile at EL is replaced by the upsampled BL.In step 1, we suggest the use of the upsampled PUs at the BL to overcome the problem mentioned in Figure 8a. As shown in Figure 8b, the encoder considers an upsampled PU at the BL as the reference picture, and does not consider the one at the EL. Therefore, the viewport tiles selected at the EL can refer to all areas of the reference picture to eliminate decoding errors at the EL. In addition, the BL covers the entire picture but the EL represents only viewport tiles. However, because the EL does not use TIP, the bitrate increases significantly.
- Proposed Step 2: Available tile encoding in EL using upsampled BL and decoded tile of EL.Step 1 solves the problem of referring to the outer region of the viewport. However, as step 1 uses only an upsampled PU at the BL as a reference list, that PUis still available for ILP. Therefore, as shown in Figure 8c, when the MV of the TIP points to a position within the same position tile, the PU of the current picture, () refers to the PU of the previous picture, (). When calculating the rate distortion (RD) cost of finding the optimal PU, both the upsampled BL and the previous picture of the EL are included. The encoder chooses the PU with a more efficient RD cost than others from these options. Thus, Step 2 demonstrates an optimized encoding result that is better than Step 1.
3.1.2. Available Tile Encoding for HEVC Decoder Using Intra Prediction
3.1.3. Implementation of Tiled Extractor by Modifying the TIP Information
3.2. Proposed Tiled-HEVC Partitioning for Mobile Devices
3.3. PC Offloading over an mmWave Connection
- To solve the high resolution of 360-degree viewport streaming. Instead of other wireless 802.11 technologies, the mmWave link is applied to support high speed.
- To avoid the overflow issue or quite low performance issue of high-resolution video processing in mobile devices in terms of decoding rate [27], the decoding task is offloaded to a powerful PC. Figure 12 shows the proposed system with an offloading mechanism. This mechanism helps mobile devices reducing the computation and power required for a decoding process. The PC could receive encoded bitstream from a server or directly from the mobile devices to decode and transmit the decoded viewport to mobile device over mmWave links.
- As mentioned in Section 2.3, to reduce the effect of deafness, the beam misalignment and blockage problems, this study implements synchronization mechanisms to ensure the performance of the connected link. When data packets drop, the ACK packets are fed back from the mobile device to the PC, confirming successfully received packets. Next, ACK is completed on the PC that then sends the next packet to the mobile device. If the confirmation fails, the PC will re-send the current packet until it succeeds, or timeout is reached.
4. Experimental Results and Discussion
4.1. Performance Evaluation of Tiled-SHVC Extractor for VR Streaming
4.2. Performance Evaluation of Non-Uniform Tile Partitioning on Mobile Core
4.3. Performance Evaluation of PC Offloading for VR Streaming
4.3.1. PC Offloading Scenario
4.3.2. Proposed Synchronization Method Performance
5. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Champel, M.; Stockhammer, T.; Fautier, T.; Thomas, E.; Koenen, R. Quality Requirements for VR. In Proceedings of the 116th MPEG Meeting of ISO/IEC JTC1/SC29/WG11, MPEG 116/m39532, Chengdu, China, 17–21 October 2016. [Google Scholar]
- Jeong, J.; Jang, D.; Son, J.; Ryu, E.S. 3DoF+ 360 Video Location-Based Asymmetric Down-Sampling for View Synthesis to Immersive VR Video Streaming. Sensors 2018, 18, 3148. [Google Scholar] [CrossRef] [PubMed]
- Gaddam, V.R.; Riegler, M.; Eg, R.; Griwodz, C.; Halvorsen, P. Tiling in interactive panoramic video: Approaches and evaluation. IEEE Trans. Multimedia 2016, 18, 1819–1831. [Google Scholar] [CrossRef]
- Sun, Y.; Chen, Z.; Tao, M.; Liu, H. Communication, Computing and Caching for Mobile VR Delivery: Modeling and Trade-off. arXiv, 2018; arXiv:1804.10335. [Google Scholar]
- Baik, H.; Song, H. A complexity-based adaptive tile partitioning algorithm for HEVC decoder parallelization. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 4298–4302. [Google Scholar]
- Reichelt, S.; Häussler, R.; Fütterer, G.; Leister, N. Depth cues in human visual perception and their realization in 3D displays. In Proceedings of the Three-Dimensional Imaging, Visualization, and Display 2010 and Display Technologies and Applications for Defense, Security, and Avionics IV, Orlando, FL, USA, 5–9 April 2010; International Society for Optics and Photonics: Bellingham, WA, USA, 2010; Volume 7690, p. 76900B. [Google Scholar]
- Sánchez, Y.; Skupin, R.; Schierl, T. Compressed domain video processing for tile based panoramic streaming using HEVC. In Proceedings of the 2015 IEEE International Conference on Image Processing (ICIP), Quebec City, QC, Canada, 27–30 September 2015; pp. 2244–2248. [Google Scholar]
- Wang, Y.K.; Hendry, M.K. Viewport dependent processing in VR: Partial video decoding. In Proceedings of the 116th MPEG Meeting of ISO/IEC JTC1/SC29/ WG11, MPEG 116/ m38559, Chengdu, China, 17–21 October 2016. [Google Scholar]
- Zare, A.; Aminlou, A.; Hannuksela, M.M.; Gabbouj, M. HEVC-compliant tile-based streaming of panoramic video for virtual reality applications. In Proceedings of the 2016 ACM on Multimedia Conference, Amsterdam, The Netherlands, 15–19 October 2016; ACM: New York, NY, USA, 2016; pp. 601–605. [Google Scholar]
- Qi, Y.; Hunukumbure, M.; Nekovee, M.; Lorca, J.; Sgardoni, V. Quantifying data rate and bandwidth requirements for immersive 5G experience. In Proceedings of the 2016 IEEE International Conference on Communications Workshops (ICC), Kuala Lumpur, Malaysia, 22–27 May 2016; pp. 455–461. [Google Scholar]
- Nitsche, T.; Cordeiro, C.; Flores, A.B.; Knightly, E.W.; Perahia, E.; Widmer, J.C. IEEE 802.11 ad: Directional 60 GHz communication for multi-Gigabit-per-second Wi-Fi. IEEE Commun. Mag. 2014, 52, 132–141. [Google Scholar] [CrossRef]
- Lu, J.S.; Steinbach, D.; Cabrol, P.; Pietraski, P. Modeling human blockers in millimeter wave radio links. ZTE Commun. 2012, 10, 23–28. [Google Scholar]
- Ramirez, D.; Huang, L.; Wang, Y.; Aazhang, B. On opportunistic mmWave networks with blockage. IEEE J. Sel. Areas Commun. 2017, 35, 2137–2147. [Google Scholar] [CrossRef]
- Gapeyenko, M.; Samuylov, A.; Gerasimenko, M.; Moltchanov, D.; Singh, S.; Aryafar, E.; Yeh, S.P.; Himayat, N.; Andreev, S.; Koucheryavy, Y. Analysis of human-body blockage in urban millimeter-wave cellular communications. In Proceedings of the 2016 IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, 22–27 May 2016; pp. 1–7. [Google Scholar]
- Seed Studio. Available online: https://www.seeedstudio.com/WiGig-USB3.0-Dongle-p-2827.html (accessed on 31 October 2018).
- Shokri-Ghadikolaei, H.; Fischione, C.; Popovski, P.; Zorzi, M. Design aspects of short-range millimeter-wave networks: A MAC layer perspective. IEEE Netw. 2016, 30, 88–96. [Google Scholar] [CrossRef] [Green Version]
- Boyce, J.M.; Ye, Y.; Chen, J.; Ramasubramonian, A.K. Overview of SHVC: Scalable extensions of the high efficiency video coding standard. IEEE Trans. Circuits Syst. Video Technol. 2016, 26, 20–34. [Google Scholar] [CrossRef]
- Sullivan, G.J.; Ohm, J.R.; Han, W.J.; Wiegand, T. Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1649–1668. [Google Scholar] [CrossRef]
- Misra, K.; Segall, A.; Horowitz, M.; Xu, S.; Fuldseth, A.; Zhou, M. An overview of tiles in HEVC. IEEE J. Sel. Top. Signal Process. 2013, 7, 969–977. [Google Scholar] [CrossRef]
- JCT-VC. Available online: https://hevc.hhi.fraunhofer.de/shvc (accessed on 31 October 2018).
- JCT-VC. Available online: https://hevc.hhi.fraunhofer.de/ (accessed on 31 October 2018).
- Feldmann, C.; Bulla, C.; Cellarius, B. Efficient stream-reassembling for video conferencing applications using tiles in HEVC. In Proceedings of the International Conferences on Advances in Multimedia (MMEDIA), Venice, Italy, 22–26 April 2013; pp. 130–135. [Google Scholar]
- Bossen, F.; Bross, B.; Suhring, K.; Flynn, D. HEVC complexity and implementation analysis. IEEE Trans. Circuits Syst. Video Technol. 2012, 22, 1685–1696. [Google Scholar] [CrossRef]
- Roh, H.J.; Ryu, Y.; Ryu, E.S. Video Decoding Complexity Analysis Based on HEVC Resolution. In Proceedings of the Fall Conference of Korea Information Processing Society (KIPS), Seoul, Korea, 30–31 October 2015; pp. 1603–1606. [Google Scholar]
- Ryu, Y.; Roh, H.J.; Kang, S.J.; Kim, S.K.; Ryu, E.S. Non-Uniform HEVC Tile Partitioning Method for Asymmetric Multicores. In Proceedings of the 11th Asia Pacific International Conference on Information Science and Technology (APIC-IST 2016), Hokkaido, Japan, 26–29 June 2016; pp. 229–231. [Google Scholar]
- Ryu, Y.; Roh, H.J.; Ryu, E.S. Tile Partitioning-based HEVC Parallel Decoding Optimization for Asymmetric Multicore Processor. J. Korean Inst. Inf. Sci. Eng. 2016, 43, 1060–1065. [Google Scholar] [CrossRef]
- Van Dien, N.; Ryu, E.S. Performance comparison of SIMD-based HEVC decoders on mobile processor. In Proceedings of the 2017 International Conference on Information and Communications (ICIC), Hanoi, Vietnam, 26–28 June 2017; pp. 298–303. [Google Scholar]
- OpenHEVC. Available online: https://github.com/OpenHEVC/openHEVC (accessed on 31 October 2018).
- Zstandard, F. Zstandard. Available online: http://facebook.github.io/zstd/ (accessed on 31 October 2018).
Parameters | Type | Unit |
---|---|---|
Radiation type | Enfire/Broadside | |
Polarization | Linear | |
2Tx Array Peak Gain | 10 | dBi |
2Rx Array Peak Gain | 10 | degree |
3 dB Beamwidth @co-plane(include BF) | 110 | degree |
Distance | 0.5 m | 1 m | 1.5 m | 2 m | 2.5 m | 3 m | 3.5 m | 4 m |
928.5 Mbps | 922.6 Mbps | 893.7 Mbps | 889.6 Mbps | 883.4 Mbps | 748.4 Mbps | 670.5 Mbps | 577.7 Mbps | |
Obstacle | Obstacle: Book | Obstacle: Hand | Obstacle: Head | |||||
828.7 Mbps | 875.6 Mbps | 685.6 Mbps | ||||||
Beam Align | 0° | 15° | 25° | 35° | ||||
693.3 Mbps | 706.7 Mbps | 657.4 Mbps | 275.1 Mbps |
Name | Resolution | Frame Length | Frame Rate |
---|---|---|---|
KiteFlite | 8192 × 4096 | 300 | 30 fps |
Harbor | 8192 | 300 | 30 fps |
Trolley | 8192 | 300 | 30 fps |
GasLamp | 8192 | 300 | 30 fps |
Coding Option | SHM Parameter | HM Parameter |
---|---|---|
Version | 12.3 | 16.16 |
CTU size | 64 × 64 | |
Coding structure | RA | |
QP | - | 22 |
BL QP | 22 | - |
EL QP | 22 | - |
Tile | Uniformly 3 × 3 = 9 tiles | |
Slice Mode | Disable all slice options | |
WPP mode | Disable all wpp options |
Name | Modified SHM | Modified HM |
---|---|---|
KiteFlite | 6% | 8% |
Harbor | 5% | 8% |
Trolley | 10% | 13% |
GasLamp | 13% | 17% |
Average bitrate increase | 8% | 11% |
Name | Modified SHM | Modified HM |
---|---|---|
KiteFlite | −0.04 dB | −0.05 dB |
Harbor | −0.03 dB | −0.02 dB |
Trolley | −0.05 dB | −0.07 dB |
GasLamp | −0.05 dB | −0.06 dB |
Average PSNR increase | −0.04 dB | −0.05 dB |
Name | HM | SHM | ||
---|---|---|---|---|
4 Tiles Bitrate Saving (apx) | 1 Tile Bitrate Saving (apx) | 4 Tiles Bitrate Saving (apx) | 1 Tile Bitrate Saving (apx) | |
KiteFlite | 51% | 87% | 52% | 88% |
Harbor | 51% | 87% | 53% | 88% |
Trolley | 49% | 87% | 50% | 87% |
GasLamp | 47% | 86% | 49% | 87% |
Average bitrate saving | 49% | 86% | 51% | 87% |
Coding Options | Parameters |
---|---|
Coding Structure | RA (Random Access) AI (All Intra) LDB (Low-Delay B) |
QP | 22,27,32,37 |
Number of Tiles | 6 (3 × 2) |
Mobile device | Samsung GalaxyS7 |
PC | Core-i7/4 cores |
mmWave device | Wigig USB3.0 Dongle |
Distance | 1 m |
SHVC bitstream | DrivingInCity (3840 × 1920_1920 × 1080) |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Nguyen, D.V.; Le, T.T.; Lee, S.; Ryu, E.-S. SHVC Tile-Based 360-Degree Video Streaming for Mobile VR: PC Offloading Over mmWave. Sensors 2018, 18, 3728. https://doi.org/10.3390/s18113728
Nguyen DV, Le TT, Lee S, Ryu E-S. SHVC Tile-Based 360-Degree Video Streaming for Mobile VR: PC Offloading Over mmWave. Sensors. 2018; 18(11):3728. https://doi.org/10.3390/s18113728
Chicago/Turabian StyleNguyen, Dien Van, Tuan Thanh Le, Sangsoon Lee, and Eun-Seok Ryu. 2018. "SHVC Tile-Based 360-Degree Video Streaming for Mobile VR: PC Offloading Over mmWave" Sensors 18, no. 11: 3728. https://doi.org/10.3390/s18113728
APA StyleNguyen, D. V., Le, T. T., Lee, S., & Ryu, E.-S. (2018). SHVC Tile-Based 360-Degree Video Streaming for Mobile VR: PC Offloading Over mmWave. Sensors, 18(11), 3728. https://doi.org/10.3390/s18113728