Entropy 2013, 15(4), 1289-1310; doi:10.3390/e15041289
Article

HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data

Steven V. Weijs 1,*, Nick van de Giesen 2 and Marc B. Parlange 1
Received: 31 January 2013; in revised form: 27 March 2013 / Accepted: 1 April 2013 / Published: 10 April 2013
(This article belongs to the Special Issue Applications of Information Theory in the Geosciences)
Abstract: Algorithmic information theory connects the information content of a data set to the shortest computer program that can produce it, and from it we know that there are strong analogies between compression, knowledge, inference and prediction. The more we know about a data-generating process, the better we can predict and compress the data. A model that is inferred from data should ideally be a compact description of those data. In theory, this means that hydrological knowledge could be incorporated into compression algorithms to compress hydrological data more efficiently and to outperform general-purpose compression algorithms. In this study, we develop such a hydrological data compressor, named HydroZIP, and test in practice whether it can outperform general-purpose compression algorithms on hydrological data from 431 river basins from the Model Parameter Estimation Experiment (MOPEX) data set. HydroZIP compresses using temporal dependencies and parametric distributions. The resulting file sizes are interpreted as measures of information content, complexity and model adequacy. These results are discussed to illustrate points related to learning from data, overfitting and model complexity.
Keywords: data compression; algorithmic information theory; hydrology; inference; streamflow; MOPEX
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
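
The idea that knowledge of temporal dependence helps compression can be illustrated with a minimal sketch. This is not the HydroZIP algorithm: it assumes a synthetic, slowly varying 8-bit series (a clamped random walk) in place of MOPEX data, uses Python's zlib as the general-purpose compressor, and uses simple delta encoding as a stand-in for exploiting temporal dependencies. The function names synthetic_series, delta_encode and delta_decode are hypothetical and chosen for illustration only.

```python
import random
import zlib


def synthetic_series(n=5000, seed=0):
    """Slowly varying 8-bit series: a clamped random walk with steps in {-1, 0, 1}.

    Assumption: this stands in for a quantized hydrological time series.
    """
    rng = random.Random(seed)
    level = 128
    values = []
    for _ in range(n):
        level = min(255, max(0, level + rng.choice((-1, 0, 1))))
        values.append(level)
    return bytes(values)


def delta_encode(data):
    """Keep the first byte, then store each byte's difference from its predecessor (mod 256)."""
    out = bytearray(data[:1])
    for prev, cur in zip(data, data[1:]):
        out.append((cur - prev) % 256)
    return bytes(out)


def delta_decode(encoded):
    """Invert delta_encode by cumulative summation (mod 256)."""
    out = bytearray(encoded[:1])
    for diff in encoded[1:]:
        out.append((out[-1] + diff) % 256)
    return bytes(out)


series = synthetic_series()
# Compressed sizes serve as rough, compressor-dependent estimates of information content.
raw_size = len(zlib.compress(series, 9))
delta_size = len(zlib.compress(delta_encode(series), 9))
print(f"original: {len(series)} bytes, zlib: {raw_size}, delta + zlib: {delta_size}")
```

Because consecutive values in the synthetic series differ by at most one, the differences use a far smaller alphabet than the values themselves, so the same general-purpose compressor produces a smaller file once that dependence is made explicit. The transform is lossless, since delta_decode recovers the original series exactly.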

MDPI and ACS Style

Weijs, S.V.; van de Giesen, N.; Parlange, M.B. HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data. Entropy 2013, 15, 1289-1310.

AMA Style

Weijs SV, van de Giesen N, Parlange MB. HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data. Entropy. 2013; 15(4):1289-1310.

Chicago/Turabian Style

Weijs, Steven V.; van de Giesen, Nick; Parlange, Marc B. 2013. "HydroZIP: How Hydrological Knowledge can Be Used to Improve Compression of Hydrological Data." Entropy 15, no. 4: 1289-1310.


Entropy, EISSN 1099-4300, published by MDPI AG, Basel, Switzerland