nection, cloud fees, and workstation performance can
significantly improve the effective throughput and
significantly reduce cloud cost when compared to un-
compressed and the default compressed file trans-
fers. Throughput-effective upload and download mo-
des favor utilities that offer tradeoffs between com-
pression ratio and throughput, such as pzstd/zstd and
lbzip2/pbzip2, whereas cost-effective download mo-
des favor utilities with higher compression ratio, such
as lbzip2/pbzip2 and pxz/xz. These findings may
guide throughput and cost optimizations of big data
transfers in the cloud and encourage the development
of data transfer frameworks conscientious of the ex-
isting parameters for real-time selection of optimal
transfer modes.
ACKNOWLEDGEMENTS
This work has been supported by AWS Cloud Credits
for Research and in part by National Science Founda-
tion grants CNS-1205439 and CNS-1217470.
REFERENCES
Abadi, D. J. (2009). Data management in the cloud: li-
mitations and opportunities. IEEE Data Eng. Bull.,
32(1):3–12.
Abu-Libdeh, H., Princehouse, L., and Weatherspoon, H.
(2010). RACS: A case for cloud storage diversity.
In Proceedings of the 1st ACM Symposium on Cloud
Computing, SoCC ’10, pages 229–240. ACM.
Amante, C. and B. W. Eakins (2009). ETOPO1 1 arc-
minute global relief model: Procedures, data sources
and analysis.
Amazon (2017). Amazon web services (AWS) - cloud com-
puting services. https://aws.amazon.com/.
Apple (2017). Data compression | apple developer docu-
mentation. https://tinyurl.com/ybwc77su.
Barr, K. and Asanovi
´
c, K. (2003). Energy aware lossless
data compression. In Proceedings of the 1st Internati-
onal Conference on Mobile Systems, Applications and
Services (MobiSys’03), pages 231–244. ACM Press.
Barr, K. C. and Asanovi
´
c, K. (2006). Energy-aware lossless
data compression. ACM Transactions on Computer
Systems, 24(3):250–291.
Bicer, T., Yin, J., Chiu, D., Agrawal, G., and Schuchardt, K.
(2013). Integrating online compression to accelerate
large-scale data analytics applications. In 2013 IEEE
27th International Symposium on Parallel Distributed
Processing (IPDPS), pages 1205–1216.
Bonfield, J. K. and Mahoney, M. V. (2013). Compression
of FASTQ and SAM format sequencing data. PLoS
ONE, 8(3):1–10.
Chen, M., Mao, S., and Liu, Y. (2014). Big data: A survey.
Mobile Networks and Applications, 19(2):171–209.
Chen, Y., Ganapathi, A., and Katz, R. H. (2010). To com-
press or not to compress - compute vs. IO tradeoffs
for mapreduce energy efficiency. In Proceedings of
the First ACM SIGCOMM Workshop on Green Net-
working, Green Networking ’10, pages 23–28. ACM.
Cyan (2013). RealTime data compression: Finite
state entropy - a new breed of entropy coder.
https://tinyurl.com/p5ehc54.
Dzhagaryan, A. and Milenkovi
´
c, A. (2015). On effecti-
veness of lossless compression in transferring mhe-
alth data files. In 2015 17th International Conference
on E-health Networking, Application Services (Healt-
hCom), pages 665–668.
Dzhagaryan, A. and Milenkovi
´
c, A. (2016). Models for
evaluating effective throughputs for file transfers in
mobile computing. In 2016 25th International Con-
ference on Computer Communication and Networks
(ICCCN), pages 1–9.
Dzhagaryan, A. and Milenkovi
´
c, A. (2017). A framework
for optimizing file transfers between mobile devices
and the cloud. In 2017 IEEE 28th Annual Internatio-
nal Symposium on Personal, Indoor, and Mobile Ra-
dio Communications (PIMRC), pages 1–7.
Dzhagaryan, A., Milenkovi
´
c, A., and Burtscher, M. (2013).
Energy efficiency of lossless data compression on a
mobile device: An experimental evaluation. In 2013
IEEE International Symposium on Performance Ana-
lysis of Systems and Software (ISPASS), pages 126–
127.
Dzhagaryan, A., Milenkovi
´
c, A., and Burtscher, M. (2015).
Quantifying benefits of lossless compression utilities
on modern smartphones. In 2015 24th International
Conference on Computer Communication and Net-
works (ICCCN), pages 1–9.
Dzhagaryan, A., Milenkovi
´
c, A., Milosevic, M., and Jova-
nov, E. (2016a). An environment for automated mea-
surement of energy consumed by mobile and embed-
ded computing devices. Measurement, 94:103–118.
Dzhagaryan, A., Milenkovi
´
c, A., Milosevic, M., and Jova-
nov, E. (2016b). An environment for automated me-
asuring of energy consumed by android mobile de-
vices. In Proceedings of the 6th International Joint
Conference on Pervasive and Embedded Computing
and Communication Systems - Volume 1: PEC,, pages
28–39.
Facebook (2017). Zstandard - real-time data compression
algorithm. https://tinyurl.com/zf4vfz6.
Google (2017a). Google cloud computing, hosting services
& APIs. https://cloud.google.com/.
Google (2017b). google/brotli. https://github.com/google/
brotli.
Harnik, D., Kat, R., Margalit, O., Sotnikov, D., and Trae-
ger, A. (2013). To zip or not to zip: Effective resource
usage for real-time compression. In Proceedings of
the 11th USENIX Conference on File and Storage
Technologies, FAST’13, pages 229–241. USENIX.
Krintz, C. and Sucu, S. (2006). Adaptive on-the-fly com-
On Effectiveness of Compressed File Transfers to/from the Cloud: An Experimental Evaluation
45