2016) and disaster recovery (Chang, 2015). Inside
one DC, the job can be completed within one VPN
and is in a trust network. Job across DCs may span
public network and it would absolutely have to be
over one VPN. A framework integrated with some
major security technologies, such as firewall, iden-
tity management and encryption has been proposed
for business cloud (Chang et al., 2015) (Chang and
Ramachandran, 2016). Focusing on GHadoop, the
authors of (Zhao et al., 2014) explores the framework
for big data computing across data centers. All these
works can be coordinated with the G-framework to
refine the architecture.
ACKNOWLEDGEMENTS
This work was financially supported by National
High Technology Research and Development
Program of China (No. 2015AA016008), Na-
tional Science and Technology Major Project (No.
JC201104210032A), National Natural Science
Foundation of China (No. 11371004, 61402136),
Natural Science Foundation of Guangdong Province,
China (No. 2014A030313697), International Ex-
change and Cooperation Foundation of Shenzhen
City, China (No. GJHZ20140422173959303),
Shenzhen Strategic Emerging Industries Pro-
gram (No.ZDSY20120613125016389), Shen-
zhen Overseas High Level Talent Innova-
tion and Entrepreneurship Special Funds (No.
KQCX20150326141251370), Shenzhen Applied
Technology Engineering Laboratory for Internet
Multimedia Application of Shenzhen Development
and Reform Commission (No. [2012]720), Public
Service Platform of Mobile Internet Application
Security Industry of Shenzhen Development and
Reform Commission (No. [2012]900).
REFERENCES
Amazon. Amazonproduct. http://aws.amazon.com/.
Apache. hadoop. http://hadoop.apache.org/.
Bard, H. understanding-hadoop-clusters-and-the-network.
http://bradhedlund.com/2011/09/10/
understanding-hadoop-clusters-and-the-network/.
Bard, J. (1991). Some properties of the bilevel program-
ming problem. Journal of optimization theory and
applications, 68(2):371–378.
Chang, H., Kodialam, M., Kompella, R., Lakshman, T.,
Lee, M., and Mukherjee, S. (2011). Scheduling in
mapreduce-like systems for fast completion time. In
INFOCOM, 2011 Proceedings IEEE, pages 3074–
3082.
Chang, V. (2015). Towards a big data system disaster recov-
ery in a private cloud. Ad Hoc Networks, 35:65–82.
Chang, V., Kuo, Y. H., and Ramachandran, M. (2015).
Cloud computing adoption frameworka security
framework for business clouds. Future Generation
Computer Systems, 57:2441.
Chang, V. and Ramachandran, M. (2016). Towards achiev-
ing data security with the cloud computing adoption
framework. IEEE Transactions on Services Comput-
ing, pages 1–1.
Chang, V. and Wills, G. (2015). A model to compare cloud
and non-cloud storage of big data. Future Generation
Computer Systems.
EIA. Usapowerprice. http://www.eia.gov/state/data.cfm?
sid=CT.
Fan, X., Weber, W.-D., and Barroso, L. A. (2007).
Power provisioning for a warehouse-sized computer.
SIGARCH Comput. Archit. News, 35(2):13–23.
google. Google data centers locations. http://www.google.
com/about/datacenters/inside/locations/index.html.
Greenberg, A., Hamilton, J., Maltz, D. A., and Patel, P.
(2008). The cost of a cloud: research problems in data
center networks. ACM SIGCOMM Computer Com-
munication Review, 39(1):68–73.
He, C., Weitzel, D., Swanson, D., and Lu, Y. (2012). Hog:
Distributed hadoop mapreduce on the grid. In High
Performance Computing, Networking, Storage and
Analysis (SCC), 2012 SC Companion:, pages 1276–
1283. IEEE.
Jayalath, C., Stephen, J., and Eugster, P. (2014). From
the cloud to the atmosphere: Running mapreduce
across data centers. Computers, IEEE Transactions
on, 63(1):74–87.
Kulkarni, S. Cooling hadoop: Temperature aware sched-
ulers in data centers.
Kuo, J.-J., Yang, H.-H., and Tsai, M.-J. (2014). Optimal ap-
proximation algorithm of virtual machine placement
for data latency minimization in cloud systems. In IN-
FOCOM, 2014 Proceedings IEEE, pages 1303–1311.
IEEE.
Lublinsky, B., Smith, K. T., and Yakubovich, A. (2013).
Professional Hadoop solutions. John Wiley & Sons,
Inc.
Maheshwari, N., Nanduri, R., and Varma, V. (2012). Dy-
namic energy efficient data placement and cluster re-
configuration algorithm for mapreduce framework.
Future Generation Computer Systems, 28(1):119127.
Moghaddam, F. F., Moghaddam, R. F., and Cheriet, M.
(2014). Carbon-aware distributed cloud: multi-level
grouping genetic algorithm. Cluster Computing,
pages 1–15.
Schadt, E. E., Linderman, M. D., Sorenson, J., Lee, L., and
Nolan, G. P. (2010). Computational solutions to large-
scale data management and analysis. Nature Reviews
Genetics, 11(9):647–657.
Sun, H., Gao, Z., and Wu, J. (2008). A bi-level program-
ming model and solution algorithm for the location of
logistics distribution centers. Applied Mathematical
Modelling, 32(4):610 – 616.
Tannir, K. (2014). Optimizing Hadoop for MapReduce.
Packt Publishing Ltd.