users, customer feedback collected by the marketing
system and other data sources and converged to the
centralized data center for unified storage manage-
ment (Tang, Ding, 2021).
Power big data is one of the supporting elements
for building a stable, reliable, efficient and energy-
saving smart grid. By analyzing power big data, we
can improve the lean management level of smart grid,
formulate a more scientific production plan, optimize
energy transmission scheduling, and establish a more
accurate user behavior model.
Power big data has the 3V characteristics of large
volume, many types and fast speed, which are specif-
ically reflected in:
(1) Large volume. With the in-depth promotion of
the construction of smart grid, terminal data collec-
tion equipment such as device sensors and smart me-
ters have been intensively deployed, and the scale of
collected data will increase exponentially, reaching
the TB or even Pb level. Taking Zhejiang Province as
an example, there are 22million power users in the
province. If all smart meters are installed, according
to the requirements of State Grid Corporation of
China to collect one piece of power consumption in-
formation every 15 minutes, 2.1 billion pieces of
power consumption records will be added every day
(Wang, Bai, Dong, et al., 2021).
(2) There are many types. In addition to the tradi-
tional structured data, production management, mar-
keting and other systems produce a large number of
semi-structured and unstructured data such as audio
and video materials. The diversity of data types re-
quires the diversity of storage and processing technol-
ogies. This paper focuses on the data acquisition and
processing system of power consumption infor-
mation, which still focuses on structured data, and
does not discuss the processing of semi-structured
and unstructured data.
(3) Speed. The collection and processing of power
big data have extremely fast speed. The surge in the
number of terminals requires the storage system to
meet the requirements of high-throughput data access
hundreds of thousands of times per second.
In addition, power big data also has some unique
characteristics. According to the "white paper on the
development of power big data in China (2013)",
power big data also has the characteristics of 3E;
(1) Data is energy. Power big data contains ex-
tremely important information such as users' power
consumption rules and optimal transmission schedul-
ing strategies. This information plays a unique and
huge role in reasonably arranging production and re-
ducing energy consumption losses, and promotes the
reduction of energy consumption and sustainable de-
velopment of the power grid, thus embodying the
characteristics of data as energy.
(2) Data is exchange. Through the interaction and
aggregation with big data of other industries, and in-
depth mining and analysis, the information contained
in power big data has extremely important reference
value for the country's high-level decision-making
and economic situation judgement (Chen, Li, Cui et
al., 2019).
(3) Data is empathy. Power big data provides a
new way for State Grid Corporation of China to
timely and accurately discover and meet the needs of
users. Empathy is empathy. Both production and mar-
keting rely on power big data to provide more high-
quality, safe and reliable power services to the major-
ity of power users, so as to achieve the goal of com-
mon development.
3 CLOUD COMPUTING AND ITS
ADVANTAGES
Cloud computing is a new large-scale distributed
computing model, which originates from the demand
of Internet companies for a large number of compu-
ting and storage resources and the pursuit of scalabil-
ity, high performance, high availability and other
characteristics. Cloud computing aggregates a large
number of distributed and heterogeneous resources,
providing users with powerful massive data storage
and computing capabilities. Cloud computing pro-
vides users with on-demand services through virtual-
ization, dynamic resource allocation and other tech-
nologies, avoids resource waste and competition, and
improves resource utilization and application perfor-
mance. Cloud computing provides horizontal scaling
and dynamic load balancing capabilities, that is,
Cloud Computing supports adding new nodes to the
data center at runtime, and the system will automati-
cally migrate part of the load to the new nodes, and
maintain the load balance between nodes, thereby en-
hancing the business carrying capacity of the whole
system. Resources in the cloud computing environ-
ment are organized in the form of data centers. A data
center contains thousands or even tens of thousands
of nodes. Nodes are interconnected through high-
speed networks to jointly provide users with compu-
ting and storage resources. Cloud computing has de-
veloped very rapidly. At present, it has gone out of
the laboratory, and a series of mature products and
technologies have emerged. In addition to Internet