segmentation model for strategic marketing
planning.
In this paper, we propose a segmentation model
of the Internet health information market in Korea
by using decision tree, a widely used data mining
algorithm. The suggested segmentation model is
expected to be used for improved health counselling,
disease consultation, health commodity shopping, or
hospital marketing.
2 POPULATION OF THE
RESEARCH
As the population of the research, we used Inchon
city and Kangwon Province. Inchon is a typical big
city in Korea with population of 2.6 millions, and
the number of residents of Kangwon Province is
about 1.5 million with sparse distribution.
10,325 respondents are selected based on region,
gender, and age (20 or over). Telephone survey with
structured questionnaire was performed, and finally
8,656 completed interviews were used for the
analysis out of 10,325 respondents. The survey was
conducted from July 2006 to October 2006.
3 MEASUREMENT
The questionnaire contained: demographic data,
health condition, smoking, drinking, and usage of
the Internet for health information. As for the health
information access, the following are asked:
information access experience during the previous
year, type of the information such as getting general
health tips, disease consultation, health commodity
shopping, and hospital selection. We allowed plural
choices and investigated the respondent's
experience.
4 ANALYSIS
A decision tree analysis uses a tree structure to
classify data and predict the following action
according to given decision rules. CHAID (Chi-
squared Automatic Interaction Detection), CART
(Classification and Regression Tree),
QUEST(Quick, Unbiased, Efficient Statistical Tree)
algorithms are widely used for decision tree
analysis. In this paper, we used CHAID algorithm
where chi-square statistics is used to find an
optimum split. It is noted that CHAID can produce
multiple splits, unlikely CART or QUEST where
only binary split is allowed.
Dependent variables in this research are
experience of the Internet health information access,
disease consultation, health commodity shopping,
and hospital selection, while independent variables
are demographic data and health conditions.
5 RESULT
5.1 Experience of the Internet Health
Information Access
Among 8,656 respondents, 1,665 (19.2%) have used
the Internet for health information search during the
previous year. The main purposes of the search was,
allowing plural choice, for general health tips
(64.2%), disease consultation(32.0%), health
commodity shopping(23.7%), or hospital selection
(19.3%).
Table 1: The main purpose of searching the Internet health
information (unit: person (%)).
Category
Male
(N=726)
Female
(N=939)
Total
(N=1665)
General
health tips
452(62.3) 617(65.7) 1069(64.2)
Disease
consultation
225(31.0) 307(32.7) 532(32.0)
Health
commodity
shopping
148(20.4) 247(26.3) 395(23.7)
Hospital
selection
127(17.5) 194(20.7) 321(19.3)
* allowing plural choice
5.2 Decision Tree Analysis to
Categorize Internet Health
Information Search
5.2.1 General Health Tips
The decision tree analysis of the health information
search for general health tips showed that the key
decision factor was health status. 68.2% of healthy
person used the Internet for general health tips,
however, only 44.35% of unhealthy person searched
the Internet.
5.2.2 Disease Consultation
The most important variable that affects the Internet
access for disease consultation is also the health
CUSTOMER BEHAVIOR ANALYSIS FOR INTERNET HEALTH INFORMATION MARKET SEGMENTATION IN
KOREA
257