the duration of prediction and the dependent variable,
linear support vector regressors and basic average
forecast combinations surpass other methods,
particularly in the realm of short-term forecasting
(Milunovich, 2020). The research presented a novel,
reflective method for forecasting housing expenses,
integrating data from public facilities and satellite
imagery. In terms of precision, this system surpassed
other machine and deep learning models due to its
adept understanding of intricate feature
interconnections. Using and analyzing both
conventional and sophisticated machine learning
methods to forecast housing costs, concentrating on
multiple aspects, leads to positive results in precise
price prediction (Wang et al, 2021)(Truong et al,
2020). Analysis of data mining techniques such as
random forest, gradient boosting, and linear regressor
on real estate data from the University of California
Irvine revealed that gradient boosting regression is
the most efficient, exhibiting an average absolute
error rate of 3.92 and a test set that includes 20%
(Uzut & Buyrukoglu, 2020). Collectively, these
research works emphasize the dynamic and
developing aspects of forecasting housing prices,
underscoring the critical need for ongoing
enhancement and progression of techniques to
improve precision and dependability in this
economically important field.
This research primarily aims to enhance the
precision of housing price forecasts using the
sophisticated computational power of MLP. The
focus of this study is on creating a complex MLP
structure, adept at deciphering the complex data
associated with housing markets. This encompasses a
comprehensive examination of property details,
geographical features, and wider economic metrics.
The initial phase entails customizing the MLP
structure to integrate various hidden layers and
activation techniques, to thoroughly understand the
complex connections present in the dataset.
Additionally, the research establishes a stringent
process for both the training and validation phases.
The third phase involves an in-depth evaluation and
comparison of the MLP model's effectiveness against
conventional regression models. Furthermore, the
research highlights the significance of preprocessing
data and formulating strategic guidelines. The
experimental results indicate that the MLP model
attains a precision surpassing 90%. This model's
efficiency is evidenced by its performance,
underscoring the transformative power of MLPs in
altering housing price prediction methods. This
research holds significant practical value, providing
vital understanding for prospective purchasers,
vendors, property analysts, and decision-makers. This
study enhances real estate choices by offering a more
precise and dependable method for forecasting
housing costs.
2 METHODOLOGY
2.1 Dataset Description and
Preprocessing
This research utilizes the "House Price Dataset", a
compilation derived from Kaggle. The dataset
includes a wide range of characteristics relevant to the
real estate sector, addressing elements such as
housing costs, their positioning, dimensions, and
other pertinent details. The database comprises more
than 20,000 records, encompassing a wide range of
details including the count of bedrooms, bathrooms,
living spaces, dimensions of lots, and construction
year. In the preliminary stages of processing, the
research segments the dataset into training and testing
parts, ensuring a consistent division ratio of 4:1.
Characteristics that barely affect property values, like
a random identification number, are excluded. To
make computational tasks easier, categorical factors
such as neighborhood and house style are transformed
into dummy variables. Suitable imputation
techniques are utilized to augment missing data,
tailored to the distinct characteristics of each variable.
Moreover, anomalies, particularly in terms of cost
and size, are pinpointed and eliminated to bolster the
predictive models' resilience. Techniques of
normalization are utilized to normalize the
dimensions of every numerical variable, guaranteeing
a uniform distribution of weights throughout the
modeling phase. The goal of this preprocessing
technique is to improve the dataset to precisely and
efficiently forecast housing expenses.
2.2 Proposed Approach
This research aims to create a robust model for
forecasting real estate values. As demonstrated in
Figure 1, this method includes various systematic
stages, each playing a role in improving and refining
the predictive model. Initially, a range of machine
learning models are presented, encompassing Linear
Regression, Decision Tree Regression, Random
Forest Regression, and Gradient Boosting
Regression. Linear Regression provides a crucial
perceptive perspective on the link between traits and
housing expenses, whereas Gradient Boosting
Regression enhances comprehension via its complex