relationship to salary. Personal interpretation to this
is that more mistakes made is on behalf of more
attempts in decisions-making on court, which
demonstrate their importance as well as
irreplaceability in their team.
3.3 Application of Linear Regression
Model
Based on previous result, points, turnovers and assists
should be input into the multiple linear regression
model. This model constructs a linear relation
regarding the three data and the corresponding salary
prediction. And according to the model, the final
result is:
𝒚 = 𝟗𝟒𝟔𝟓𝟔𝟕 ∗ [
𝑷𝑻𝑺
] + 𝟏𝟏𝟒𝟗𝟕𝟗𝟑 ∗ [′𝑨𝑺𝑻′] −
𝟒𝟕𝟖𝟏𝟓𝟑 ∗ [′𝑻𝑶𝑽′] − 𝟏𝟖𝟓𝟖𝟐𝟐𝟏 (1)
3.4 Discussion
The model has met the initial demand of this model,
that is offering a valid prediction of salary based on
the data of players. The advantage of this method is
that the rough data which is relatively accessible is
all the linear model needed. Additionally, this model
can deliver a prediction of all categories of player.
Though linear relationship of diverse data and the
corresponding salary prediction is figured out, this
model still has its systematically inevitable
limitations. The dataset originated from the statistics
in season 2022-2023, and this model could be invalid
after several seasons with the burgeoning
increasement of salary map in future. Due to these
reasons, the gap between prediction result from this
method will grow It is noted that this model hasn’t
considered regarding the combination of two data as
a new data itself, which could hold a potential
stronger correlation relationship with the salary,
because of the complexity of permutation among
those data. For example, average points multiply the
field goals rate may be more comprehensive data that
reflects the ability to score and rebounds multiply the
frequency to block could reflect a player’s ability to
protect the paint area comprehensively, thus the
combination potentially have a stronger relationship
with salary. Also, to optimize the overall prediction
result, data for specific players whose salary is
obviously out of portion with its performance could
be omitted. The common case for disproportion
among data and salary is injuries and rookie contacts
for extremely talented young players. Anyhow, the
residual error and the overall degree of fitting is
acceptable; also, the final result constructed by this
model accords with intuition in most cases.
Turnovers itself have a positive linear relationship
to the salary, but when it comes to taking
consideration of multiple factors together, turnovers
have negative impact on the predicted salary. The
personal interpretation to this phenomenon is as
follows. More turnovers mean more wrong decisions
on court, revealing that a player makes more
decisions on court, which demonstrate their
significance to the team, as the author has mentioned
before. Such that, when considering the turnovers as
a single factor to salary, it has a positive impact on
salary. However, in the case that we consider multiple
factors, points and assists can also measure a player’s
significance for the team, as they mean more shots
made and contribution in ball movements,
respectively. When the “significance” of players is
similar, that is to say, they take on the same portion
of responsibility on the offensive side on court. Thus,
less turnovers prevail that their overall decisions
made are more accurate and efficient, such that in this
case, turnovers have a negative impact on the
performance as well as the income prediction of the
player. Additionally, as mentioned before, salary of
players which experience serious injuries, rookie
players and top superstars in the league could not be
measured by this model. The reason is that in those
cases either data couldn’t reveal the true value or
contribution of those players or those players have
unique commercial values for their teams that
couldn’t be measured by data on court.
4 CONCLUSION
This model meets the initial aim of presenting a direct
relationship between data for basketball players and
their corresponding salary in national basketball
association. As a consequence, this enable the
prediction of the amount of their contract amount, as
long as their statistics is given. The main finding of
this study is finding a function regarding of points,
turnovers, assists and salary. More specifically, each
point obtained increases the salary by 946567 dollars
per year; each assist made increases the salary by
1149793 dollars per year; in contrary, each turnover
decreases the amount of salary by 478153 dollars.
The final prediction is the sum of previous three
functions subtracted by constant 1858221, according
to the linear regression model. Take Seth Curry-a
guard player in Maverick as an example. In season
2022-2023, the player got 9.2 points, 1.6 assists and
0.8 turnover per game, and the predicting salary is