Figure 3: Bar chart template.
Figure 4: Line chart and scatter plot template.
methods. The development and research of this area
would bring a significant improvement to chart com-
prehension for the visually impaired.
6 CONCLUSIONS
In this paper, we introduced Alt-Texify, a pipeline
to classify and extract information from SVG-based
line, bar and scatter charts to create alt-text. The gen-
erated alt-text can assist the visually-impaired in in-
terpreting visualizations on the internet. Our pipeline
consists of three stages: chart classification, extract-
ing the data and alt-text generation. The first stage,
based on previous work, has 86% classification accu-
racy. The second stage achieves 99.74% data extrac-
tion accuracy for bar charts. And the last stage in-
serts the extracted data and labels into a template that
has been created based on previous research on what
makes alt-text helpful and accessible.
ACKNOWLEDGEMENTS
Support for this work from ARC Laureate Program
FL190100035 and Discovery Project DP200100020
is gratefully acknowledged.
REFERENCES
Battle, L., Duan, P., Miranda, Z., Mukusheva, D., Chang,
R., and Stonebraker, M. (2018). Beagle: Automated
Extraction and Interpretation of Visualizations from
the Web, page 1–8. Association for Computing Ma-
chinery, New York, NY, USA.
Berners-Lee, T. and Connolly, D. (1995). Hypertext markup
language - 2.0. Retrieved 9 October 2021 from,
https://datatracker.ietf.org/doc/html/rfc1866.
Carterette, E. C. and Jones, M. H. (1967). Visual and au-
ditory information processing in children and adults.
Science, 156(3777):986–988.
Draper, N. and Smith, H. (1981). Applied Regression Anal-
ysis. Number pt. 766 in Applied Regression Analysis.
Wiley.
D
¨
urnegger, B., Feilmayr, C., and W
¨
oß, W. (2010). Guided
generation and evaluation of accessible scalable vec-
tor graphics. In Proceedings of the 12th International
Conference on Computers Helping People with Spe-
cial Needs: Part I, ICCHP’10, page 27–34, Berlin,
Heidelberg. Springer-Verlag.
Gatt, A. and Krahmer, E. (2018). Survey of the state
of the art in natural language generation: Core
tasks, applications and evaluation. J. Artif. Int. Res.,
61(1):65–170.
Gleason, C., Carrington, P., Cassidy, C., Morris, M. R., Ki-
tani, K. M., and Bigham, J. P. (2019). “it’s almost like
they’re trying to hide it”: How user-provided image
descriptions have failed to make twitter accessible. In
The World Wide Web Conference, WWW ’19, page
549–559, New York, NY, USA.
HTML Standard (2021). Html living stan-
dard. Retrieved 30 September 2021 from,
https://html.spec.whatwg.org/multipage/images.html.
Huang, W. and Tan, C. L. (2007). A system for understand-
ing imaged infographics and its applications. In Pro-
ceedings of the 2007 ACM Symposium on Document
Engineering, DocEng ’07, page 9–18.
Jung, C., Mehta, S., Kulkarni, A., Zhao, Y., and Kim, Y.-S.
(2021). Communicating visualizations without visu-
als: Investigation of visualization alternative text for
people with visual impairments. IEEE Transactions
on Visualization and Computer Graphics, 12(1):1–11.
Jung, D., Kim, W., Song, H., Hwang, J.-i., Lee, B., Kim,
B., and Seo, J. (2017). Chartsense: Interactive data
extraction from chart images. In Proceedings of the
2017 CHI Conference on Human Factors in Comput-
ing Systems, CHI ’17, page 6706–6717.
Liu, C., Xie, L., Han, Y., Wei, D., and Yuan, X. (2020).
Autocaption: An approach to generate natural lan-
guage description from visualization automatically.
In 2020 IEEE Pacific Visualization Symposium (Paci-
ficVis), pages 191–195.
Morash, V. S., Siu, Y.-T., Miele, J. A., Hasty, L., and Lan-
dau, S. (2015). Guiding novice web workers in mak-
ing image descriptions using templates. ACM Trans.
Access. Comput., 7(4).
Morris, M. R., Johnson, J., Bennett, C. L., and Cutrell, E.
(2018). Rich Representations of Visual Content for
Screen Reader Users, page 1–11. Association for
Computing Machinery, New York, NY, USA.
Morris, M. R., Zolyomi, A., Yao, C., Bahram, S., Bigham,
J. P., and Kane, S. K. (2016). ”with most of it being
pictures now, i rarely use it”: Understanding twitter’s
ENASE 2022 - 17th International Conference on Evaluation of Novel Approaches to Software Engineering
280