ACKNOWLEDGEMENTS
The preparation of this paper was supported by the en-
able Cluster and is catalogued by the enable Steering
Committee as enable 065 (http://enable-cluster.de).
This work was funded by a grant of the German
Ministry for Education and Research (BMBF) FK
01EA1807A.
REFERENCES
Achananuparp, P. and Weber, I. (2016). Extracting food
substitutes from food diary via distributional similar-
ity. In 10th ACM Conference on Recommender Sys-
tems.
Altosaar, J. (2017). food2vec - augmented cook-
ing with machine intelligence. https://jaan.io/
food2vec-augmented-cooking-machine-intelligence/.
Accessed: 2020-09-05.
Beltagy, I., Lo, K., and Cohan, A. (2019). Scibert: A pre-
trained language model for scientific text. In Proceed-
ings of the 2019 Conference on Empirical Methods
in Natural Language Processing and the 9th Inter-
national Joint Conference on Natural Language Pro-
cessing (EMNLP-IJCNLP), pages 3606–3611.
Bernhardsson, E. (2019). Annoy. https://github.com/
spotify/annoy. Accessed: 2020-07-04.
Boscarino, C., Nedovi
´
c, V., Koenderink, N. J., and Top, J. L.
(2014). Automatic extraction of ingredient’s substi-
tutes. In Proceedings of the 2014 ACM International
Joint Conference on Pervasive and Ubiquitous Com-
puting: Adjunct Publication, pages 559–564.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-
Fei, L. (2009). Imagenet: A large-scale hierarchical
image database. In 2009 IEEE conference on com-
puter vision and pattern recognition, pages 248–255.
Ieee.
Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K.
(2018). Bert: Pre-training of deep bidirectional trans-
formers for language understanding. arXiv preprint
arXiv:1810.04805.
Druck, G. and Pang, B. (2012). Spice it up? mining refine-
ments to online instructions from user generated con-
tent. In Proceedings of the 50th Annual Meeting of the
Association for Computational Linguistics (Volume 1:
Long Papers), pages 545–553, Jeju Island, Korea. As-
sociation for Computational Linguistics.
Engstler, L. N. (2020). Ontology learning from text in the
food domain. Unpublished master’s thesis. Technical
University of Munich, Munich, Germany.
Harris, Z. S. (1954). Distributional structure. Word, 10(2-
3):146–162.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep resid-
ual learning for image recognition. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 770–778.
Hinds, R. (2016). Unsupervised learning in scala using
word2vec. https://automateddeveloper.blogspot.com/
2016/10/unsupervised-learning-in-scala-using.html.
Accessed: 2020-09-05.
Honnibal, M. and Montani, I. (2017). spaCy 2: Natural lan-
guage understanding with Bloom embeddings, convo-
lutional neural networks and incremental parsing. To
appear.
Hugging Face (2020). Run language modeling.
https://github.com/huggingface/transformers/blob/
master/examples/language-modeling. Accessed:
2020-07-04.
Joachim, D. (2010). The Food Substitutions Bible. Robert
Rose Inc., Toronto, Ontario, Canada, 2 edition.
Kazama, M., Sugimoto, M., Hosokawa, C., Matsushima,
K., Varshney, L. R., and Ishikawa, Y. (2018). A neural
network system for transformation of regional cuisine
style. Frontiers in ICT, 5:14.
Landis, J. R. and Koch, G. G. (1977). The measurement of
observer agreement for categorical data. Biometrics,
33(1).
Lawo, D., B
¨
ohm, L., and Esau, M. (2020). Supporting
plant-based diets with ingredient2vec.
Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H.,
and Kang, J. (2020). Biobert: a pre-trained biomedi-
cal language representation model for biomedical text
mining. Bioinformatics, 36(4):1234–1240.
Loshchilov, I. and Hutter, F. (2019). Decoupled weight
decay regularization. In International Conference on
Learning Representations.
Marin, J., Biswas, A., Ofli, F., Hynes, N., Salvador, A., Ay-
tar, Y., Weber, I., and Torralba, A. (2019). Recipe1m+:
A dataset for learning cross-modal embeddings for
cooking recipes and food images. IEEE Trans. Pat-
tern Anal. Mach. Intell.
Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013).
Efficient estimation of word representations in vector
space. arXiv preprint arXiv:1301.3781.
ˇ
Reh
˚
u
ˇ
rek, R. and Sojka, P. (2010). Software Framework
for Topic Modelling with Large Corpora. In Proceed-
ings of the LREC 2010 Workshop on New Challenges
for NLP Frameworks, pages 45–50, Valletta, Malta.
ELRA. http://is.muni.cz/publication/884893/en.
Reiplinger, M., Wiegand, M., and Klakow, D. (2014).
Relation extraction for the food domain without la-
beled training data–is distant supervision the best so-
lution? In International Conference on Natural Lan-
guage Processing, pages 345–357. Springer.
Rieger, B. B. (1991). On distributed representation in word
semantics. International Computer Science Institute
Berkeley, CA.
Rubenstein, H. and Goodenough, J. B. (1965). Contextual
correlates of synonymy. Communications of the ACM,
8(10):627–633.
Sauer, C., Haigh, A., and Rachleff, J. (2017). Cooking up
food embeddings.
Shidochi, Y., Takahashi, T., Ide, I., and Murase, H. (2009).
Finding replaceable materials in cooking recipe texts
considering characteristic cooking actions. In Pro-
ceedings of the ACM multimedia 2009 workshop on
HEALTHINF 2021 - 14th International Conference on Health Informatics
76