Authors: Vitaly Romanov, Vladimir Ivanov and Giancarlo Succi

Affiliation: Innopolis University, Innopolis, Russia

Keyword(s): Source Code, Embeddings, Hierarchical Embeddings, Graph, Dataset, Machine Learning, Python, Java.

Abstract: Any source code can be represented as a graph. This kind of representation captures the interactions between the elements of a program, such as functions and variables. Modeling these interactions can enable us to infer the purpose of a code snippet, a function, or even an entire program. Lately, more and more work has appeared in which source code is represented in the form of a graph. One of the difficulties in evaluating the usefulness of such a representation is the lack of a proper dataset and evaluation metric. Our contribution is a dataset that represents programs written in Python and Java in the form of dependency and function call graphs. In this dataset, multiple projects are analyzed and united into a single graph. The nodes of the graph represent functions, variables, classes, methods, interfaces, etc. Nodes for functions carry information about how these functions are constructed internally and where they are called from. Such graphs enable training hierarchical vector representations for source code. Moreover, some functions come with textual descriptions (docstrings), which makes it possible to learn useful tasks such as API search and generation of documentation.
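
To make the idea of a function call graph concrete, the following is a minimal sketch and not the authors' extraction pipeline. It assumes Python's standard `ast` module is an acceptable stand-in for their tooling, and it only records direct calls by plain name from top-level functions. The names `build_call_graph` and `EXAMPLE` are hypothetical and introduced only for illustration.

```python
import ast
from typing import Dict, Set


def build_call_graph(source: str) -> Dict[str, Set[str]]:
    """Map each top-level function name to the plain names it calls directly."""
    tree = ast.parse(source)
    graph: Dict[str, Set[str]] = {}
    for node in tree.body:
        if isinstance(node, ast.FunctionDef):
            # Register the function as a graph node even if it calls nothing.
            callees = graph.setdefault(node.name, set())
            for inner in ast.walk(node):
                # Only calls of the form name(...) are recorded; method calls
                # such as obj.method(...) are skipped in this simplified sketch.
                if isinstance(inner, ast.Call) and isinstance(inner.func, ast.Name):
                    callees.add(inner.func.id)
    return graph


# Hypothetical input program used only to exercise the sketch.
EXAMPLE = '''
def load(path):
    return open(path).read()

def count_words(path):
    return len(load(path).split())
'''

call_graph = build_call_graph(EXAMPLE)
for caller, callees in sorted(call_graph.items()):
    print(caller, "->", sorted(callees))
```

Each key is a function node and each set holds its outgoing call edges. A dataset like the one described would additionally need nodes for variables, classes, and modules, edges that cross project boundaries, and the docstrings attached to function nodes, none of which this sketch attempts.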

CC BY-NC-ND 4.0

Paper citation in several formats:
Romanov, V.; Ivanov, V. and Succi, G. (2020). Representing Programs with Dependency and Function Call Graphs for Learning Hierarchical Embeddings. In Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 2: ICEIS; ISBN 978-989-758-423-7; ISSN 2184-4992, SciTePress, pages 360-366. DOI: 10.5220/0009511803600366

@conference{iceis20,
author={Vitaly Romanov and Vladimir Ivanov and Giancarlo Succi},
title={Representing Programs with Dependency and Function Call Graphs for Learning Hierarchical Embeddings},
booktitle={Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 2: ICEIS},
year={2020},
pages={360-366},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009511803600366},
isbn={978-989-758-423-7},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 2: ICEIS
TI - Representing Programs with Dependency and Function Call Graphs for Learning Hierarchical Embeddings
SN - 978-989-758-423-7
IS - 2184-4992
AU - Romanov, V.
AU - Ivanov, V.
AU - Succi, G.
PY - 2020
SP - 360
EP - 366
DO - 10.5220/0009511803600366
PB - SciTePress