Learning from Code Repositories to Recommend Model Classes

By: Thibaut Capuano, Houari Sahraoui, Benoit Frenay, Benoit Vanderose

Abstract

With the growing popularity of machine learning algorithms, dramatic advances have been made for code completion, and specifically method-call completion. These advances were also possible thanks to the availability of large code repositories to learn from and to the well-defined boundaries of the method-call completion problem. This is, however, not the case for design completion, where model repositories are scarce and the space of possibilities for design completion is theoretically infinite. We propose in this paper an approach that learns numeric representations of domain concepts and their relations from code repositories in order to recommend classes for UML class diagrams.

Keywords

Model completion, document embedding, Doc2Vec.

Cite as:

Thibaut Capuano, Houari Sahraoui, Benoit Frenay, Benoit Vanderose, “Learning from Code Repositories to Recommend Model Classes”, Journal of Object Technology, Volume 21, no. 3 (July 2022), pp. 3:1-11, doi:10.5381/jot.2022.21.3.a4.

PDF | DOI | BiBTeX | Tweet this | Post to CiteULike | Share on LinkedIn

The JOT Journal   |   ISSN 1660-1769   |   DOI 10.5381/jot   |   AITO   |   Open Access   |    Contact