Issue |
E3S Web Conf.
Volume 431, 2023
XI International Scientific and Practical Conference Innovative Technologies in Environmental Science and Education (ITSE-2023)
|
|
---|---|---|
Article Number | 05012 | |
Number of page(s) | 7 | |
Section | IT and Mathematical Modeling in the Environment | |
DOI | https://doi.org/10.1051/e3sconf/202343105012 | |
Published online | 13 October 2023 |
The principles of building a machine-learning-based service for converting sequential code into parallel code
Kazan National Research Technical University named after A. N. Tupolev – KAI, Kazan, Russia
* Corresponding author: landwatersun@mail.ru
This article presents a novel approach for automating the parallelization of programming code using machine learning. The approach centers on a two-phase algorithm, incorporating a training phase and a transformation phase. In the training phase, a neural network is trained using data in the form of Abstract Syntax Trees, with Word2Vec being employed as the primary model for converting the syntax tree into numerical arrays. The choice of Word2Vec is attributed to its efficacy in encoding words with less reliance on context, compared to other natural language processing models such as GloVe and FastText. During the transformation phase, the trained model is applied to new sequential code, transforming it into parallel programming code. The article discusses in detail the mechanisms behind the algorithm, the rationale for the selection of Word2Vec, and the subsequent processing of code data. This methodology introduces an intelligent, automated system capable of understanding and optimizing the syntactic and semantic structures of code for parallel computing environments. The article is relevant for researchers and practitioners seeking to enhance code optimization techniques through the integration of machine learning models.
© The Authors, published by EDP Sciences, 2023
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.