Issue |
E3S Web of Conf.
Volume 413, 2023
XVI International Scientific and Practical Conference “State and Prospects for the Development of Agribusiness - INTERAGROMASH 2023”
|
|
---|---|---|
Article Number | 03011 | |
Number of page(s) | 15 | |
Section | Social and Human Ecology | |
DOI | https://doi.org/10.1051/e3sconf/202341303011 | |
Published online | 11 August 2023 |
The pipeline processing of NLP
Tashkent State University of Uzbek Language and Literature named after A.Navai, 100100 Tashkent, Uzbekistan
* Corresponding author: elov@navoiy-uni.uz
The problem of NLP should be divided into several small parts and solved step by step. In this article, where NLP is necessary at every stage of solving the problem, all forms of text processing are considered. The step-by-step text processing is called a pipeline process in NLP. When creating any NLP model, the pipeline process is a sequence of steps that must be carried out. The planning and development of the text processing is considered as the starting point for the creation of any NLP project. This article discusses the steps involved in implementing a pipeline process and their role in solving NLP tasks. This article analyzed the most common preliminary processing steps on the NLP processing pipeline. All processing stages are pre-trained in various NLP libraries, identified as usable models. If necessary, additional, modified preprocessing steps can be developed depending on the given problem condition. One can determine how a particular initial processing stage serves a given NLP problem by many experimentations.
© The Authors, published by EDP Sciences, 2023
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.