Issue |
E3S Web Conf.
Volume 297, 2021
The 4th International Conference of Computer Science and Renewable Energies (ICCSRE'2021)
|
|
---|---|---|
Article Number | 01058 | |
Number of page(s) | 7 | |
DOI | https://doi.org/10.1051/e3sconf/202129701058 | |
Published online | 22 September 2021 |
Arabic Topic Identification: A Decade Scoping Review
1 Faculty of Sciences, Mohammed First University, Oujda, Morocco
2 L-STI, T-IDMS, Faculty of Sciences and Techniques Errachidia, Moulay Ismail University, Meknes, Morocco
* Corresponding author: elkah.anoual.mri@gmail.com
Arabic topic identification is a part of text classification that aims to assign a given text a set of pre-defined classes (i.e., topics) based on its content and extracted features. This task can be performed using rule-based methods or data-driven approaches. These latter gained more popularity since they require much less human effort to accurately classify a large number of documents. Due to the tremendous growth of Web contents primarily in news websites and social media, topic identification had received a great deal of attention over the last years, and has become a cornerstone of both search engines and information retrieval. The Arabic language is the fourth most used language on the web and records the highest growth in the last two decades (2000–2020). Based on these facts currently available, it seems fair to look closer at the advancements in the Arabic topic identification in the last decade. To this end, we performed the first of its kind scoping review that addresses recent studies in the field of Arabic topic identification that follows the PRISMA-ScR guidelines. This review is based on various online bibliographic databases (e.g., Springer, ScienceDirect, and IEEE Xplore) and datasets search engines (e.g., Google Dataset Search).
© The Authors, published by EDP Sciences, 2021
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.