Issue |
E3S Web Conf.
Volume 627, 2025
VI International Conference on Geotechnology, Mining and Rational Use of Natural Resources (GEOTECH-2025)
|
|
---|---|---|
Article Number | 04017 | |
Number of page(s) | 14 | |
Section | Automation, Digital Transformation and Intellectualization for the Sustainable Development of Mining and Transport Systems, Energy Complexes and Mechanical Engineering | |
DOI | https://doi.org/10.1051/e3sconf/202562704017 | |
Published online | 16 May 2025 |
Graph-based neural networks with neural ODEs for robust speech processing in environmental and human-centric systems
1 Turan University, Almaty, Kazakhstan
2 Almaty Technological University, Almaty, Kazakhstan
3 International IT University, Almaty, Kazakhstan
4 Al Farabi Kazakh National University, Almaty, Kazakhstan
* Corresponding author: mambetov.saken@gmail.com
This paper introduces H-STGNN-ODE-DA, a novel model for voice sentiment analysis that combines multi-scale acoustic feature extraction, hierarchical graph neural networks (GNNs), Neural Ordinary Differential Equations (Neural ODEs), and domain-adversarial adaptation. Designed to enhance accuracy and robustness under real-world conditions, the model was evaluated on IEMOCAP, MELD, and EmoDB datasets, outperforming state-of-the-art approaches such as LSTM, CNN, GCN, GAT, DANN, and SPECTRA. Notably, it achieved a 4.0% improvement over GAT on MELD. Neural ODEs enabled effective modeling of continuous emotional transitions, while domain-adversarial adaptation ensured robustness to domain shifts. Ablation studies confirmed the critical role of each component in achieving high performance. The model demonstrated strong cross-domain transferability, maintaining high accuracy across diverse recording conditions. These results position H- STGNN-ODE-DA as a robust and versatile solution for real-world applications in speech processing, including virtual assistants, social media analysis, and customer service systems.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.