Centre for Language and Semantic Technologies

News

HSE Participates in Conference on International Exchange of Professionals in China

October 28

All news

Publications

Book

Kuznetsov S.

Data Analytics and Management in Data Intensive Domains: 25th International Conference, DAMDID/RCDL 2023, Moscow, Russia, October 24–27, 2023, Revised Selected Papers

This book constitutes the post-conference proceedings of the 25th International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2023, held in Moscow, Russia, during 24-27 October 2023.

The 21 papers presented here were carefully reviewed and selected from 75 submissions. These papers are organized in the following topical sections: Data Models and Knowledge Graphs; Databases in Data Intensive Domains; Machine learning methods and applications; Data Analysis in Astronomy & Information extraction from text. Papers from keynote talks have also been included in this book.

Vol. 2086: Communications in Computer and Information Science. Springer, 2024.
Article

Sohrabi M., Fathollahi-Fard A. M., Messaoudi M. et al.

Addressing Operating Room Planning and Scheduling Problem by Genetic Engineering Algorithm

Operating room (OR) planning and scheduling is a highly complex combinatorial optimization problem that involves patient assignment, OR allocation, and surgical sequencing within a constrained planning horizon. This study addresses the OR scheduling problem by incorporating downstream bed availability in both hospital wards and intensive care units (ICUs), aiming to improve operational efficiency and patient outcomes. A mixed-integer programming (MIP) model is developed to minimize the total completion time across all operating rooms during the planning period. Given the NP-hard nature of the problem, a novel Genetic Engineering Algorithm (GEA) is proposed as an advanced extension of the traditional Genetic Algorithm (GA). The GEA incorporates three innovative search strategies inspired by genetic engineering principles, refining the crossover and mutation operators to improve solution quality and convergence speed. To evaluate the effectiveness of the proposed algorithm, GEA and its variants are tested on 18 benchmark instances of varying problem sizes for the proposed OR planning and scheduling problem. A real-world case study from a hospital in Pakistan illustrates the practical applicability of the proposed approach using different real-world criteria. The results demonstrate that GEA consistently outperforms state-of-the-art metaheuristic algorithms. Its robustness and accuracy are further validated using 10 standard mathematical benchmark functions. The results confirm the GEA’s effectiveness and efficiency in addressing complex OR planning and scheduling problems and underscore its potential for advancing metaheuristic algorithm design in combinatorial optimization.

IEEE Access. 2025.
Book chapter

Kim J., Lee H., Jeon H. et al.

From Patterns to Predictions: A Shapelet-Based Framework for Directional Forecasting in Noisy Financial Markets

Directional forecasting in financial markets requires both accuracy and interpretability. Before the advent of deep learning, interpretable approaches based on human-defined patterns were prevalent, but their structural vagueness and scale ambiguity hindered generalization. In contrast, deep learning models can effectively capture complex dynamics, yet often offer limited transparency. To bridge this gap, we propose a two-stage framework that integrates unsupervised pattern extracion with interpretable forecasting. (i) SIMPC segments and clusters multivariate time series, extracting recurrent patterns that are invariant to amplitude scaling and temporal distortion, even under varying window sizes. (ii) JISC-Net is a shapelet-based classifier that uses the initial part of extracted patterns as input and forecasts subsequent partial sequences for short-term directional movement. Experiments on Bitcoin and three S&P 500 equities demonstrate that our method ranks first or second in 11 out of 12 metric--dataset combinations, consistently outperforming baselines. Unlike conventional deep learning models that output buy-or-sell signals without interpretable justification, our approach enables transparent decision-making by revealing the underlying pattern structures that drive predictive outcomes.

In bk.: CIKM '25: Proceedings of the 34th ACM International Conference on Information and Knowledge Management. ACM, 2025.
Working paper

Меньшиков И. А., Бернадотт А. К., Elvimov N. S.

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

Accurate segmentation of blood vessels in brain magnetic resonance angiography (MRA) is essential for successful surgical procedures, such as aneurysm repair or bypass surgery. Currently, annotation is primarily performed through manual segmentation or classical methods, such as the Frangi filter, which often lack sufficient accuracy. Neural networks have emerged as powerful tools for medical image segmentation, but their development depends on well-annotated training datasets. However, there is a notable lack of publicly available MRA datasets with detailed brain vessel annotations. To address this gap, we propose a novel semi-supervised learning lightweight neural network with Hessian matrices on board for 3D segmentation of complex structures such as tubular structures, which we named HessNet. The solution is a Hessian-based neural network with only 6000 parameters. HessNet can run on the CPU and significantly reduces the resource requirements for training neural networks. The accuracy of vessel segmentation on a minimal training dataset reaches state-of-the-art results. It helps us create a large, semi-manually annotated brain vessel dataset of brain MRA images based on the IXI dataset (annotated 200 images). Annotation was performed by three experts under the supervision of three neurovascular surgeons after applying HessNet. It provides high accuracy of vessel segmentation and allows experts to focus only on the most complex important cases. The dataset is available at https://git.scinalytics.com/terilat/VesselDatasetPartly.

Statistical mechanics. arXie. arXive, 2025

All publications

Centre for Language and Semantic Technologies

International Laboratory of Intelligent Systems and Structural Analysis

Laboratory of Models and Methods of Computational Pragmatics

Laboratory of Complex Systems Modelling and Control

Semantics Analysis Laboratory (in Russian)

News

HSE Participates in Conference on International Exchange of Professionals in China

Publications

Data Analytics and Management in Data Intensive Domains: 25th International Conference, DAMDID/RCDL 2023, Moscow, Russia, October 24–27, 2023, Revised Selected Papers

Addressing Operating Room Planning and Scheduling Problem by Genetic Engineering Algorithm

From Patterns to Predictions: A Shapelet-Based Framework for Directional Forecasting in Noisy Financial Markets

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset

About

The centre's main objectives are:

Structure

International Laboratory of Intelligent Systems and Structural Analysis

Laboratory of Models and Methods of Computational Pragmatics

Laboratory of Complex Systems Modelling and Control

Semantics Analysis Laboratory (in Russian)

Management

News

HSE Participates in Conference on International Exchange of Professionals in China

Publications

Data Analytics and Management in Data Intensive Domains: 25th International Conference, DAMDID/RCDL 2023, Moscow, Russia, October 24–27, 2023, Revised Selected Papers

Addressing Operating Room Planning and Scheduling Problem by Genetic Engineering Algorithm

From Patterns to Predictions: A Shapelet-Based Framework for Directional Forecasting in Noisy Financial Markets

Hessian-based lightweight neural network for brain vessel segmentation on a minimal training dataset