The programme is designed for exchanging the results of Data Science experiments, assessing the quality of these results, and deploying the most promising solutions to the production server environment. It consists of an API for managing file storage, an MLOps system, and a database for storing the solutions submitted by users.
Computer type: PC based on Intel x86 processor
OS: CentOS-7
The programme is designed to generate synthetic tabular data, time series and images. It also implements and evaluates regression and classification algorithms, assessing their performance on both generated and raw data. Functionality: loading of training data; analysis of unique values; generation of synthetic data using the generative artificial intelligence (AI) models WGAN, VAE and Real NVP; metrics that measure the confidentiality and quality of the generated synthetic data and its difference from real data; visualisation of research results.
The programme is designed for training symbolic regression based on an autoencoder. It is robust to noise and allows prior values for the symbolic expression to be configured for applications in physics and other fields.
The program is designed for modelling radio wave propagation by ray tracing between a transmitter and a moving receiver, calculating signal quality indicators, collecting and processing the obtained data. The program contains libraries for ray tracing modelling, working with arrays and tables. The program includes a module for modelling radio waves with ray tracing, calculation of modelling indicators, saving of selected indicators, module for processing of obtained data.
Computer type: IBM PC-compatible PC with a 10 × 3.2 GHz processor
OS: macOS 12.6
The software provides the user with the ability to convert ray tracing data into a frame sequence format, configure and train a neural network based on it and then save it.
The programme is designed to store terms and vector representations, exchange and manipulate data for semantic analysis and was developed due to the need to extract vectors and related features from heavyweight machine and deep learning models.
Computer type: HP ProLiant DL-160 G6 server or equivalent
OS: Ubuntu 14.04.5 LTS (Trusty Tahr) or higher
The programme standardises data from different information systems to ensure its quality, reliability and validity, makes data acquisition transparent by storing references to sources and linking named entities with subject terms, and supports searching for terms in Russian and English. Its application area is linguistic analysis of texts in Russian and English.
Computer type: HP ProLiant DL-160 G6 server or analogue
OS: Ubuntu 14.04.5 LTS (Trusty Tahr) and higher
The programme predicts the volume of a goods market from the textual trace of the corresponding terms and is applicable to scientific and technological forecasting in the interests of strategic analytics and foresight studies.
Computer type: HP ProLiant DL-160 G6 server or analogue
The programme checks facts in several languages (Russian, English, Bulgarian) based on sources from Wikipedia. For each language the local version of Wikipedia is used. The input data is a set of several facts entered by the tool user. The output is the result of checking each statement, represented by one of three answers: 1) Truth; 2) Lying; 3) Not Enough Info.
The programme is based on keyword extraction. The algorithms are designed in such a way that the most relevant words and phrases are extracted as a result of text processing. The programme receives arrays of unstructured text data as input. The text of the document is broken down into sentences, dependencies between words are identified based on analysing the syntax of the processed sentence. Based on the identified dependencies between words, individual nouns and adjectives are combined into word combinations, n-grams, to which the NER model assigns tags, which are indications of named entities. By means of ranking implemented on the basis of selected TF-IDF, RAKE, BM25, KeyBERT algorithms, key metainformation is selected from the whole array of available metainformation. The programme is applicable for a wide range of tasks of semantic analysis and information retrieval.
Computer type: HP ProLiant DL-160 G6 server or analogue
OS: Ubuntu 14.04.5 LTS (Trusty Tahr) or higher
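The ranking step of the keyword-extraction programme above can be illustrated with TF-IDF, one of the algorithms it names. The following is a minimal sketch, not the programme's actual code: it assumes whitespace tokenisation and single words as candidates (the programme itself ranks n-grams built from syntactic dependencies), and the function name `tfidf_rank` and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def tfidf_rank(documents, doc_index, top_k=3):
    """Rank the terms of one document by TF-IDF against a small corpus."""
    # Assumption: lowercase whitespace tokenisation stands in for real parsing.
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: the number of documents in which each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    tokens = tokenized[doc_index]
    tf = Counter(tokens)
    scores = {
        term: (count / len(tokens)) * math.log(n_docs / df[term])
        for term, count in tf.items()
    }
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Hypothetical toy corpus, for illustration only.
corpus = [
    "ray tracing models radio wave propagation",
    "neural network models predict radio signal quality",
    "keyword extraction ranks candidate phrases",
]
top_terms = tfidf_rank(corpus, doc_index=0)
```

Terms that occur in only one document of the corpus receive the highest weight, while terms shared across documents are pushed down the ranking.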
The programme is designed to automate the creation of models for detecting objects and actions. Functionality includes: preprocessing of the dataset, including images and annotations; generation of configuration files; automatic selection of hyperparameters; selection of the neural network architecture; an early stopping algorithm that saves the best checkpoints; and checking the correctness of an arbitrary sequence of objects.
Computer type: IBM PC-compatible PC
OS: Linux, MacOS, Windows
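The early stopping with best-checkpoint saving mentioned in the entry above could work roughly as follows. This is a sketch under stated assumptions: the validation losses are supplied as a plain list instead of coming from a real training loop, the `patience` parameter and the function name are hypothetical, and saving a checkpoint is represented only by recording the best epoch.

```python
def train_with_early_stopping(val_losses, patience=2):
    """Return (best_epoch, best_loss, stopped_epoch) for a list of validation losses."""
    best_loss = float("inf")
    best_epoch = None
    epochs_without_improvement = 0
    stopped_epoch = None  # stays None if training runs through every epoch
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            # A real trainer would write the model checkpoint to disk here.
            best_loss, best_epoch = loss, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                stopped_epoch = epoch
                break
    return best_epoch, best_loss, stopped_epoch


# Hypothetical loss curve: improves for three epochs, then degrades.
result = train_with_early_stopping([0.90, 0.70, 0.65, 0.66, 0.68, 0.60], patience=2)
```

With a patience of 2 the loop stops after two consecutive epochs without improvement, so the final value in the list is never reached and the checkpoint from the best epoch is the one kept.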
The programme consists of three modules that train and test different graph networks on the prepared data and plot additional curves (ROC, PR and F1 score curves). As input, the programme receives the organism's genome and the results of ChIP-Atlas experiments converted into sparse vector format. The model parameters and the class balance of the training dataset must be specified manually. The model is trained on a training sample (first module) and then tested on the full genome (second module). In the third module, ROC, PR and F1 score curves can be plotted for the required dataset.
Computer type: IBM PC-compatible PC
OS: Mac OS X 10.11 or higher, Linux Ubuntu 16.04 or higher
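The PR and F1 score curves plotted by the third module of the graph-network programme above are built from precision, recall and F1 evaluated at a series of decision thresholds. The sketch below shows that calculation only; the labels, scores and thresholds are hypothetical, and the function name is not taken from the programme.

```python
def precision_recall_f1(y_true, y_score, threshold):
    """Compute precision, recall and F1 for binary labels at one score threshold."""
    predicted = [score >= threshold for score in y_score]
    tp = sum(1 for p, t in zip(predicted, y_true) if p and t == 1)
    fp = sum(1 for p, t in zip(predicted, y_true) if p and t == 0)
    fn = sum(1 for p, t in zip(predicted, y_true) if not p and t == 1)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


# Hypothetical labels and model scores, for illustration only.
y_true = [1, 0, 1, 1, 0, 0]
y_score = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]

# Each tuple is (threshold, precision, recall, f1): one point on each curve.
curve = [(t, *precision_recall_f1(y_true, y_score, t)) for t in (0.2, 0.5, 0.85)]
```

Plotting recall against precision over many thresholds gives the PR curve, and plotting F1 against the threshold gives the F1 score curve.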
The framework integrates software modules for predicting the location of genomic functional elements (GFEs) using deep learning methods. It lets the user select the type and architecture of the neural network and the omics data used for prediction, add transfer learning methods, choose prediction parameters, and evaluate prediction quality.
Computer type: IBM PC-compatible PC
OS: Mac OS X 10.11 or higher, Linux Ubuntu 16.04 or higher
The programme performs whole-genome analysis using the DNABERT transformer algorithm trained on experimentally identified sequences that form Z-DNA (Z-flipons). The algorithm provides a significant performance improvement (F1 = 0.83) over existing approaches and implements computational mutagenesis to assess the effect of base substitutions on Z-DNA formation.
The programme provides a graphical interface for applying a pre-trained machine learning model (a multilayer perceptron) to assess the presence and degree of dyslexia in a schoolchild based on gender, age, school grade and oculography data. To use the programme, the user specifies the patient's demographic data and attaches a .CSV file with oculography data (duration and pupil fixation parameters). The result is an assignment to one of three classes: dyslexia, dyslexia risk or normal.
This is the first artificial intelligence-based solution with robust results for detecting dyslexia from the eye movement data of Russian-speaking schoolchildren in grades one to six. It includes a comprehensive study of the performance and hyperparameter tuning of ten classification and seven regression algorithms. It also presents the second-largest eye movement data set in this area of research, with three discrete target values and one continuous one.
The environmental monitoring system reduces the time needed to predict the spatial distribution of harmful substances in atmospheric air and increases the accuracy of the prediction. It can simultaneously cover multiple local monitoring systems located at a single industrial site and multiple territorial systems, each of which connects the local environmental monitoring systems of different industrial sites. The invention can be used for integrated planning and for notification of the risk of atmospheric air pollution by harmful substances from industrial enterprises.
The programme is designed to predict meteorological variables (temperature, precipitation, pressure, humidity, wind speed) using the WRF (Weather Research and Forecasting) simulator. It consists of two modules: the first generates the training sample, and the second optimises the WRF hyperparameters.
Computer type: Intel processor based PC
OS: Fedora Linux 36
The programme consists of a number of interrelated modules: a module for collecting and processing text materials, a module for classifying text materials by sentiment using artificial intelligence, and a module for calculating various sentiment metrics. It provides automated collection, processing and detection of sentiment (tone) of investment-themed text materials on Telegram channels.
The programme solves classification and regression problems in financial economics using dense-layer neural network models and interprets the results by means of explainable artificial intelligence (AI) based on Shapley values, for example to identify the determinants of stock and bond exchange characteristics. It includes modules for loading and exporting data, building neural network models for regression and classification tasks, explainable AI and graph generation.
Computer type: IBM PC-compatible PC
OS: Windows 7/8.1/10/11
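The Shapley-based interpretation mentioned in the financial economics entry above attributes a model's output to its input features by averaging each feature's marginal contribution over all orderings of the features. The sketch below computes exact Shapley values for a toy value function; it is an illustration of the concept, not the programme's implementation, and the feature names (`rate`, `volume`, `sector`) and the `toy_value` function are hypothetical. Exact enumeration is practical only for a handful of features.

```python
from itertools import permutations


def shapley_values(value_fn, features):
    """Exact Shapley values: mean marginal contribution over all feature orderings."""
    totals = {feature: 0.0 for feature in features}
    orderings = list(permutations(features))
    for order in orderings:
        coalition = set()
        for feature in order:
            before = value_fn(frozenset(coalition))
            coalition.add(feature)
            after = value_fn(frozenset(coalition))
            totals[feature] += after - before
    return {feature: total / len(orderings) for feature, total in totals.items()}


def toy_value(coalition):
    """Hypothetical model output when only the features in `coalition` are known."""
    value = 0.0
    if "rate" in coalition:
        value += 2.0
    if "volume" in coalition:
        value += 1.0
    if "rate" in coalition and "volume" in coalition:
        value += 1.0  # interaction effect, shared equally by the two features
    return value


phi = shapley_values(toy_value, ["rate", "volume", "sector"])
```

The resulting values sum to the output for the full feature set, and a feature that never changes the output (`sector` here) receives a value of zero.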
The utility model relates to accessories for smartphones and can be used as a security device including additional functionality in the form of support for investment decisions based on artificial intelligence.
The programme solves the problem of automatic assignment of points for open-ended tasks of a digital tool for assessing the reading literacy of 3rd grade students. Three classification models are developed for each of the three tasks of the tool. The programme can be used by developers of open-type tasks and experts checking such tasks.
Computer type: IBM PC-compatible PC
OS: Windows 7/8/8.1/10, Linux
The programme is a web application that determines the ethics index of companies in different sectors of the economy. It automatically collects data, including text data, from public sources and saves it to a database. Based on the collected data, an ethics index is calculated using artificial intelligence models and can then be visualised for the user.
Computer type: IBM PC-compatible PC
OS: Windows 7/8/8.1/10; Ubuntu 18.04/20.04/22.04
Forecasting the number of cases of arbitration courts of the Russian Federation
The programme is designed to forecast the number of arbitration cases by category for future time periods based on analysis of the data accumulated in the data warehouse. It can also visualise the volume of data held in the system in the form of graphs.
Computer type: IBM PC compatible computer, 4x2.5 GHz processor
OS: Windows 11
The programme is designed to determine the entropy structure of time series describing the number of new cases in courts of the Russian Federation. It can be used to automate and improve the quality of work of Russian courts, statistics departments and the research divisions of legal centres. Main external functions: input of data on the number of new cases in a court for a given period, region and category; calculation of descriptive statistics; a conclusion on the structure of the series.