Working paper

Development of a Multi-Agent System for Solving Domain Dictionary Construction Problem

Year:

2020

Published in:

SSRN
TF-IDF
RAKE
TextRank
Word2Vec
Schulze method
text data
frequency analysis
parallel com-puting
multi-agent system

The object of research is the use of multi-agent systems for text data mining. The need for this study arose with a tendency to increase the amount of textual information gene rated in the world. Accordingly, it is necessary to develop and research methods of its processing, as well as ways to use the results of this processing, because the methods can’t exist in isolation from practice. At the same time, there is a development of multi-agent sys-tems (MAS), where agents are endowed with some kind of intelligence, these systems can be easily scaled. The use of MAS for text analysis is a promising area. The following methods of text data analysis were used in this study: TF-IDF and RAKE methods, Word2Vec neural network models, and TextRank. The algorithms were compared for their work and the results were compared. The corpus of documents (10–12 texts, 5732–12331 words) from the subject areas of physics and biology were used as a test set. According to the results of the study, one method was chosen, on the basis of which the MAS was built to solve the problem. Additionally, Schulze methods (with one and several winners) were used for voting. With the received system additional researches concerning accuracy and speed of work, and also – influence are carried out system parameters for its operation. It has been found that TF-IDF-based analysis is useful for finding terms in documents with a weak context. The resulting system shows an accuracy of 75 % (3 of the 4 words proposed by the system are terms). The maxi-mum operating time on test cases is 2–3 seconds, which is achieved through the use of parallel calculations and modification of the Schulze method. The results obtained in this paper are heuristic (ontology is a rather vague concept) and require additional elaboration by experts in the relevant fields. However, the results are positive within this experiment.

Other publications by

12 publications found

2025
Journal article

The development of an electronic circuit simulation system using variable tabular bases

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Bogdan Bulakh, Yaroslav Kornachevskyy, Oleksandr Beznosyk, Kostyantyn Kharchenko

2022
Journal article

A theoretically proposed algorithm in a decision tree format for choosing an efficient storage type of large datasets

Publisher: Technology Center PC

Authors: Sofiia Materynska, Vadym Yaremenko, Walery Rogoza

2020
Journal article

Використання штучних нейронних мереж для визначення наявності сердцево‑судинних хвороб та захворювань печінки при малих наборах даних.

Publisher: Луцький національний технічний університет

Authors: Vadym Yaremenko, Sofiia Materynska

2019
Journal article

Підхід до використання фільтра блума для багатокласової класифікації текстових даних в режимі реального часу.

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Dmytro Budonnyi

2021
Working paper

Neural Networks and Monte‑Carlo Method Usage in Multi‑Agent Systems for Sudoku Problem Solving

Publisher: SSRN

Authors: Vadym Yaremenko, Kateryna Poloziuk

2024
Journal article

Forecasting software development costs in scrum iterations using ordinary least squares method

Publisher: Technology Center PC

Authors: Vadym Yaremenko, Kostyantyn Kharchenko, Oleksandr Beznosyk, Bogdan Bulakh, Bogdan Kyriusha

2021
Journal article

A comparative analysis of text data classification accuracy and speed using neural networks, Bloom filter and naive Bayes

Publisher: Technology Center PC

Authors: Olena Hryshchenko, Vadym Yaremenko

2020
Journal article

МОДЕЛЬ МУЛЬТИАГЕНТНОЇ СИСТЕМИ ДЛЯ СЕМАНТИЧНОГО АНАЛІЗУ ТЕКСТІВ

Publisher: Луцький національний технічний університет

Authors: Vadym Yaremenko, Andrii Khudiakov

2019
Journal article

COMPARATIVE ANALYSIS OF SOFTWARE LIBRARIES FOR THE CLASSIFICATION OF TEXT DATA USING ARTIFICIAL NEURAL NETWORKS

Publisher: Таврійський національний університет ім. В.І. Вернадського

Authors: Vadym Yaremenko, Mykola Tarasenko

2020
Journal article

Mobile Driving License System Deployment Model With Security Enhancement

Publisher: Theoretical and cryptographic problems of cybersecurity

Authors: Vadym Yaremenko, V. Blynkov