Skip to content

Latest commit

 

History

History
27 lines (19 loc) · 1.72 KB

README.rst

File metadata and controls

27 lines (19 loc) · 1.72 KB

Test status Test coverage Docs status

Название исследуемой задачи:Автоматическое выделение терминов для тематического моделирования
Тип научной работы:M1P
Автор:Никитина Мария Александровна
Научный руководитель:Доктор физико-математических наук, Воронцов Константин Вячеславович
Научный консультант:Аспирант, Потапова Полина Сергеевна

Abstract

Nowadays, new scientific terms appear every day. It is necessary to learn how to extract them in the collection of documents. Doing it manually is long and expensive, because you need to attract highly specialized specialists. This article discusses the problem of automatic term extraction. To solve it the collocation allocation method (TopMine) in combination with the modular technology of thematic modeling (using the BigARTM library) and modern methods based on neural network models of the language are used. These two methods have not been compared before.