Novosibirsk State University Journal of Information Technologies
Scientific Journal

ISSN 2410-0420 (Online), ISSN 1818-7900 (Print)



On the Approach to the Thematic Classification of Documents
Anatoly Mikhailovich Fedotov, Oleg Vladimirovich Prozorov, Olga Anatolievna Fedotova, Arseny Audanbekovich Bapanov

Institute of Computational Technologies SB RAS
Novosibirsk State University
State Public Scientific and Technical Library SB RAS
L. N. Gumilyov Eurasian National University

UDC code: 004.91

Abstract
This paper analyzes approaches and algorithms for classifying text documents and considers one approach to their thematic classification. The approach relies on a specially constructed measure of document proximity that takes into account the specifics of the subject area. The weight coefficients in the formula for the proximity measure are set according to the assumed a priori reliability of the data on the corresponding scale.
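
The abstract does not give the formula itself. As a rough illustration only, the sketch below assumes that each document is described by sets of values on several nominal scales, that per-scale similarity is a Jaccard coefficient, and that the overall proximity is a weighted average of the per-scale similarities. The scale names (`keywords`, `authors`, `udc`), the weights, and the functions `jaccard` and `proximity` are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Similarity of two sets of nominal values: |a & b| / |a | b|."""
    if not a and not b:
        return 0.0  # assumption: two empty descriptions carry no evidence
    return len(a & b) / len(a | b)


def proximity(
    doc_a: Dict[str, Set[str]],
    doc_b: Dict[str, Set[str]],
    weights: Dict[str, float],
) -> float:
    """Weighted average of per-scale similarities, in the range [0, 1].

    Each weight stands for the assumed a priori reliability of one scale.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    score = sum(
        w * jaccard(doc_a.get(scale, set()), doc_b.get(scale, set()))
        for scale, w in weights.items()
    )
    return score / total


# Hypothetical weights: keywords are treated as the most reliable scale.
weights = {"keywords": 0.5, "authors": 0.2, "udc": 0.3}

# Made-up document descriptions for illustration.
doc_a = {
    "keywords": {"document", "coordinate indexing", "measure of proximity"},
    "authors": {"Author A"},
    "udc": {"004.91"},
}
doc_b = {
    "keywords": {"document", "thesaurus"},
    "authors": {"Author A"},
    "udc": {"004.91"},
}

print(proximity(doc_a, doc_b, weights))
```

Dividing by the sum of the weights keeps the result between 0 and 1 even when the weights are not normalized, so reliabilities can be adjusted per scale without rescaling the others.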

Key Words
document, coordinate indexing, measure of proximity, nominal scale

How to cite:
Fedotov A. M., Prozorov O. V., Fedotova O. A., Bapanov A. A. On the Approach to the Thematic Classification of Documents // Vestnik NSU Series: Information Technologies. - 2017. - Volume 15, Issue No 1. - P. 79-88. - ISSN 1818-7900. (in Russian).

Full Text in Russian

Available in PDF

References
1. Mihailov A. I., Chernyh A. I., Gilyarevskiy R. S. Fundamentals of Informatics. 2nd ed. Moscow, 1968. (in Russ.)
2. Fedotov A. M., Tusupov D. A., Sambetbaeva M. A., Erimbetova A. S., Bakieva A. M., Idrisova I. A. The model for determining the normal form of a word for the Kazakh language. Vestnik NSU. Series: Information Technologies, 2015, vol. 13, no. 1, p. 107–116. ISSN 1818-7900. EISSN 2410-0420. (in Russ.)
3. Fedotov A. M., Idrisova I. A., Sambetbaeva M. A., Fedotova O. A. Use of the thesaurus in the scientific and educational information system. Vestnik NSU. Series: Information Technologies, 2015, vol. 13, no. 2, p. 86–102. ISSN 1818-7900. EISSN 2410-0420. (in Russ.)
4. Fedotov A. M., Barahnin V. B., Zhizhimov O. L., Fedotova O. A. Model of the information system for supporting scientific and pedagogical activity. Vestnik NSU. Series: Information Technologies, 2014, vol. 12, no. 1, p. 89–101. ISSN 1818-7900. EISSN 2410-0420. (in Russ.)
5. Bolshakova E. I., Klyshynskiy E. S., Lande D. V., Noskov A. A., Peskova O. V., Yagunova E. V. Automatic processing of texts in natural language and computer linguistics. Moscow, 2011, 272 p. (in Russ.)

Publication information
Main title: Vestnik NSU Series: Information Technologies, Volume 15, Issue No 1 (2017).
Parallel title: Novosibirsk State University Journal of Information Technologies Volume 15, Issue No 1 (2017).

Key title: Vestnik Novosibirskogo gosudarstvennogo universiteta. Seriâ: Informacionnye tehnologii
Abbreviated key title: Vestn. Novosib. Gos. Univ., Ser.: Inf. Tehnol.
Variant title: Vestnik NGU. Seriâ: Informacionnye tehnologii

Year of Publication: 2017
ISSN: 1818-7900 (Print), ISSN 2410-0420 (Online)
Publisher: Novosibirsk State University Press

inftech@vestnik.nsu.ru
© 2006-2017, Novosibirsk State University.