FORMAL-FUNCTIONAL MODELS OF THE UZBEK ELECTRON CORPUS

Authors

  • Nilufar Abdurakhmonova Associate professor of National University of Uzbekistan

Abstract

The paper is devoted to the structure and its linguistic annotation for building Uzbek Corpus. Linguistic annotation, metadata and corpus manager as formal-functional model of the corpus are important for usage for many purposes. The fact that the platform allows users to address language and literature issues, use it online. The Uzbek corpus based on structural and sub corpus models, which partially represented in this paper, is going on process to develop Uzbek language technology.

Keywords: Uzbek corpus, morphoanalyzer, metadata, parallel corpora, text analysis, corpus manager.

References

Sulevmanov, D., Gatiatullin, A., Prokopyev, N., Abdurakhmonova, N. (2020) Turkic morpheme web portal as a platform for turkology research International Conference on Information Science and Communications Technologies, ICISCT 2020, 2020, 9351500.

Khusainov, A., Suleymanov, D., Gilmullin, R., Minsafina, A., Kubedinova, L., Abdurakhmonova. N. (2020) First Results of the “TurkLang-7” Project: Creating Russian-Turkic Parallel Corpora and MT Systems CMLS 2020 CEUR Workshop Proceedings, 2020, pp. 90-101.

Khusainov, A., Suleymanov, D., Gilmullin, R., Gatiatullin, A. (2018) Building the Tatar-Russian NMT system based on re-translation of multilingual data Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11107 LNAI, pp. 163–170.

Абдураҳмонова Н. (2020) Замонавий корпусларнинг компьютер моделлари // Ўзбекистонда хорижий тиллар. -2020. - № 1(30). - Б. 50-58. https://doi.org/ 10.36078/

Мухамедшин, Д.Р., Сулейманов Д.Ш. (2018) Система корпус-менеджер: архитектура и модели корпусных данных Программные продукты и системы / Software & Systems 4 (31) – C. 6.

В. П. Захаров, И. В. Азарова, О. А. Митрофанова, А. М. Попов, М. В. Хохлова (2019) Моделирование в корпусной лингвистике Специализированные корпусы русского языка, Санкт-Петербургский государственный университет. -C. 19.

Erhard Hinrichs, Marie Hinrichs, Thomas Zastrow, Gerhard Heyer, Volker Boehlke, Uwe Quasthoff, Helmut Schmid, Ulrich Heid, Fabienne Fritzinger, Alexander Siebert, and Jorg Didakowski. (2009) Weblicht: Web-based LRT services for German. In Workshop on linguistic processing pipelines, GSCL Jahrestagung, Potsdam.

Аброскин А. А. Поиск по корпусу: проблемы и методы их решения // Национальный корпус русского языка: 2006-2008. Новые результаты и перспективы. СПб.: Нестор-История, 2009, 277-282.

https://uz.wikipedia.org/wiki/O%CA%BBzbek_tili

Jinyi Zhang, Tadahiro Matsumoto (2019) Corpus Augmentation for Neural Machine Translation with ChineseJapanese Parallel Corpora / Applied sciences (9), 2036.

Downloads

Published

2021-09-26

How to Cite

Abdurakhmonova, N. (2021). FORMAL-FUNCTIONAL MODELS OF THE UZBEK ELECTRON CORPUS. ANGLISTICUM. Journal of the Association-Institute for English Language and American Studies, 10(8), pp.59–66. Retrieved from https://www.anglisticum.org.mk/index.php/IJLLIS/article/view/2233

Issue

Section

Volume 10, No.8, August 2021