tcc-latex/tcc_resumo_eng.tex

10 lines
1.6 KiB
TeX
Raw Normal View History

2018-04-03 03:39:06 +00:00
Professionals from different fields of activity depend on the use of specific terms in their area, some of them in a foreign language, for efficient and effective communication, whose learning process requires contact with use cases to form a repertoire in the language or their dialect. However, the formation of such repertoire is usually done slowly and manually, or through non-free tools when the teaching methodology uses \textit{corpora}. In this work, we developed an open source internet application to extend the functionality having in sight similar existing applications while remaining user friendly.
2017-11-05 03:31:10 +00:00
%
2018-02-08 01:17:01 +00:00
The system has as inputs a user-supplied corpus. The system, using algorithms of segmentation, tagging and search, processes the corpus, having as outputs the list of sentences, word frequencies, word list, the automaton of each sentence, the automaton of the corpus as a whole and the tagged text. Optionally, a search expression can be provided for input, so to the output is added the collocation.
2017-11-05 03:31:10 +00:00
%
2018-02-08 01:19:32 +00:00
We have found that our system is modular and extensible through plug-ins, unlike the major solutions available on the market; the counterpart is that it is a new system, so not all the features present in other applications can be found in this one.
2017-11-05 03:31:10 +00:00
%
2018-04-03 03:39:06 +00:00
Therefore, the developed system can be used to support the study of the language by linguists as a research tool in the future, and it may be possible to use the plug-ins developed as product of those works to raise the state of the art of Automatic Processing of Natural Language.
2017-11-05 03:31:10 +00:00
2018-02-08 01:19:32 +00:00
Keywords: Automatic Processing of Natural Language. Web system. Corpus. Plug-in.