Professionals in many fields rely on domain-specific terminology, some of it borrowed from a foreign language, to communicate efficiently and effectively. Individuals therefore need exposure to real usage examples in order to build their repertoire in the language. However, this repertoire is usually built slowly and manually, or with non-free tools when the teaching method relies on \textit{corpora}. In this work, we developed an open-source web application that extends the functionality of similar existing applications while remaining user-friendly.
%
The system takes as input a corpus provided by the user. Using segmentation, labeling, and search algorithms, it processes the corpus and outputs the list of sentences, the word list, the word frequencies, the automaton of each sentence, the automaton of the corpus as a whole, and the tagged text. Optionally, a search expression can be provided as input, in which case the output also includes the positions in the text of the terms found.
%
Our system is modular and extensible through plug-ins, unlike the main solutions available on the market; the trade-off is that, being a new system, it does not yet offer all the features available in existing software.
%
The developed system costs significantly less than its paid competitors and can be used by linguists as a tool to support language study in future research.
Keywords: Natural Language Processing. Web system. \textit{Corpus}. Plug-in.