A leitura como um processo cognitivo
Figueiredo, Olívia Maria
1999-01-01
Search results
8 records were found.
Probabilistic Translation Dictionaries (PTD) are translation resources that can be obtained automatically from parallel corpora. Although this process is simple, it requires the existence of a parallel corpora for the involved languages. Minoritized languages have a limited amount of available resources. For example, while they can have a few parallel corpora, the number of parallel language-pairs uses to be restricted. We defend that if a minoritized language A has a parallel corpus with a language B, and language B has a parallel corpus with another language C, then we can obtain a helpful probabilistic translation dictionary between A and C. In this document we will formalize the probabilistic translation dictionaries triangulation, perform some experiments making the triangulation between Galician, English and Italian, and conclude...
This paper presents a parallel corpora-based bilingual terminology extraction method based on the occurrence of bilingual morphosyntactic patterns in probabilistic translation dictionaries. We discuss an experiment focused on two language pairs – English-Galician and English-Portuguese, and show results which experimentally confirm the high degree of accuracy of the proposed extraction technique.
Series : OASIcs - Open access series in informatics, ISSN 2190-6807, vol. 29
Even in the 21st century, paper dictionaries are still compiled and developed using standard word processors. Many publishing companies are, nowadays, working on converting their dictionaries into computer readable documents, so that they can be used to prepare new features, such as making them available online. Luckily, most of these publishers can pay review teams to fix and even enhance these dictionaries. Unfortunately, research institutions cannot hire that amount of workers.
In this article we present the process of retreading a Galician dictionary that was first de- veloped and compiled using Microsoft Word. This dictionary was converted, through automatic rewriting, into a Text Encoding Initiative schema subset. This process will be detailed, and the problems found will be discussed. Given a recent normative that changed the G...
Series: Lecture notes in computer science, ISSN 0302-9743, vol. 8854
This paper presents a research on parallel corpora-based bilingual terminology extraction based on the occurrence of bilingual morphosyntactic patterns in the probabilistic translation dictionaries generated by NATools. To evaluate this method, we carried out an experiment in which both the level of lexical cohesion of the term candidates and their specificity with respect to a non-terminological corpus of the target language were taken into account. The evaluation results show a high degree of accuracy of the terminology extraction based on probabilistic translation dictionaries complemented by bilingual syntactic patterns.
Neste documento apresenta-se o conceito de memórias de tradução distribuídas, discutindo-se o seu interesse na área da tradução, bem como as vantagens que uma ferramenta de tradução pode tirar do seu uso. É apresentada uma possível implementação de memórias de tradução distribuídas usando WebServices numa arquitectura de cooperativismo. São definidos as mensagens (API) que um serviço deste género deve implementar para que uma ferramenta de tradução possa tirar partido da colaboração entre tradutores.
Las memorias de traducción son muy útiles para la
traducción, pero son difíciles de compartir y de reutilizar por parte de la
comunidad de traductores. Mediante las memorias distribuidas de traducción,
los usuarios pueden construir conjuntamente y compartir memorias de
traducción. En este artículo, presentamos los detalles de la implementación de
esta arquitectura utilizando la tecnología de WebServices y un ejemplo de un
sistema distribuido entre Portugal y España.
