Resultados de pesquisa

Foram encontrados 5 registos.

This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference. The corpora comprise both the well-known Europarl corpus and a domain-specific question-answer troubleshooting corpus on the IT domain. English is common in all parallel corpora, with translations in five languages, namely, Basque, Bulgarian, Czech, Portuguese and Spanish. We describe the annotated corpora and the tools used for annotation, as well as annotation statistics for each language. These new resources are freely available and will help research on semantic processing for machine translation and cross-lingual transfer.
Biblioteca centralPalácio Ceia
Rua da Escola Politécnica, nº 141 - 147
1269-001 Lisboa, Portugal

Telefones: (+351) 300 002 922
(+351) 300 002 925 | (+351) 300 002 930
(+351) 300 002 931 | (+351) 300 002 932
Correio eletrónico: cdoc@uab.pt

Horário de atendimento:
Segunda a sexta, das 9h às 18h
Delegação de CoimbraRua Alexandre Herculano, nº 52
3000-019 Coimbra, Portugal

Telefone: (+351) 300 001 590
Correio eletrónico: cdocoimbra@uab.pt

Horário de atendimento:
Segunda a sexta, das 9h às 12h30 e das 14h às 18h
Delegação do PortoRua de Amial, nº 752
4200-055 Porto, Portugal

Telefone: (+351) 300 001 700
Correio eletrónico: cdocporto@uab.pt

Horário de atendimento:
Segunda a sexta, das 9h às 17h30