Tradução automática estatística baseada em sintaxe e linguagens de árvores

Beck, Daniel Emilio

Tradução automática estatística baseada em sintaxe e linguagens de árvores

Arquivos

4541.pdf (1.28 MB)

Data

2012-06-19

Autores

Beck, Daniel Emilio

Editor

Universidade Federal de São Carlos

Resumo

Machine Translation (MT) is one of the classic Natural Language Processing (NLP) applications. The state-of-the-art in MT is represented by statistical methods that aim to learn all necessary linguistic knowledge automatically through large collections of texts (corpora). However, while the quality of statistical MT systems had improved, nowadays these advances are not significant. For this reason, research in the area have sought to involve more explicit linguistic knowledge in these systems. One issue that purely statistical MT systems have is the lack of correct treatment of syntactic phenomena. Thus, one of the research directions when trying to incorporate linguistic knowledge in those systems is through the addition of syntactic rules. To accomplish this, many methods and formalisms with this goal in mind are studied. This text presents the investigation of methods which aim to advance the state-of-the-art in statistical MT through models that consider syntactic information. The methods and formalisms studied are those used to deal with tree languages, mainly Tree Substitution Grammars (TSGs) and Tree-to-String (TTS) Transducers. From this work, a greater understanding was obtained about the studied formalisms and their behavior when used in NLP applications.

Palavras-chave

Processamento da linguagem natural (Computação), Linguística - processamento de dados, Linguagem - tradução automática, Processamento da Língua Natural, Linguística Computacional, Tradução automática estatística, Gramáticas de substituição de árvores, Transdutores árvore-para-String, Natural language processing, Computational linguistics, Statistical machine translation, Tree substitution grammars, Tree-to-string transducers

Citação

BECK, Daniel Emilio. Tradução automática estatística baseada em sintaxe e linguagens de árvores. 2012. 94 f. Dissertação (Mestrado em Ciências Exatas e da Terra) - Universidade Federal de São Carlos, São Carlos, 2012.

URI

https://repositorio.ufscar.br/handle/20.500.14289/504

Coleções

Teses e Dissertações

Página do item completo

Tradução automática estatística baseada em sintaxe e linguagens de árvores

Arquivos

Data

Autores

Título da Revista

ISSN da Revista

Título de Volume

Editor

Resumo

Descrição

Palavras-chave

Citação

URI

Coleções

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced