CPL - Chalmers Publication Library

Frontiers of Multilingual Grammar Development

Ramona Enache
Göteborg: Chalmers University of Technology, 2013. ISBN: 978-91-628-8787-2. 172 pp.
[Doctoral thesis]

This thesis explores several ways of developing multilingual grammars written in GF (Grammatical Framework). The goal is both to enhance the coverage of the grammars, in terms of content and number of languages, and to reduce the development effort by automating a larger part of the process. The first direction in grammar development targets the creation of general language resources, which are the starting point for building domain-specific grammars for a language. Developing resource grammars gives a good overview of the effort required and provides a solid base for subsequent experiments in automation. Our work resulted in computational grammars for Romanian and Swedish. A further development step is the creation of multilingual domain-specific grammars. The technique we employed is to convert structured models into grammars: the original structure of the model is preserved as the backbone of the grammar, and the general GF resources are used for a smooth multilingual verbalization of the model. The use cases considered are an upper-domain ontology, a business model, and an ontology describing cultural heritage artefacts. Each poses a different challenge and illustrates another aspect of the interoperability between GF grammars and ontologies, and its advantages. An orthogonal approach to multilingual grammar development aims at increasing the number of languages covered by a domain grammar. Our solution is an example-based prototype that partially replaces grammar programming with feedback from native informants and SMT (statistical machine translation) tools such as Google Translate. Finally, in an attempt not only to enhance GF grammars but also to use them in a novel way, we present a grammar-based hybrid system architecture that combines GF grammars with SMT systems. This marks some of the first steps towards using grammars for translating free text. As a side effect of the work, we propose a technique for building bilingual GF lexicon resources from SMT phrase tables.

Keywords: multilingual grammar development, ontology verbalization, resource grammar development, hybrid machine translation, functional programming, domain-specific languages




This record was created 2013-09-17. Last modified 2014-01-15.
CPL Pubid: 183497

 



Departments (Chalmers)

Department of Computer Science and Engineering (GU)

Subject areas

Information and communication technology
Language technology (computational linguistics)


Examination

Date: 2013-10-29
Time: 10:00
Venue: Room EE, EDIT building, Johanneberg
Opponent: Dr. Keith B. Hall, Google