Title |
Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata |
Authors |
Claude Audebert, Christian Gaubert and André Jaccarini |
Abstract |
We present scenarios showing the interactive construction of operators. Three grammars and their progressive refinements through the "feedback" method are given as examples: a kernel of grammars for retrieving quotations, a grammar reflecting a set of current syntactic operations, and a grammar dealing with morphology. They are designed as Finite State Automata, some of them made deterministic for better performance, using the Sarfiyya software, which was developed for this purpose and supports many operations on FSTs. Purely algorithmic, this approach uses minimal resources, is largely independent of lexicons, gives tool words a prominent place, and bases parsing on surface structures. On the theoretical level, it aims to highlight the specificity of the Arabic language, whose high level of grammaticalization makes it possible to work without a lexicon (as a limit case). This work is thus of interest to the linguist looking for the right balance between lexicon and grammar, as well as to the specialist in cognitive sciences (duality between data and programs). On the practical level, this work aims to establish a coherent methodology for the creation of multipurpose search operators. |
Topics |
Exploitation of LRs in different types of applications (information extraction, information retrieval, speech dictation, translation, summarisation, web services, semantic web, etc.), Evaluation methodologies, protocols and measures, Taggers and Parsers |
Full paper |
Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata |
Bibtex |
@InProceedings{AUDEBERT09.37,
author = {Claude Audebert and Christian Gaubert and André Jaccarini},
title = {Minimal Resources for Arabic Parsing: an Interactive Method for the Construction of Evolutive Automata},
booktitle = {Proceedings of the Second International Conference on Arabic Language Resources and Tools},
year = {2009},
month = {April},
date = {22-23},
address = {Cairo, Egypt},
editor = {Khalid Choukri and Bente Maegaard},
publisher = {The MEDAR Consortium},
isbn = {2-9517408-5-9},
language = {english}
} |