Please use this identifier to cite or link to this item: https://t2-4.bsc.es/jspui/handle/123456789/61536
Title: Orthography, phonology and morphology in the Arabic lexicon
Keywords: 2011
Description: Arabic script is essentially alphabetic; that is, it uses different characters based on the pronunciation of words. However, much Arabic writing includes only the consonants, which creates considerable ambiguity: a single written word could represent many different words or forms of those words. This project aims to apply a framework previously developed for mapping between spelling and pronunciation in European languages (English, Dutch, German and French) to define the relations between written and spoken forms in Modern Standard Arabic, and then to apply a set of probabilities, extracted from Arabic corpora, to determine which of the possible pronunciations of a particular written form is the most likely. The resulting lexicon will be useful for a range of Arabic natural language processing (NLP) applications, and the structure of the lexicon means that it will be possible to extend it to cover different varieties of Arabic.
URI: https://t2-4.bsc.es/jspui/handle/123456789/61536
Other Identifiers: 850541; 10.5255/UKDA-SN-850541; https://doi.org/10.5255/UKDA-SN-850541
Appears in Collections: Cessda
