Item Details

In Basket

Journal Article

ID	172208
Title Proper	Machine Learning Approach to Suffix Separation on a Sandhi Rule Annotated Malayalam Data Set
Language	ENG
Author	Sebastian, Mary Priya ; Kumar, G. Santhosh
Summary / Abstract (Note)	This article explores in depth various sandhi (joining) rules in Kerala’s Malayalam language, which play a vital role in framing of the inflected and agglutinated forms of words and their compounds. It discusses significant progress in a scientific method to generate a specific annotated data set of Malayalam words that would be useful in many Natural Language Processing tasks which involve Malayalam preprocessing. The article discusses the results and issues encountered in developing this word-splitting tool for Malayalam, mainly in the context of improving the alignments between parallel texts that form a core resource in the Machine Translation task.
`In' analytical Note	South Asia Research Vol. 40, No.2; Jul 2020: p.231-249
Journal Source	South Asia Research 2020-08 40, 2
Key Words	Morphology ; Kerala ; Malayalam ; Machine learning ; Agglutinative Languages ; Dravidian Languages ; Sandhi Rules ; Suffix Separation