Item Details
Journal Article

ID: 172208
Title Proper: Machine Learning Approach to Suffix Separation on a Sandhi Rule Annotated Malayalam Data Set
Language: ENG
Author: Sebastian, Mary Priya; Kumar, G. Santhosh
Summary / Abstract (Note): This article explores in depth various sandhi (joining) rules in Kerala’s Malayalam language, which play a vital role in framing of the inflected and agglutinated forms of words and their compounds. It discusses significant progress in a scientific method to generate a specific annotated data set of Malayalam words that would be useful in many Natural Language Processing tasks which involve Malayalam preprocessing. The article discusses the results and issues encountered in developing this word-splitting tool for Malayalam, mainly in the context of improving the alignments between parallel texts that form a core resource in the Machine Translation task.
'In' Analytical Note: South Asia Research Vol. 40, No. 2; Jul 2020: p. 231-249
Journal Source: South Asia Research 2020-08 40, 2
Key Words: Morphology; Kerala; Malayalam; Machine learning; Agglutinative Languages; Dravidian Languages; Sandhi Rules; Suffix Separation