malim – a new computational approach of malay morphology
DESCRIPTION
Mohd Yunus Sharum , Muhammad Taufik Abdullah, Md Nasir Sulaiman , Masrah Azrifah Azmi Murad & Zaitul Azma Zainon Hamzah. malim – a new computational approach of malay morphology. Ainun Najwa Bt Aziz P61811 Fatimah Zawani Bt Abdullah P61028 - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/1.jpg)
MALIM – A NEW COMPUTATIONAL APPROACH
OF MALAY MORPHOLOGY
Ainun Najwa Bt Aziz P61811Fatimah Zawani Bt Abdullah P61028Mohd Rashidie B. Ramli P62451
Mohd Yunus Sharum, Muhammad Taufik Abdullah, Md Nasir Sulaiman, Masrah Azrifah
Azmi Murad & Zaitul Azma Zainon Hamzah
![Page 2: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/2.jpg)
INTRODUCTION A major problem in Malay morphological
processing is in analysis. Existing model : finite-state, two-level
formalism. Hypothesis : higher accuracy of
morphological analysis can be achieved by widening the decision-selection domain.
Implements MALIM approach using S-A-P-I.
![Page 3: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/3.jpg)
MALAY MORPHOLOGY Basic target of S-A-P-I is to analyze
affixation, especially multiple affixations. Affixation could be one or several of these
processes (prefixation, suffixation, circumfixation and infixation).
3 basic categories of Malay reduplication:1. Full reduplication2. Partial reduplication3. Rhythmic reduplication
![Page 4: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/4.jpg)
THE S-A-P-I APPROACH Use the divide-and-conquer technique to
handle Malay morphological analysis. S-A-P-I (‘search-all-pick-if…) algorithm. Advantage : we can search for most
appropriate result, since we had gathered all possible options from the decision-selection domain.
Side-effect : multiple outputs due to ambiguity.
2 technique to improve the analysis’ results (separating and filtering).
![Page 5: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/5.jpg)
MALIM – MORPHOLOGICAL ANALYZER FOR LINGUISTIC INDECISION OF MALAY
A morphological analyzer which implements the S-A-P-I approach.
Developed with Perl. Characteristic of Perl :
1. Support regular expression, a notation which describes regular language.
2. Capability of supporting lexical processing. MALIM contains a basic set but
comprehensive root lexicon as reference (root lexicon: 5710 root words).
![Page 6: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/6.jpg)
MALIM contains a set of 80 morphosyntatic rules.
Limitations in implementation:1. Do not includes infixation analysis.2. Do not includes analysis on complex
affixation/reduplication.3. Do not analyze rhythmic and free
reduplication.4. Limited in analyzing affixation / reduplication
of compound word and phrase. Overcome the limitation : use a strategy
resembling direct mapping approach.
MALIM – MORPHOLOGICAL ANALYZER FOR LINGUISTIC INDECISION OF MALAY
![Page 7: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/7.jpg)
METHOD EXPERIMENT Types of experiment :
1) Testing processing model (S-A-P-I)2) Splitting lexicon (of mono-syllabic and multi-
syllabic)3) Morphosyntactic rule filtering4) First syllabic reduplication analysis5) Clitics/particles extraction6) The effects of ‘cheat-list’ (direct mapping)
![Page 8: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/8.jpg)
METHOD EXPERIMENT Experiment setting :
Set 1 : MALIM (complete) Set 2 : MALIM without lexicon splits Set 3 : MALIM without morphosyntactic rule
filtering Set 4 : MALIM without first syllabic reduplication
analysis Set 5 : MALIM without clitics/particles extraction Set 6 : MALIM without ‘cheat list’ Set X : MALIM with basic capabilities (fullfills all
Set 2 to Set 6) – use as control set
![Page 9: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/9.jpg)
![Page 10: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/10.jpg)
![Page 11: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/11.jpg)
![Page 12: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/12.jpg)
CONTRIBUTION1) Introducing a new and more accurate
approach of morphological analysis using S-A-P-I
2) Solved most of morphological problems involving Malay morphology, except involving multi-words (or compound word) and certain reduplicated words
![Page 13: malim – a new computational approach of malay morphology](https://reader035.vdocument.in/reader035/viewer/2022081507/5681660e550346895dd950b4/html5/thumbnails/13.jpg)
CONCLUSION 1) MALIM only uses controlled sample data
which is not from daily life usage.2) Thus, this may not pose the real challenge
as solving the real world problems.3) So, in future, we may perform a test-run
using real-life data such as from corpus to verify the performance.