A non word error spell checker for Indonesian using morphologically analyzer and HMM
Spell checker consists of two main methods, error detection and error correction. In this study, spell checker is built by using morphological analyzer and dictionary lookup as error detection method with two alternative optimization, binary search and hash. Whilst as for error correction, two alter...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Spell checker consists of two main methods, error detection and error correction. In this study, spell checker is built by using morphological analyzer and dictionary lookup as error detection method with two alternative optimization, binary search and hash. Whilst as for error correction, two alternative methods, namely forward reversed dictionary and probability of similarity is used. Forward reversed dictionary corrects the misspelled word by considering edit distance between the misspelled word and its candidates. Probability of similarity, which is the main proposed method for error correction, correct the misspelled word by calculating its similarity to a candidate word, based on the value of optimum subsequence between them. Candidate sorting was accomplished through the use of HMM (Hidden Markov Model), where the word is considered as observed state and the candidates as hidden state. By using HMM, the system does not only consider the similarity of the candidate word with misspelled words, but also consider the sequence of words in sentences where the word is located. The experiment result proves that sorting candidates by using HMM increase the precision accuracy. As for correction method, the result showed that using probability of similarity has better correctness accuracy than forward reversed dictionary. |
---|---|
ISSN: | 2155-6822 2155-6830 |
DOI: | 10.1109/ICEEI.2011.6021514 |