Welcome to the newly upgraded BRAC University Institutional Repository! Following our recent system upgrade, we are actively organizing our collections. While the category counters on the homepage are currently syncing and may temporarily display low numbers, rest assured that our full repository of over 27,000 items remains safely intact. Please use the search bar above to easily access all scholarly outputs, theses, and institutional documents while we complete this categorization process.

A light weight stemmer for Bengali and its use in spelling checker

Citation

Abstract

Stemming is an operation that splits a word into the constituent root part and affix without doing complete morphological analysis. It is used to improve the performance of spelling checkers and information retrieval applications, where morphological analysis would be too computationally expensive. For spelling checkers specifically, using stemming may drastically reduce the dictionary size, often a bottleneck for mobile and embedded devices. This paper presents a computationally inexpensive stemming algorithm for Bengali, which handles suffix removal in a domain independent way. The evaluation of the proposed algorithm in a Bengali spelling checker indicates that it can be effectively used in information retrieval applications in general.

Description

Includes bibliographical references (page 6).

Publisher Link

Type

Article