Tagged Malayalam Corpus

Home > Malayalam Corpus

This is a tagged Malayalam corpus used for POS tagging works. This dataset mainly consists of two columns-first one is words and the second one is their corresponding tags. This dataset is prepared using BIS(Bureau of Indian Standards) tagset and consists of 287588 words. The tagset contains 36 tags from BIS tagset.