IS 17627 : 2021 Linguistic Resources - POS Tag Set for Indian Languages - Guidelines for Designing Tagsets and Specification

ICS 35.020

LITD 20

New Standard from Last Update.

1. SCOPE

This Indian Standard provides guidelines for designing part-of-speech (POS) tagsets and labels for Indian languages. It also defines tagsets for Bangla, Gujarati, Hindi, Kashmiri, Konkani, Maithili, Marathi, Punjabi, and Urdu, and for the Dravidian languages Telugu, Kannada, Malayalam, and Tamil.

Tagsets for the languages of the North-Eastern region are not defined in this standard. However, the methodologies it specifies for designing tagsets and labels can readily be extended to the remaining Indian languages.