INTRODUCTION

WordNet is a lexical database for different languages. It groups words into sets of synonyms called synsets, provides short, general definitions, and records the various semantic relations between these synonym sets. The purpose is twofold: to produce a combination of dictionary and thesaurus that is more intuitively usable, and to support automatic text analysis and artificial intelligence applications.

Taking Kashmiri language in this scenario, work is going on the development of Kashmiri Synsets. Kashmiri language is called k ә:sh ur or k ә:shir zaba:n by its native speakers. It is primarily spoken in the Kashmir Valley part of the Jammu and Kashmir State in India. It is also spoken by Kashmiri‟s settled in other parts of India, and other countries. The language spoken in and around Srinagar is regarded as the standard variety.

Kashmiri has several unique features amongst the Indo-Aryan languages. These include existence of central vowels / I/, / I:/, /ә/, / ә:/, insertion of epenthetic vowels, etc. Syntactically, from a word order perspective, Kashmiri shows both verb medial and verb final characteristics. Kashmiri also shows strong V2 features like Germanic, Yiddish, Dutch and Icelandic. In the root clause, the finite verb may be preceded not only by the subject, as in English, but also by other clause constituents, as is the case in the verb-second languages. Morphologically speaking, words in Kashmiri are either mono or poly-morphic. Grammatical categories can be divided into three main groups on the basis of their morphological characteristics.

 

 

KASHMIRI WORDNET DEVELOPMENT

The work on Kashmiri WordNet started as a part of the Project 'Development of Language Tools and Linguistic Resources for Kashmiri' funded by DIT , Ministry of Communication and Information Technology Govt. of India. Presently, the development of Kashmiri WordNet is a part of the 'Development of Indradhanush: an Integrated WordNet for Bengali, Gujarati, Konkani, Kashmiri, Oriya, Punjabi and Urdu' also funded by DIT. The Kashmiri group would like to thank Dr Swarn Lata and Prof Pushpak Bhattacharya without whom this work would not have been possible. We would also like to thank Prof Jyoti Pawar from Goa University, Dr. Somnath Chandra, Dr. Vijay Kumar and Dr. Manoj Jain from DIT who encouraged us at all times.

SYNSET DEVELOPMENT IN KASHMIRI

[Last Updated on 01 APRIL 2012]


No. of Universal Synsets Completed:           7,168

No. of Pan Indian Synsets Completed:        1,347

No. of Noun Synsets Completed:                 13,295

No. of Adjective Synsets Completed:           5,068

No. of Adverb Synsets Completed:              305

No. of Verb Synsets Completed:                  2,354

No. of Kashmiri Language Synsets Completed:      850