Automatic Discovery of Fuzzy Synsets from Dictionary Definitions
Hugo Gonçalo Oliveira and Paulo Gomes
To deal with ambiguity in natural language, it is common to organise words according to their senses in synsets: groups of synonymous words that can be seen as concepts. Creating a broad-coverage synset base by hand is time-consuming, so we instead extract synonymy pairs from dictionary definitions and apply clustering to identify synsets. Since word senses are not discrete, we create fuzzy synsets, in which each word has a membership probability. We report the results of creating a fuzzy synset base for Portuguese from three electronic dictionaries. The resulting resource can be used as a fuzzy thesaurus and is larger than existing handcrafted thesauri for Portuguese.
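As a rough illustration of the idea, and not the procedure used in this work, the sketch below turns a list of synonymy pairs into fuzzy synsets. It assumes that the pairs have already been extracted from the definitions, and it scores the membership of a word in the synset centred on another word by the cosine similarity of their neighbour sets in the synonymy graph. That similarity measure, the `threshold` parameter, the toy English pairs and all function names are assumptions made for this example; the score is a degree in [0, 1] and is not normalised into a probability.

```python
from collections import defaultdict
from math import sqrt


def build_graph(pairs):
    """Build an undirected synonymy graph; every word is also its own neighbour."""
    graph = defaultdict(set)
    for a, b in pairs:
        graph[a].update((a, b))
        graph[b].update((a, b))
    return dict(graph)


def cosine(graph, u, v):
    """Cosine similarity between the binary neighbour sets of u and v."""
    shared = len(graph[u] & graph[v])
    return shared / sqrt(len(graph[u]) * len(graph[v]))


def fuzzy_synsets(pairs, threshold=0.5):
    """Return one fuzzy synset per word: {centre: {member: membership degree}}.

    Assumption: membership is the cosine similarity to the centre word, and
    members scoring below `threshold` are dropped.
    """
    graph = build_graph(pairs)
    synsets = {}
    for centre in graph:
        scores = {w: cosine(graph, centre, w) for w in graph}
        synsets[centre] = {
            w: round(s, 3) for w, s in scores.items() if s >= threshold
        }
    return synsets


# Hypothetical synonymy pairs, as if extracted from dictionary definitions.
pairs = [
    ("car", "automobile"),
    ("car", "auto"),
    ("automobile", "auto"),
    ("bank", "shore"),
    ("bank", "depository"),
]

for centre, members in fuzzy_synsets(pairs).items():
    print(centre, members)
```

In this toy input, "bank" belongs to the synsets centred on both "shore" and "depository" with a membership below 1, which is the kind of graded, overlapping membership that a discrete partition of words cannot express.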