Search the Community
Showing results for tags 'wordnet'.
-
I need an offline dictionary. I did some searches and some e-mails, and found this one... Wordnet. http://wordnet.princeton.edu/ http://wordnet.princeton.edu/wordnet/license/ License Info (just the important bit): "Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted" Perfect candidate for my offline dictionary. Now the hard part... I have database files. I downloaded "WordNet 3.1 DATABASE FILES ONLY" I dove in, started looking at the files and decided that it was probably best to use wn3.1.dictdictdbfiles Unfortunately, the syntax inside these files is a bit more complicated than I can figure out just by looking at the files alone. So... the program I'm working on goes in alphabetical order. First is, "adj.all" Let's look at the first line. [{ [ ABLE, noun.cognition:ability,+ noun.attribute:ability,+ UNABLE,!] noun.attribute:ability,= noun.cognition:ability,= ((usually followed by `to') having the necessary means or skill or know-how or authority to do something; "able to swim"; "she was able to program her computer"; "we were at last able to buy a car"; "able to get a grant for the project") } What I can figure out: The type marker is to the right of the text. Main word is followed by , Antonym is followed by ,! The definition is enclosed is parenthesis The definition and example sentences are separated by ; I'm not sure what ,+ is I'm not sure what ,= is If you look at more lines, it appears comma plus is some type of grouping component. Special group names seem to be followed by ,;c or ,;u ,^ appears to be for similes Moving on to the next file in the list... adj.pert They have added , for another grouping and now they also have alternate forms of the word It appears that the words and their alternate forms are enclosed in square brackets. Blah blah blah, I'm listing off tons of unorganized details about tedious files. Essentially... I need help figuring out how to turn this database into something readable. Mainly, I'm trying to figure out the grouping system that they're using. The program that I'm working on will go through every file and create a single text document for each word. Inside the document will be the information such as, "noun, verb, adjective, etc" definition, antonym, synonym... the works. I can code all the stuff myself, I just need help deciphering the raw data. Thank you, and sorry for the huge post.