WORD FILE STRUCTURE
 
Structure

File vocab.000

At the start of the file is a series of word offsets.
The first 26 offsets are offsets to the first word of each alphabet letter.

Ex. :1st offset points to the first word beginning with an 'a'.
2nd offset points to the first word beginning with a 'b'.
etc.

Note : Words are in alphabetical order.


Word structure

[Number of characters identical to the previous word (0 if first word)](byte)
[String](X byte(s))
[Word type](byte)
[Word group number](reversed word)

Notes :

[String]:

If first word, complete string.
if not first word, differences with the previous word.

IMPORTANT : The last character of the string has bit 7 set

[Word type] :

Grammatical function of word

Known types :

00h (Useless word)
01h preposition
02h article
04h qualifying adjective
08h relative pronoun
10h noun
20h conjugated verb
40h adverb
80h imperative verb
FFh (anyword)

Note : I am a French-speaker, thus it might not be the correct terms


[Word group number] : Used with the function 'said' parameters block of scripts


Example

If we have two words, let's say 'animal' and 'animalism', it will look like that :
   00h 61h 6Eh 69h 6Dh 61h ECh 10h 00h 01h
    |  'a' 'n' 'i' 'm' 'a' 'l'  |   |___|____0001h = Word group number
    |                       |   |
    |__0, first word        |   |__10h = noun
                            |
                            |__'l' = 6Ch, but remember, last char has bit 7 set


   06h 69h 73h EDh 10h 00h 02h
    |  'i' 's' 'm'  |   |___|____0002h = Word group number
    |           |   |
    |           |   |__10h = noun
    |           |
    |           |__'m' = 6Dh, but remember, last char has bit 7 set
    |                         
    |__6 characters identical to previous word 'animal'



Main | Specifications | Programs | Project | SCI | Links