+Dict formats

Misc

  • tsv files.
    • In TSV file, newline is \n.
  • Monodix (dix) format (use with lttoolbox)
  • slang jargon file
    :word1:definition 1
    :word2:definition 2
    

Babylon

babylon source format (bgl file)

[blank line]
Term | Alternate1 | Alternate2| ... | AlternateK  
Definition  
[blank line]  
Term | Alternate1 | Alternate2| ... | AlternateK  
Definition  
[blank line]  
...
  • Special formatting
    • In Babylon source file, newline is
    • <a href="bword://next">next</a>

With html

  • <br> <a>, <b>, <i>, <em>, <ul/ol/li/dl/dd/dt>
  • HTML features require some special header. Example: purANa-encyclopedia.
  • Note that the language pair can be specified in the bookname header field : (ml-ml). This is recognized by goldendict.
blank line
#stripmethod=keep
#sametypesequence=h
#bookname=Pals eng-eng-tamil dictionary (en-ta)
#author=Jim Breen
#email=jwb@csse.monash.edu.au
#website=http://www.csse.monash.edu.au/~jwb/enamdict_doc.html
#description="...."
#Any other comment.
blank line
head_word|synonym1|synonym2
definition in single line. for optical line break, you can use <br>
----this third line should be blank --

gls

[blank line]
Term | Alternate1 | Alternate2| ... | AlternateK
[attributes]
Definition
[blank line]

stardict

Support

Used by: Stardict, GoldenDict, QDict, SDCV (command-line)

Dictionary making tips

  • Technique to create stardict files
    • From TSV or babylon source files (and back) using stardict-editor
    • Other useful tools: babylon, tabfile, dictzip, dictunzip, dictconv (manual) . Note tabfile in v 3.0.1 has trouble with duplicate entries in tsvfile. 3.0.5 works fine (need to build and use separately).
  • Stardict parsers
    • java. (no syn file support as of 2017)

dictd dicts

  • The standard dictd server made by the DICT Development Group uses a special DICT file format, although other dictd servers (such as GNU Dico) may optionally use other file formats.
  • Dictionaries in the standard DICT file format are made up of two files, a .index file and a .dict file (or .dict.dz if compressed).

Creation:

  • These files are not usually written manually but are compiled by a program called dictfmt. For example, from jargon file: dictfmt --utf8 --allchars -s "My Dictionary" -j mydict < mydict.txt

Support

Used by: dict clients (including some emacs plugins) and servers, GoldenDict, GNOME Dictionary