Building the Package

  1. clone the github repository: https://github.com/wolfgangmm/exist-stanford-ner
  2. edit build.properties and set exist.dir to point to your eXist install directory
  3. call "ant" in the directory to create a .xar
  4. upload the xar into eXist using the dashboard

Chinese Language Support

To recognize entities in Chinese texts, you need to obtain the Chinese classifier and segmenter. Before you build the .xar to install, download the classifier and word segmenter using the following links:

From the first package, copy chinese.misc.distsim.crf.ser.gz into resources/classifiers. From the second zip, copy

and everything inside data/dict into the resources/classifiers directory.