An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
Viimati Vabastamist okt 07, 2016汉语言处理包
Viimati Vabastamist nullA Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Viimati Vabastamist dets 14, 2016HanLP: Han Language Processing
Viimati Vabastamist dets 27, 2020A Lucene tokenizer plugin for both Simplified Chinese and Traditional Chinese, featured with Chinese Word Segmentation, custom dictionary etc.
Viimati Vabastamist dets 14, 2016