dict_build's Introduction

Building a lexicon

Automatically builds a lexicon (word list) from raw text. It currently works only for Chinese. Reference:

http://www.matrix67.com/blog/archives/5044

new in 0.0.3

  1. Replaced the ternary search tree with a radix tree to improve performance.
  2. Added log output that shows extraction progress.

new in 0.0.2

  1. Imported the java-merge-sort source code directly, thx @cowtowncoder
  2. Converted the former Maven project into a Gradle project, which makes packaging easier.

Word-formation criteria

  1. Mutual information
  2. Left and right entropy
  3. Positional word-formation probability
  4. n-gram frequency
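
As a rough illustration of the second criterion, the sketch below computes the left and right neighbor entropy of a candidate string by scanning a text directly. The class name `NeighborEntropy`, its method names, and the brute-force scan are assumptions made for this example; they are not taken from the dict_build source, which according to the release notes uses a radix tree.

```java
import java.util.HashMap;
import java.util.Map;

public class NeighborEntropy {
    /** Shannon entropy (base 2) of a distribution given as raw counts. */
    static double entropy(Map<Character, Integer> counts) {
        int total = 0;
        for (int c : counts.values()) {
            total += c;
        }
        if (total == 0) {
            return 0.0;
        }
        double h = 0.0;
        for (int c : counts.values()) {
            double p = (double) c / total;
            h -= p * Math.log(p) / Math.log(2);
        }
        return h;
    }

    /**
     * Returns {leftEntropy, rightEntropy} over the characters that directly
     * precede and follow each occurrence of word in text.
     * Works on Java chars, so it assumes characters in the Basic Multilingual Plane.
     */
    static double[] leftRightEntropy(String text, String word) {
        Map<Character, Integer> left = new HashMap<>();
        Map<Character, Integer> right = new HashMap<>();
        int idx = text.indexOf(word);
        while (idx >= 0) {
            if (idx > 0) {
                left.merge(text.charAt(idx - 1), 1, Integer::sum);
            }
            int end = idx + word.length();
            if (end < text.length()) {
                right.merge(text.charAt(end), 1, Integer::sum);
            }
            idx = text.indexOf(word, idx + 1);
        }
        return new double[] { entropy(left), entropy(right) };
    }

    public static void main(String[] args) {
        double[] h = leftRightEntropy("xabyzabw", "ab");
        System.out.println("left=" + h[0] + " right=" + h[1]);
    }
}
```

A candidate whose neighbors vary a lot on both sides gets high entropy on both sides, which is the usual signal that it is used freely as a unit.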

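How to run

  1. Download the release, or build the package with gradle distTar
  2. Unpack dict_build-x.x.x.tar
  3. After unpacking, enter bin and run: ./dict_build <absolute path to your data file>
  4. When it finishes, a file named words_sort.data appears in the same directory as the data file
  5. Its five columns are: word, frequency, mutual information, left/right entropy, positional word-formation probability.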
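
To make the column layout concrete, here is a minimal sketch that parses one line of words_sort.data into its five fields. The class `WordEntry` and its field names are hypothetical, and splitting on any run of whitespace is an assumption about the file's separator.

```java
public class WordEntry {
    final String word;
    final long freq;
    final double mutualInfo;
    final double entropy;
    final double posProb;

    WordEntry(String word, long freq, double mutualInfo, double entropy, double posProb) {
        this.word = word;
        this.freq = freq;
        this.mutualInfo = mutualInfo;
        this.entropy = entropy;
        this.posProb = posProb;
    }

    /** Parses "word freq mutualInfo entropy posProb", separated by tabs or spaces. */
    static WordEntry parse(String line) {
        String[] f = line.trim().split("\\s+");
        if (f.length != 5) {
            throw new IllegalArgumentException("expected 5 columns, got " + f.length + ": " + line);
        }
        return new WordEntry(f[0], Long.parseLong(f[1]), Double.parseDouble(f[2]),
                Double.parseDouble(f[3]), Double.parseDouble(f[4]));
    }

    public static void main(String[] args) {
        WordEntry e = parse("工作\t641962\t11.645208082774683\t4.083574124851783\t0.11247281022865935");
        System.out.println(e.word + " freq=" + e.freq + " mi=" + e.mutualInfo);
    }
}
```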

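Notes

  • The data file must be UTF-8 encoded
  • If the data file is large and you hit an out-of-memory error, try the following (macOS and Linux only); adjust 2G to suit your machine
export JAVA_OPTS=-Xmx2G
./dict_build <absolute path to your data file>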

Examples

Extraction results for Jin Ping Mei (《金瓶梅》)

西门庆  4754    6.727920454563199   2.0315193024276885  0.17472535684926388
月娘    1829    6.491853096329675   2.3714166640957095  0.22135096835144072
敬济    906 9.084808387804362   2.554594603718855   0.14485683987274656
春梅    799 8.134426320220927   2.7880175589451714  0.16484505593416485
玳安    796 8.228818690495881   2.865686193737731   0.11791820110723605
后边    617 6.6293566200796095  4.008365154080131   0.2160373686259245
玉楼    594 7.977279923499917   2.27346284978306    0.27518689925240297
明日    580 6.189824558880018   2.705423396095033   0.1774535638537181
两银子  458 6.129283016944967   2.351100547282295   0.3809078896437581
小厮    454 7.257387842692652   3.945653525477103   0.16666666666666666
打发    444 6.870364719583405   3.694604352707633   0.18409496065046307
如今    410 6.643856189774725   2.1460777430093394  0.1780766096169519
淫妇    382 7.768184324776926   3.277903508489837   0.2555205047318612
桂姐    371 7.584962500721156   2.5922046565140424  0.36255305256284687
老婆    331 6.266786540694902   3.5783015008688523  0.3758007117437722
衣服    309 8.90388184573618    2.786139685416002   0.13284518828451883
丫头    297 7.383704292474053   4.291010086795063   0.21875
潘金莲  288 8.276124405274238   2.4955186567189194  0.35333669524289796
昨日    285 6.857980995127572   2.6387249970833997  0.1774535638537181
王婆    284 7.1799090900149345  2.3129267619188907  0.3758007117437722

Extraction results for Journey to the West (《西游记》)

八戒    1807    7.88874324889826    2.00952580557629    0.36441586280814575
师父    1632    7.507794640198696   3.745294449785798   0.1371395690812608
大圣    1270    6.599912842187128   2.7790919785432147  0.13128460061010055
唐僧    1003    7.076815597050832   4.350465172292435   0.43277723258096173
菩萨    765 9.471675214392045   3.6013747138664756  0.15910495734948696
妖精    634 7.199672344836364   3.1817261900583627  0.13134411600669268
徒弟    439 8.060695931687555   2.498555429145656   0.15553809897879026
兄弟    284 7.845490050944376   2.93037668783551    0.16085578446909668
宝贝    283 9.319672120946995   2.616164396748633   0.15108220492589827
今日    282 6.714245517666122   2.1303069812971214  0.1774535638537181
取经    263 7.539158811108032   2.663944888382171   0.10181178023912565
如今    259 6.189824558880018   2.056188859866133   0.1780766096169519
认得    223 6.357552004618085   2.9543379335926954  0.2326782564877803
东土    212 8.422064766172811   3.326253983395916   0.14745277618775043
孙大圣  202 6.022367813028454   2.4886576514017107  0.13128460061010055
变作    189 7.554588851677638   3.0713596792578635  0.23452975920036348
玉帝    189 8.912889336229961   2.973106046717708   0.27518689925240297
土地    179 7.499845887083206   3.1206506190132566  0.2819944064037033
欢喜    173 8.861086905995393   2.184918471204895   0.31727272727272726
贫僧    170 7.400879436282184   2.0731236036504477  0.43277723258096173

Extraction results for the Lagou job-description (JD) corpus

工作	641962	11.645208082774683	4.083574124851783	0.11247281022865935
开发	348538	14.031184262140844	4.37645153459778	0.18409496065046307
相关	300517	10.477758266443889	5.038915743418073	0.1758213331033888
合作	159688	10.397674632948268	3.9963476653135794	0.19498851077798446
专业	158831	10.712527000439824	3.152041650598071	0.2640750670241287
测试	158179	13.65362883340751	4.464104436545589	0.18344308560677328
互联网	148818	16.106992250086762	3.9556191209604314	0.407386403912951
活动	131099	10.391243589427443	3.9155422678129406	0.20137250696976194
维护	120316	12.681677655209691	3.2400117935377266	0.1960306406685237
问题	112116	9.159871336778389	2.314215135279833	0.20283174185051037
优化	109563	11.324180546618742	4.331660381832997	0.2456782591010779
营销	105845	14.36850646150769	5.097001962525406	0.14961371773129828
平台	100783	9.002815015607053	4.443804901153697	0.2877423571272965
培训	93204	9.041659151637216	3.8898570467819824	0.13345998575160295
资源	90339	8.651051691178928	4.063430372719874	0.14695817490494298
相关专业	87545	8.988684686772165	2.4897196388075598	0.2905199904149232
网站	87182	8.92184093707449	5.465843476701055	0.21266038137095059
独立	86111	9.074141462752506	3.1456261690072957	0.19050261614079594
一定	83798	8.335390354693924	2.107303660112154	0.26157299167679793
流程	83165	9.321928094887362	2.5509378861028074	0.2063141084699957
网络	82742	9.087462841250339	4.681429111504988	0.21266038137095059
优秀	74600	9.370687406807217	2.0756995478573135	0.2899855507391353
信息	71009	9.820178962415188	4.2602697278449755	0.18863532864443658
媒体	67533	10.556506054671928	4.615376861300178	0.17976710334788937
编写	64337	7.960001932068081	3.482400585501417	0.265625
思维	62351	8.741466986401146	2.4320664807326646	0.15396736072031514
规划	59733	7.851749041416057	2.936854928368285	0.14166201896263245
移动	59671	10.10459875356437	3.4421932833155653	0.20137250696976194
渠道	59072	9.513727595952437	4.597891463808354	0.23578595317725753
关系	58483	8.348728154231077	2.4369558675502927	0.3170022612253688
积极	57295	9.044394119358454	2.763249521041074	0.1746848469256496
实施	56645	7.781359713524661	4.371966846513886	0.15944453739334113
福利	55732	8.475733430966399	2.4036919305145426	0.20908952728378172
其他	55665	8.434628227636725	2.9614863103296867	0.15943975441289332
功能	55087	7.787902559391432	4.1663586610392755	0.18097560975609756
代码	52431	7.88874324889826	3.876917512626917	0.2135697048449972
微信	49143	8.945443836377912	3.6868130380800643	0.18215857916308253
企业	48799	9.422064766172813	5.568662443510237	0.2905199904149232
提升	48446	8.233619676759702	3.7390647282620666	0.29750778816199375
质量	47918	10.861862340059153	3.391825261582227	0.10921827734437191
人员	47109	7.774787059601174	5.249783964892326	0.13589632038101343
数据库	45445	8.290018846932618	4.123423571610193	0.2640569395017794
商务	44047	8.189824558880018	3.44858516585648	0.12901085044961344
主动	42628	13.815583433851023	2.5049637884195137	0.1968791796700847
创意	41768	14.396470993910388	4.115068825929573	0.30544056771141337
工具	40227	9.927777962082342	2.208874047820781	0.11247281022865935
等相关	39230	11.919608238603255	3.0330398736413557	0.1758213331033888
提出	38741	10.179909090014934	4.46446156782086	0.13053040103492886
各类	38309	8.344295907915816	5.136417986953123	0.3969948596283116
操作	37061	9.06339508128851	4.676836974292029	0.23452975920036348
收集	36600	8.800899899920305	2.797691452951563	0.11388512456999896
过程	36534	8.214319120800766	2.5633950372758565	0.2063141084699957
数据分析	36081	8.442943495848729	3.5589033442862585	0.2640569395017794

Extraction results for Quan Song Ci (全宋词, the complete Song ci poetry)

何处	388	6.491853096329675	3.3628674437455617	0.6815015936725298
东风	286	5.392317422778761	4.458774408044057	0.19724622030237582
江南	250	6.409390936137703	3.903802705407174	0.10545138034778331
春风	237	3.5849625007211565	4.927775131630969	0.16484505593416485
相思	225	6.614709844115209	4.358855443007008	0.242072962836686
千里	218	6.409390936137703	4.4108660037595	0.2562873368242496
人间	200	5.357552004618084	3.6298146463975085	0.13589632038101343
明月	196	5.357552004618084	4.461698115330817	0.2009720696427977
归来	195	5.08746284125034	4.510975805812117	0.4260707923476106
尊前	190	7.607330313749611	3.7677180601390012	0.1516088400320623
相逢	179	7.426264754702098	3.729594240735622	0.2827298050139276
芳草	176	7.409390936137703	4.193709696939418	0.10797973400886637
多情	175	6.247927513443586	3.8156445316213303	0.3327408912022344
阑干	167	9.30149619498255	4.1027945328835855	0.17564639607106747
梅花	159	4.807354922057604	4.829461592976214	0.1725721995566835
年年	157	3.8073549220576037	3.401504022650184	0.10157033077180087
无人	150	2.807354922057604	4.773999920722275	0.35809310100061825
如今	148	5.7279204545632	2.4554158038937834	0.1780766096169519
回首	145	7.94251450533924	3.197825274741958	0.20080445544554457
天涯	142	7.74819284958946	4.087307754334477	0.4339155749636099
一枝	135	5.20945336562895	3.5111675192832683	0.2674922938432581
当时	134	6.08746284125034	3.2683525636568564	0.14850198715988994
流水	132	5.700439718141093	4.024081009656002	0.13549047394111163
佳人	131	5.20945336562895	3.0918026501936384	0.22896958600345846
西风	128	4.321928094887363	4.310178372466687	0.19724622030237582
依旧	125	7.768184324776926	3.8821144630683277	0.1728525980911983
故人	122	5.392317422778761	2.9526098687901237	0.2363130219610269
今夜	121	5.554588851677638	3.239568407653533	0.2543231961836613
少年	120	5.357552004618084	2.8645866477158934	0.23419345103365022
春色	120	5.129283016944966	4.576389958371988	0.16484505593416485

dict_build's People

Contributors

dependabot[bot], doumiaoo-oo, hyangyt, sing1ee, xv44586, ylongo, zhiyuanhubj

dict_build's Issues

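Which data structures are used internally?

Hello,
Which algorithm do you use to count substring frequencies and left/right entropy?
To be both fast and memory-efficient, I would guess it needs a trie and the Nagao algorithm?

I would like to use your code to gather statistics on a corpus,
but I have not gotten around to reading the code carefully.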

How should the googlecode RadixTree packages used in FastBuilder be imported?

import com.googlecode.concurrenttrees.radix.ConcurrentRadixTree;
import com.googlecode.concurrenttrees.radix.RadixTree;
import com.googlecode.concurrenttrees.radix.node.concrete.DefaultCharArrayNodeFactory;

How do I import these three packages used by FastBuilder in package dict.build? I am a beginner and would appreciate some guidance!

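What do the output columns mean?

Besides the word, its frequency, the degree of free use (the minimum of the left and right neighbor entropies), and the degree of cohesion, what else is in the output?
Also, in the source code the words.data file is written as word, cohesion, free use; there is no frequency and no extra column, so the code that actually ran does not seem to be the published source.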

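About the character range in isChinese

public static boolean isChinese(char a) {
    int v = (int) a;
    return (v >= 19968 && v <= 171941);
}

In hexadecimal, the range in this condition is [0x4e00, 0x29fa5]. Why is the upper bound 0x29fa5? As far as I can tell, the upper bound of the Chinese Unicode range should be 0x9fa5 (40869 in decimal). Is the 0x29fa5 in your code deliberate or a typo?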
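
One observation that may help here: a Java `char` is a 16-bit value, so it can never exceed 0xFFFF (65535), and an upper bound of 171941 on a `char` is always satisfied. The sketch below shows two alternative checks that operate on `int` code points instead. The class `HanCheck` and its method names are hypothetical and are not part of dict_build; which range the project actually intends is not stated in the source.

```java
public class HanCheck {
    /** Range check on U+4E00..U+9FA5, the bounds mentioned in the issue. */
    static boolean isBasicCjk(int codePoint) {
        return codePoint >= 0x4E00 && codePoint <= 0x9FA5;
    }

    /** Script-based check; also covers Han characters outside the Basic Multilingual Plane. */
    static boolean isHan(int codePoint) {
        return Character.UnicodeScript.of(codePoint) == Character.UnicodeScript.HAN;
    }

    public static void main(String[] args) {
        // 'a', U+4E2D, and U+20000 (written as a surrogate pair)
        String s = "a\u4E2D\uD840\uDC00";
        s.codePoints().forEach(cp ->
                System.out.printf("U+%X basic=%b han=%b%n", cp, isBasicCjk(cp), isHan(cp)));
    }
}
```

Iterating with `codePoints()` rather than `charAt` is what lets supplementary characters reach the check as a single value.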

A bug

The writer should be flushed before sorting; otherwise the sorted file will be missing lines.
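
A minimal sketch of the pattern this report describes, assuming the words are written through a buffered writer and the file is read back for sorting. The class `FlushBeforeSort` and the in-memory sort are illustrative only; dict_build's own writer and its merge sort are not shown here.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class FlushBeforeSort {
    /** Writes lines to file, makes sure they are on disk, then reads them back sorted. */
    static List<String> writeThenSort(Path file, List<String> lines) throws IOException {
        // try-with-resources closes the writer, and close() flushes the buffer,
        // so everything is written before the file is read again below.
        try (BufferedWriter writer = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            for (String line : lines) {
                writer.write(line);
                writer.newLine();
            }
        }
        List<String> sorted = new ArrayList<>(Files.readAllLines(file, StandardCharsets.UTF_8));
        Collections.sort(sorted);
        return sorted;
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("words", ".data");
        List<String> input = new ArrayList<>();
        Collections.addAll(input, "b", "a", "c");
        System.out.println(writeThenSort(tmp, input));
        Files.delete(tmp);
    }
}
```

If the writer has to stay open, calling `writer.flush()` before the sort step serves the same purpose.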

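About parallel invocation

On the master branch, both the intermediate temporary files produced during processing and the final word file are written to the same path as the original data file.
As a result, when a data directory contains many files and we want to invoke your program in parallel from a shell, things get awkward.

Our current plan is to schedule the runs with a shell script:

  • Create a unique new folder for each data file in the source folder
  • Move the data file into it
  • Invoke your program
  • Move the data back once processing is done
  • Rename the final result file and move it to a designated location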
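
The steps above can be sketched as follows. It is written in Java rather than shell only to match the project's language. The class `PerFileWorkdir`, the methods `stage` and `collect`, the result naming scheme, and the four command-line arguments are all hypothetical. The only things taken from the README are that the launcher receives the absolute path of the data file and that words_sort.data is written next to it.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

public class PerFileWorkdir {
    /** Moves dataFile into a fresh, uniquely named directory under workRoot; returns its new path. */
    static Path stage(Path dataFile, Path workRoot) throws IOException {
        Files.createDirectories(workRoot);
        Path dir = Files.createTempDirectory(workRoot, dataFile.getFileName().toString() + "-");
        return Files.move(dataFile, dir.resolve(dataFile.getFileName()));
    }

    /** Moves the result (if present) to outDir under a per-input name, then moves the data file back. */
    static void collect(Path staged, Path originalDir, Path outDir) throws IOException {
        Files.createDirectories(outDir);
        Path result = staged.resolveSibling("words_sort.data");
        if (Files.exists(result)) {
            Path target = outDir.resolve(staged.getFileName() + ".words_sort.data");
            Files.move(result, target, StandardCopyOption.REPLACE_EXISTING);
        }
        Files.move(staged, originalDir.resolve(staged.getFileName()));
        // The working directory is left in place; it may still hold intermediate files.
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 4) {
            System.err.println("usage: PerFileWorkdir <launcher> <dataFile> <workRoot> <outDir>");
            System.exit(2);
        }
        Path launcher = Paths.get(args[0]).toAbsolutePath();
        Path data = Paths.get(args[1]).toAbsolutePath();
        Path originalDir = data.getParent();
        Path staged = stage(data, Paths.get(args[2]).toAbsolutePath());
        // Run the extractor on the staged copy; each input gets its own directory.
        int exit = new ProcessBuilder(launcher.toString(), staged.toString())
                .inheritIO().start().waitFor();
        collect(staged, originalDir, Paths.get(args[3]).toAbsolutePath());
        System.exit(exit);
    }
}
```

Because every input file gets its own working directory, several such processes can run at once without writing over each other's intermediate or result files.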

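Is there a simple way to modify the source so that multiple files in the same folder can be processed at the same time?
We tried changing the path-related parts of FastBuilder.java in several places, without success.

I have also only just noticed the parallel branch, but it seems to be identical to master?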

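Positional word-formation probability & corpus

Does the positional word-formation probability refer to BMES tags? Have you compared against results without BMES?
Also, how well does it work on a vertical-domain corpus such as the Lagou job descriptions?
Thanks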

Is a particular Java version required?

I hit an unsupported-version error on a Linux development machine:
Exception in thread "main" java.lang.UnsupportedClassVersionError: dict/build/Main : Unsupported major.minor version 52.0
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:482)
My Java version is java version "1.7.0_79"
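
For context, class-file major version 52 corresponds to Java 8 and 51 to Java 7, so a Java 7 runtime rejects classes compiled for Java 8. The sketch below reads the major version from a class file header so a build can be checked before it is run. The class `ClassVersion` is a hypothetical helper, not part of dict_build.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.file.Files;
import java.nio.file.Paths;

public class ClassVersion {
    /**
     * Returns the major version stored in a class file header:
     * 4 bytes magic (0xCAFEBABE), 2 bytes minor version, 2 bytes major version, big-endian.
     */
    static int majorVersion(byte[] classBytes) {
        if (classBytes.length < 8) {
            throw new IllegalArgumentException("too short to be a class file");
        }
        ByteBuffer buf = ByteBuffer.wrap(classBytes); // big-endian by default
        if (buf.getInt() != 0xCAFEBABE) {
            throw new IllegalArgumentException("missing 0xCAFEBABE magic number");
        }
        buf.getShort(); // skip the minor version
        return buf.getShort() & 0xFFFF;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: ClassVersion <path to .class file>");
            System.exit(2);
        }
        int major = majorVersion(Files.readAllBytes(Paths.get(args[0])));
        System.out.println("major version: " + major);
    }
}
```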

A question about the left/right entropy calculation

1. Does pos_prop.txt contain data from the Sogou dictionary?

double pp = -1;
if (null != posProp.get(first) && null != posProp.get(trailing))
    pp = Math.min(posProp.get(first)[0], posProp.get(trailing)[2]);

Doesn't this mean that a character missing from pos_prop can never form a word?

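How do I use this?

Please describe the expected format of the input file. I have tried two forms of input so far, segmented and unsegmented, and in both cases the output file is 0 KB.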

Windows

Could someone advise: is this meant to run on Linux? On Windows, double-clicking dict_build.bat only prints rawpath. Thanks~

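No space left on device

  1. I am not sure why: the machine does have free storage and this is not an OOM, yet it reports that no space is left
  2. Switching to a small data set produces the same error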

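What does the pos_prop.txt file mean and how is it used?

I am studying the source code of this project. sing1ee, is the project's pos_prop.txt the result of training? What do its three columns of numbers represent? I ask because I came across the following code: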
pp = Math.min(posProp.get(w.subSequence(0, 1))[0], posProp.get(w.subSequence(w.length() - 1, w.length()))[2]);

I am working on new-word extraction and would like to discuss it with you further.

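A question about how total and freq are computed

1. PMI = log(P(x,y) / (P(x)P(y))), with P(x,y) = freq/N and P(x) = freq_x/N, so PMI = log(freq·N / (freq_x·freq_y)). Here total = word_frag_count, which looks like the wrong quantity; it should be the number of all characters in the corpus.
2. freq is taken as the sum over the right-hand prefix tree, which misses occurrences where nothing follows the word, for example when the word ends a sentence (吧/呢). Would it be better to count the word's freq directly?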

About the PMI calculation

@sing1ee Hello, why is PMI computed this way?

double pf = freq * total / max;
double pmi = Math.log(pf) / Math.log(2);
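
One possible reading of the two lines above, offered only as a sketch: if `max` is the largest product of the counts of the two parts over every way of cutting the candidate in two, then `freq * total / max` is the smallest value of freq·N / (freq_x·freq_y), and its base-2 logarithm is the minimum PMI over all splits. That interpretation of `max`, the class `PmiSketch`, and the `splits` parameter are assumptions; the quoted snippet does not say how `max` is obtained.

```java
public class PmiSketch {
    /**
     * @param freq   count of the candidate word
     * @param total  normalizing count N
     * @param splits one row per two-way split: {count of left part, count of right part}
     * @return log2(freq * total / max), where max is the largest left*right product
     */
    static double pmi(long freq, long total, long[][] splits) {
        double max = 0.0;
        for (long[] s : splits) {
            max = Math.max(max, (double) s[0] * s[1]);
        }
        if (max == 0.0) {
            throw new IllegalArgumentException("need at least one split with non-zero counts");
        }
        double pf = (double) freq * total / max;
        return Math.log(pf) / Math.log(2);
    }

    public static void main(String[] args) {
        // A word seen 8 times among N = 16, whose parts are each seen 8 times.
        System.out.println(pmi(8, 16, new long[][] { { 8, 8 } }));
    }
}
```

Taking the maximum product in the denominator is the conservative choice: a candidate scores high only if none of its splits is explained by two frequent parts occurring together by chance.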

Where is the words_sort.data file?

I cannot find the result file. I only see a logs folder, and no error was reported.
Here is the content of the info log file:

2017-05-30 15:14:58.201 [main] INFO dict.build.FastBuilder - start to extract words
2017-05-30 15:14:58.283 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 1000
2017-05-30 15:14:58.291 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 2000
2017-05-30 15:14:58.296 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 3000
2017-05-30 15:14:58.301 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 4000
2017-05-30 15:14:58.312 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 5000
2017-05-30 15:14:58.319 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 6000
2017-05-30 15:14:58.328 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 7000
2017-05-30 15:14:58.336 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 8000
2017-05-30 15:14:58.344 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 9000
2017-05-30 15:14:58.353 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 10000
2017-05-30 15:14:58.361 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 11000
2017-05-30 15:14:58.371 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 12000
2017-05-30 15:14:58.379 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 13000
2017-05-30 15:14:58.386 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 14000
2017-05-30 15:14:58.392 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 15000
2017-05-30 15:14:58.400 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 16000
2017-05-30 15:14:58.409 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 17000
2017-05-30 15:14:58.415 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 18000
2017-05-30 15:14:58.420 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 19000
2017-05-30 15:14:58.425 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 20000
2017-05-30 15:14:58.431 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 21000
2017-05-30 15:14:58.440 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 22000
2017-05-30 15:14:58.444 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 23000
2017-05-30 15:14:58.449 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 24000
2017-05-30 15:14:58.457 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 25000
2017-05-30 15:14:58.463 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 26000
2017-05-30 15:14:58.466 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 27000
2017-05-30 15:14:58.468 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 28000
2017-05-30 15:14:58.470 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 29000
2017-05-30 15:14:58.472 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 30000
2017-05-30 15:14:58.475 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 31000
2017-05-30 15:14:58.477 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 32000
2017-05-30 15:14:58.483 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 33000
2017-05-30 15:14:58.489 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 34000
2017-05-30 15:14:58.491 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 35000
2017-05-30 15:14:58.494 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 36000
2017-05-30 15:14:58.496 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 37000
2017-05-30 15:14:58.499 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 38000
2017-05-30 15:14:58.505 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 39000
2017-05-30 15:14:58.510 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 40000
2017-05-30 15:14:58.514 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 41000
2017-05-30 15:14:58.518 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 42000
2017-05-30 15:14:58.523 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 43000
2017-05-30 15:14:58.526 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 44000
2017-05-30 15:14:58.531 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 45000
2017-05-30 15:14:58.536 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 46000
2017-05-30 15:14:58.540 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 47000
2017-05-30 15:14:58.544 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 48000
2017-05-30 15:14:58.547 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 49000
2017-05-30 15:14:58.553 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 50000
2017-05-30 15:14:58.560 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 51000
2017-05-30 15:14:58.566 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 52000
2017-05-30 15:14:58.570 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 53000
2017-05-30 15:14:58.571 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 54000
2017-05-30 15:14:58.574 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 55000
2017-05-30 15:14:58.583 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 56000
2017-05-30 15:14:58.586 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 57000
2017-05-30 15:14:58.588 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 58000
2017-05-30 15:14:58.590 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 59000
2017-05-30 15:14:58.593 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 60000
2017-05-30 15:14:58.597 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 61000
2017-05-30 15:14:58.603 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 62000
2017-05-30 15:14:58.609 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 63000
2017-05-30 15:14:58.611 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 64000
2017-05-30 15:14:58.613 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 65000
2017-05-30 15:14:58.615 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 66000
2017-05-30 15:14:58.618 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 67000
2017-05-30 15:14:58.620 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 68000
2017-05-30 15:14:58.622 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 69000
2017-05-30 15:14:58.624 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 70000
2017-05-30 15:14:58.627 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 71000
2017-05-30 15:14:58.628 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 72000
2017-05-30 15:14:58.630 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 73000
2017-05-30 15:14:58.632 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 74000
2017-05-30 15:14:58.635 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 75000
2017-05-30 15:14:58.638 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 76000
2017-05-30 15:14:58.640 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 77000
2017-05-30 15:14:58.641 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 78000
2017-05-30 15:14:58.643 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 79000
2017-05-30 15:14:58.645 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 80000
2017-05-30 15:14:58.647 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 81000
2017-05-30 15:14:58.649 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 82000
2017-05-30 15:14:58.652 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 83000
2017-05-30 15:14:58.657 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 84000
2017-05-30 15:14:58.659 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 85000
2017-05-30 15:14:58.662 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 86000
2017-05-30 15:14:58.667 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 87000
2017-05-30 15:14:58.669 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 88000
2017-05-30 15:14:58.670 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 89000
2017-05-30 15:14:58.672 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 90000
2017-05-30 15:14:58.674 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 91000
2017-05-30 15:14:58.675 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 92000
2017-05-30 15:14:58.677 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 93000
2017-05-30 15:14:58.679 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 94000
2017-05-30 15:14:58.680 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 95000
2017-05-30 15:14:58.682 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 96000
2017-05-30 15:14:58.687 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 97000
2017-05-30 15:14:58.689 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 98000
2017-05-30 15:14:58.691 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 99000
2017-05-30 15:14:58.693 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 100000
2017-05-30 15:14:58.695 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 101000
2017-05-30 15:14:58.700 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 102000
2017-05-30 15:14:58.711 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 103000
2017-05-30 15:14:58.723 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 104000
2017-05-30 15:14:58.730 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 105000
2017-05-30 15:14:58.734 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 106000
2017-05-30 15:14:58.737 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 107000
2017-05-30 15:14:58.739 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 108000
2017-05-30 15:14:58.753 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 109000
2017-05-30 15:14:58.755 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 110000
2017-05-30 15:14:58.757 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 111000
2017-05-30 15:14:58.759 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 112000
2017-05-30 15:14:58.761 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 113000
2017-05-30 15:14:58.764 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 114000
2017-05-30 15:14:58.766 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 115000
2017-05-30 15:14:58.768 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 116000
2017-05-30 15:14:58.770 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 117000
2017-05-30 15:14:58.772 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 118000
2017-05-30 15:14:58.774 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 119000
2017-05-30 15:14:58.776 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 120000
2017-05-30 15:14:58.779 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 121000
2017-05-30 15:14:58.781 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 122000
2017-05-30 15:14:58.783 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 123000
2017-05-30 15:14:58.786 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 124000
2017-05-30 15:14:58.788 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 125000
2017-05-30 15:14:58.790 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 126000
2017-05-30 15:14:58.792 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 127000
2017-05-30 15:14:58.794 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 128000
2017-05-30 15:14:58.796 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 129000
2017-05-30 15:14:58.798 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 130000
2017-05-30 15:14:58.800 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 131000
2017-05-30 15:14:58.802 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 132000
2017-05-30 15:14:58.804 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 133000
2017-05-30 15:14:58.806 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 134000
2017-05-30 15:14:58.809 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 135000
2017-05-30 15:14:58.811 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 136000
2017-05-30 15:14:58.813 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 137000
2017-05-30 15:14:58.815 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 138000
2017-05-30 15:14:58.817 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 139000
2017-05-30 15:14:58.819 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 140000
2017-05-30 15:14:58.820 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 141000
2017-05-30 15:14:58.823 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 142000
2017-05-30 15:14:58.828 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 143000
2017-05-30 15:14:58.832 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 144000
2017-05-30 15:14:58.834 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 145000
2017-05-30 15:14:58.837 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 146000
2017-05-30 15:14:58.839 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 147000
2017-05-30 15:14:58.841 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 148000
2017-05-30 15:14:58.843 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 149000
2017-05-30 15:14:58.844 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 150000
2017-05-30 15:14:58.848 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 151000
2017-05-30 15:14:58.851 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 152000
2017-05-30 15:14:58.853 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 153000
2017-05-30 15:14:58.855 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 154000
2017-05-30 15:14:58.859 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 155000
2017-05-30 15:14:58.861 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 156000
2017-05-30 15:14:58.864 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 157000
2017-05-30 15:14:58.866 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 158000
2017-05-30 15:14:58.868 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 159000
2017-05-30 15:14:58.870 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 160000
2017-05-30 15:14:58.872 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 161000
2017-05-30 15:14:58.874 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 162000
2017-05-30 15:14:58.876 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 163000
2017-05-30 15:14:58.878 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 164000
2017-05-30 15:14:58.880 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 165000
2017-05-30 15:14:58.885 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 166000
2017-05-30 15:14:58.889 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 167000
2017-05-30 15:14:58.891 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 168000
2017-05-30 15:14:58.893 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 169000
2017-05-30 15:14:58.896 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 170000
2017-05-30 15:14:58.898 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 171000
2017-05-30 15:14:58.900 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 172000
2017-05-30 15:14:58.903 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 173000
2017-05-30 15:14:58.909 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 174000
2017-05-30 15:14:58.914 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 175000
2017-05-30 15:14:58.917 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 176000
2017-05-30 15:14:58.920 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 177000
2017-05-30 15:14:58.932 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 178000
2017-05-30 15:14:58.934 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 179000
2017-05-30 15:14:58.937 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 180000
2017-05-30 15:14:58.942 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 181000
2017-05-30 15:14:58.945 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 182000
2017-05-30 15:14:58.946 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 183000
2017-05-30 15:14:58.948 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 184000
2017-05-30 15:14:58.951 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 185000
2017-05-30 15:14:58.953 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 186000
2017-05-30 15:14:58.955 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 187000
2017-05-30 15:14:58.957 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 188000
2017-05-30 15:14:58.960 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 189000
2017-05-30 15:14:58.962 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 190000
2017-05-30 15:14:58.964 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 191000
2017-05-30 15:14:58.966 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 192000
2017-05-30 15:14:58.969 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 193000
2017-05-30 15:14:58.972 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 194000
2017-05-30 15:14:58.974 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 195000
2017-05-30 15:14:58.976 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 196000
2017-05-30 15:14:58.978 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 197000
2017-05-30 15:14:58.980 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 198000
2017-05-30 15:14:58.982 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 199000
2017-05-30 15:14:58.986 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 200000
2017-05-30 15:14:58.988 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 201000
2017-05-30 15:14:58.990 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 202000
2017-05-30 15:14:58.992 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 203000
2017-05-30 15:14:58.994 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 204000
2017-05-30 15:14:58.996 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 205000
2017-05-30 15:14:58.998 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 206000
2017-05-30 15:14:59.000 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 207000
2017-05-30 15:14:59.002 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 208000
2017-05-30 15:14:59.005 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 209000
2017-05-30 15:14:59.007 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 210000
2017-05-30 15:14:59.008 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 211000
2017-05-30 15:14:59.011 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 212000
2017-05-30 15:14:59.014 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 213000
2017-05-30 15:14:59.020 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 214000
2017-05-30 15:14:59.021 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 215000
2017-05-30 15:14:59.023 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 216000
2017-05-30 15:14:59.025 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 217000
2017-05-30 15:14:59.028 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 218000
2017-05-30 15:14:59.030 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 219000
2017-05-30 15:14:59.033 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 220000
2017-05-30 15:14:59.035 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 221000
2017-05-30 15:14:59.037 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 222000
2017-05-30 15:14:59.040 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 223000
2017-05-30 15:14:59.042 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 224000
2017-05-30 15:14:59.044 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 225000
2017-05-30 15:14:59.047 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 226000
2017-05-30 15:14:59.048 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 227000
2017-05-30 15:14:59.051 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 228000
2017-05-30 15:14:59.054 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 229000
2017-05-30 15:14:59.055 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 230000
2017-05-30 15:14:59.058 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 231000
2017-05-30 15:14:59.061 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 232000
2017-05-30 15:14:59.065 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 233000
2017-05-30 15:14:59.067 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 234000
2017-05-30 15:14:59.069 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 235000
2017-05-30 15:14:59.072 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 236000
2017-05-30 15:14:59.074 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 237000
2017-05-30 15:14:59.077 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 238000
2017-05-30 15:14:59.079 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 239000
2017-05-30 15:14:59.081 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 240000
2017-05-30 15:14:59.083 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 241000
2017-05-30 15:14:59.086 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 242000
2017-05-30 15:14:59.091 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 243000
2017-05-30 15:14:59.093 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 244000
2017-05-30 15:14:59.097 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 245000
2017-05-30 15:14:59.098 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 246000
2017-05-30 15:14:59.103 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 247000
2017-05-30 15:14:59.106 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 248000
2017-05-30 15:14:59.112 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 249000
2017-05-30 15:14:59.118 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 250000
2017-05-30 15:14:59.122 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 251000
2017-05-30 15:14:59.125 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 252000
2017-05-30 15:14:59.130 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 253000
2017-05-30 15:14:59.133 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 254000
2017-05-30 15:14:59.137 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 255000
2017-05-30 15:14:59.140 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 256000
2017-05-30 15:14:59.143 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 257000
2017-05-30 15:14:59.146 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 258000
2017-05-30 15:14:59.149 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 259000
2017-05-30 15:14:59.156 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 260000
2017-05-30 15:14:59.161 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 261000
2017-05-30 15:14:59.167 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 262000
2017-05-30 15:14:59.171 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 263000
2017-05-30 15:14:59.185 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 264000
2017-05-30 15:14:59.189 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 265000
2017-05-30 15:14:59.191 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 266000
2017-05-30 15:14:59.193 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 267000
2017-05-30 15:14:59.196 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 268000
2017-05-30 15:14:59.199 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 269000
2017-05-30 15:14:59.205 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 270000
2017-05-30 15:14:59.213 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 271000
2017-05-30 15:14:59.221 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 272000
2017-05-30 15:14:59.223 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 273000
2017-05-30 15:14:59.225 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 274000
2017-05-30 15:14:59.227 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 275000
2017-05-30 15:14:59.231 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 276000
2017-05-30 15:14:59.233 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 277000
2017-05-30 15:14:59.235 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 278000
2017-05-30 15:14:59.239 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 279000
2017-05-30 15:14:59.243 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 280000
2017-05-30 15:14:59.245 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 281000
2017-05-30 15:14:59.248 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 282000
2017-05-30 15:14:59.250 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 283000
2017-05-30 15:14:59.253 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 284000
2017-05-30 15:14:59.256 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 285000
2017-05-30 15:14:59.259 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 286000
2017-05-30 15:14:59.263 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 287000
2017-05-30 15:14:59.267 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 288000
2017-05-30 15:14:59.273 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 289000
2017-05-30 15:14:59.275 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 290000
2017-05-30 15:14:59.278 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 291000
2017-05-30 15:14:59.280 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 292000
2017-05-30 15:14:59.283 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 293000
2017-05-30 15:14:59.286 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 294000
2017-05-30 15:14:59.288 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 295000
2017-05-30 15:14:59.291 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 296000
2017-05-30 15:14:59.293 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 297000
2017-05-30 15:14:59.297 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 298000
2017-05-30 15:14:59.300 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 299000
2017-05-30 15:14:59.303 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 300000
2017-05-30 15:14:59.306 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 301000
2017-05-30 15:14:59.310 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 302000
2017-05-30 15:14:59.314 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 303000
2017-05-30 15:14:59.318 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 304000
2017-05-30 15:14:59.323 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 305000
2017-05-30 15:14:59.327 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 306000
2017-05-30 15:14:59.331 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 307000
2017-05-30 15:14:59.334 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 308000
2017-05-30 15:14:59.338 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 309000
2017-05-30 15:14:59.340 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 310000
2017-05-30 15:14:59.345 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 311000
2017-05-30 15:14:59.347 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 312000
2017-05-30 15:14:59.350 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 313000
2017-05-30 15:14:59.353 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 314000
2017-05-30 15:14:59.357 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 315000
2017-05-30 15:14:59.361 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 316000
2017-05-30 15:14:59.364 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 317000
2017-05-30 15:14:59.368 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 318000
2017-05-30 15:14:59.373 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 319000
2017-05-30 15:14:59.375 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 320000
2017-05-30 15:14:59.378 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 321000
2017-05-30 15:14:59.382 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 322000
2017-05-30 15:14:59.388 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 323000
2017-05-30 15:14:59.392 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 324000
2017-05-30 15:14:59.394 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 325000
2017-05-30 15:14:59.397 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 326000
2017-05-30 15:14:59.399 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 327000
2017-05-30 15:14:59.401 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 328000
2017-05-30 15:14:59.404 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 329000
2017-05-30 15:14:59.405 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 330000
2017-05-30 15:14:59.407 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 331000
2017-05-30 15:14:59.412 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 332000
2017-05-30 15:14:59.428 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 333000
2017-05-30 15:14:59.435 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 334000
2017-05-30 15:14:59.440 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 335000
2017-05-30 15:14:59.443 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 336000
2017-05-30 15:14:59.449 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 337000
2017-05-30 15:14:59.452 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 338000
2017-05-30 15:14:59.457 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 339000
2017-05-30 15:14:59.459 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 340000
2017-05-30 15:14:59.466 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 341000
2017-05-30 15:14:59.470 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 342000
2017-05-30 15:14:59.473 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 343000
2017-05-30 15:14:59.478 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 344000
2017-05-30 15:14:59.480 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 345000
2017-05-30 15:14:59.484 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 346000
2017-05-30 15:14:59.490 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 347000
2017-05-30 15:14:59.493 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 348000
2017-05-30 15:14:59.497 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 349000
2017-05-30 15:14:59.500 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 350000
2017-05-30 15:14:59.503 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 351000
2017-05-30 15:14:59.506 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 352000
2017-05-30 15:14:59.510 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 353000
2017-05-30 15:14:59.516 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 354000
2017-05-30 15:14:59.519 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 355000
2017-05-30 15:14:59.523 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 356000
2017-05-30 15:14:59.526 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 357000
2017-05-30 15:14:59.528 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 358000
2017-05-30 15:14:59.530 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 359000
2017-05-30 15:14:59.537 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 360000
2017-05-30 15:14:59.542 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 361000
2017-05-30 15:14:59.544 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 362000
2017-05-30 15:14:59.547 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 363000
2017-05-30 15:14:59.551 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 364000
2017-05-30 15:14:59.556 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 365000
2017-05-30 15:14:59.559 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 366000
2017-05-30 15:14:59.562 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 367000
2017-05-30 15:14:59.566 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 368000
2017-05-30 15:14:59.570 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 369000
2017-05-30 15:14:59.574 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 370000
2017-05-30 15:14:59.580 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 371000
2017-05-30 15:14:59.586 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 372000
2017-05-30 15:14:59.595 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 373000
2017-05-30 15:14:59.602 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 374000
2017-05-30 15:14:59.611 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 375000
2017-05-30 15:14:59.614 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 376000
2017-05-30 15:14:59.618 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 377000
2017-05-30 15:14:59.620 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 378000
2017-05-30 15:14:59.627 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 379000
2017-05-30 15:14:59.630 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 380000
2017-05-30 15:14:59.632 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 381000
2017-05-30 15:14:59.634 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 382000
2017-05-30 15:14:59.638 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 383000
2017-05-30 15:14:59.643 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 384000
2017-05-30 15:14:59.646 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 385000
2017-05-30 15:14:59.650 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 386000
2017-05-30 15:14:59.652 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 387000
2017-05-30 15:14:59.655 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 388000
2017-05-30 15:14:59.662 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 389000
2017-05-30 15:14:59.666 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 390000
2017-05-30 15:14:59.671 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 391000
2017-05-30 15:14:59.675 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 392000
2017-05-30 15:14:59.681 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 393000
2017-05-30 15:14:59.690 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 394000
2017-05-30 15:14:59.703 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 395000
2017-05-30 15:14:59.707 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 396000
2017-05-30 15:14:59.710 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 397000
2017-05-30 15:14:59.717 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 398000
2017-05-30 15:14:59.724 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 399000
2017-05-30 15:14:59.728 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 400000
2017-05-30 15:14:59.740 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 401000
2017-05-30 15:14:59.753 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 402000
2017-05-30 15:14:59.762 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 403000
2017-05-30 15:14:59.776 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 404000
2017-05-30 15:14:59.782 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 405000
2017-05-30 15:14:59.788 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 406000
2017-05-30 15:14:59.799 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 407000
2017-05-30 15:14:59.810 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 408000
2017-05-30 15:14:59.819 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 409000
2017-05-30 15:14:59.827 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 410000
2017-05-30 15:14:59.841 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 411000
2017-05-30 15:14:59.848 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 412000
2017-05-30 15:14:59.858 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 413000
2017-05-30 15:14:59.866 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 414000
2017-05-30 15:14:59.871 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 415000
2017-05-30 15:14:59.875 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 416000
2017-05-30 15:14:59.883 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 417000
2017-05-30 15:14:59.902 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 418000
2017-05-30 15:14:59.907 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 419000
2017-05-30 15:14:59.914 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 420000
2017-05-30 15:14:59.926 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 421000
2017-05-30 15:14:59.944 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 422000
2017-05-30 15:14:59.950 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 423000
2017-05-30 15:14:59.959 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 424000
2017-05-30 15:14:59.973 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 425000
2017-05-30 15:14:59.978 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 426000
2017-05-30 15:14:59.983 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 427000
2017-05-30 15:14:59.990 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 428000
2017-05-30 15:14:59.999 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 429000
2017-05-30 15:15:00.008 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 430000
2017-05-30 15:15:00.015 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 431000
2017-05-30 15:15:00.023 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 432000
2017-05-30 15:15:00.028 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 433000
2017-05-30 15:15:00.032 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 434000
2017-05-30 15:15:00.039 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 435000
2017-05-30 15:15:00.043 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 436000
2017-05-30 15:15:00.048 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 437000
2017-05-30 15:15:00.051 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 438000
2017-05-30 15:15:00.056 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 439000
2017-05-30 15:15:00.061 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 440000
2017-05-30 15:15:00.069 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 441000
2017-05-30 15:15:00.074 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 442000
2017-05-30 15:15:00.082 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 443000
2017-05-30 15:15:00.088 [main] INFO dict.build.FastBuilder - load freq to radix tree done: 444000
2017-05-30 15:15:00.094 [main] INFO dict.build.FastBuilder - build freq TST done!
2017-05-30 15:15:00.115 [main] INFO dict.build.FastBuilder - extract words done: 1000
2017-05-30 15:15:00.127 [main] INFO dict.build.FastBuilder - extract words done: 2000
2017-05-30 15:15:00.136 [main] INFO dict.build.FastBuilder - extract words done: 3000
2017-05-30 15:15:00.149 [main] INFO dict.build.FastBuilder - extract words done: 4000
2017-05-30 15:15:00.157 [main] INFO dict.build.FastBuilder - extract words done: 5000
2017-05-30 15:15:00.164 [main] INFO dict.build.FastBuilder - extract words done: 6000
2017-05-30 15:15:00.170 [main] INFO dict.build.FastBuilder - extract words done: 7000
2017-05-30 15:15:00.177 [main] INFO dict.build.FastBuilder - extract words done: 8000
2017-05-30 15:15:00.184 [main] INFO dict.build.FastBuilder - extract words done: 9000
2017-05-30 15:15:00.191 [main] INFO dict.build.FastBuilder - extract words done: 10000
2017-05-30 15:15:00.199 [main] INFO dict.build.FastBuilder - extract words done: 11000
2017-05-30 15:15:00.206 [main] INFO dict.build.FastBuilder - extract words done: 12000
2017-05-30 15:15:00.213 [main] INFO dict.build.FastBuilder - extract words done: 13000
2017-05-30 15:15:00.221 [main] INFO dict.build.FastBuilder - extract words done: 14000
2017-05-30 15:15:00.227 [main] INFO dict.build.FastBuilder - extract words done: 15000
2017-05-30 15:15:00.233 [main] INFO dict.build.FastBuilder - extract words done: 16000
2017-05-30 15:15:00.240 [main] INFO dict.build.FastBuilder - extract words done: 17000
2017-05-30 15:15:00.246 [main] INFO dict.build.FastBuilder - extract words done: 18000
2017-05-30 15:15:00.252 [main] INFO dict.build.FastBuilder - extract words done: 19000
2017-05-30 15:15:00.259 [main] INFO dict.build.FastBuilder - extract words done: 20000
2017-05-30 15:15:00.265 [main] INFO dict.build.FastBuilder - extract words done: 21000
2017-05-30 15:15:00.272 [main] INFO dict.build.FastBuilder - extract words done: 22000
2017-05-30 15:15:00.279 [main] INFO dict.build.FastBuilder - extract words done: 23000
2017-05-30 15:15:00.286 [main] INFO dict.build.FastBuilder - extract words done: 24000
2017-05-30 15:15:00.292 [main] INFO dict.build.FastBuilder - extract words done: 25000
2017-05-30 15:15:00.298 [main] INFO dict.build.FastBuilder - extract words done: 26000
2017-05-30 15:15:00.305 [main] INFO dict.build.FastBuilder - extract words done: 27000
2017-05-30 15:15:00.311 [main] INFO dict.build.FastBuilder - extract words done: 28000
2017-05-30 15:15:00.317 [main] INFO dict.build.FastBuilder - extract words done: 29000
2017-05-30 15:15:00.323 [main] INFO dict.build.FastBuilder - extract words done: 30000
2017-05-30 15:15:00.329 [main] INFO dict.build.FastBuilder - extract words done: 31000
2017-05-30 15:15:00.335 [main] INFO dict.build.FastBuilder - extract words done: 32000
2017-05-30 15:15:00.342 [main] INFO dict.build.FastBuilder - extract words done: 33000
2017-05-30 15:15:00.348 [main] INFO dict.build.FastBuilder - extract words done: 34000
2017-05-30 15:15:00.354 [main] INFO dict.build.FastBuilder - extract words done: 35000
2017-05-30 15:15:00.362 [main] INFO dict.build.FastBuilder - extract words done: 36000
2017-05-30 15:15:00.368 [main] INFO dict.build.FastBuilder - extract words done: 37000
2017-05-30 15:15:00.374 [main] INFO dict.build.FastBuilder - extract words done: 38000
2017-05-30 15:15:00.381 [main] INFO dict.build.FastBuilder - extract words done: 39000
2017-05-30 15:15:00.387 [main] INFO dict.build.FastBuilder - extract words done: 40000
2017-05-30 15:15:00.394 [main] INFO dict.build.FastBuilder - extract words done: 41000
2017-05-30 15:15:00.400 [main] INFO dict.build.FastBuilder - extract words done: 42000
2017-05-30 15:15:00.407 [main] INFO dict.build.FastBuilder - extract words done: 43000
2017-05-30 15:15:00.414 [main] INFO dict.build.FastBuilder - extract words done: 44000
2017-05-30 15:15:00.423 [main] INFO dict.build.FastBuilder - extract words done: 45000
2017-05-30 15:15:00.430 [main] INFO dict.build.FastBuilder - extract words done: 46000
2017-05-30 15:15:00.436 [main] INFO dict.build.FastBuilder - extract words done: 47000
2017-05-30 15:15:00.443 [main] INFO dict.build.FastBuilder - extract words done: 48000
2017-05-30 15:15:00.450 [main] INFO dict.build.FastBuilder - extract words done: 49000
2017-05-30 15:15:00.457 [main] INFO dict.build.FastBuilder - extract words done: 50000
2017-05-30 15:15:00.463 [main] INFO dict.build.FastBuilder - extract words done: 51000
2017-05-30 15:15:00.470 [main] INFO dict.build.FastBuilder - extract words done: 52000
2017-05-30 15:15:00.476 [main] INFO dict.build.FastBuilder - extract words done: 53000
2017-05-30 15:15:00.483 [main] INFO dict.build.FastBuilder - extract words done: 54000
2017-05-30 15:15:00.490 [main] INFO dict.build.FastBuilder - extract words done: 55000
2017-05-30 15:15:00.496 [main] INFO dict.build.FastBuilder - extract words done: 56000
2017-05-30 15:15:00.503 [main] INFO dict.build.FastBuilder - extract words done: 57000
2017-05-30 15:15:00.510 [main] INFO dict.build.FastBuilder - extract words done: 58000
2017-05-30 15:15:00.517 [main] INFO dict.build.FastBuilder - extract words done: 59000
2017-05-30 15:15:00.524 [main] INFO dict.build.FastBuilder - extract words done: 60000
2017-05-30 15:15:00.530 [main] INFO dict.build.FastBuilder - extract words done: 61000
2017-05-30 15:15:00.537 [main] INFO dict.build.FastBuilder - extract words done: 62000
2017-05-30 15:15:00.541 [main] INFO dict.build.FastBuilder - extract words done: 63000
2017-05-30 15:15:00.545 [main] INFO dict.build.FastBuilder - extract words done: 64000
2017-05-30 15:15:00.549 [main] INFO dict.build.FastBuilder - extract words done: 65000
2017-05-30 15:15:00.553 [main] INFO dict.build.FastBuilder - extract words done: 66000
2017-05-30 15:15:00.557 [main] INFO dict.build.FastBuilder - extract words done: 67000
2017-05-30 15:15:00.561 [main] INFO dict.build.FastBuilder - extract words done: 68000
2017-05-30 15:15:00.565 [main] INFO dict.build.FastBuilder - extract words done: 69000
2017-05-30 15:15:00.569 [main] INFO dict.build.FastBuilder - extract words done: 70000
2017-05-30 15:15:00.573 [main] INFO dict.build.FastBuilder - extract words done: 71000
2017-05-30 15:15:00.577 [main] INFO dict.build.FastBuilder - extract words done: 72000
2017-05-30 15:15:00.581 [main] INFO dict.build.FastBuilder - extract words done: 73000
2017-05-30 15:15:00.585 [main] INFO dict.build.FastBuilder - extract words done: 74000
2017-05-30 15:15:00.590 [main] INFO dict.build.FastBuilder - extract words done: 75000
2017-05-30 15:15:00.594 [main] INFO dict.build.FastBuilder - extract words done: 76000
2017-05-30 15:15:00.598 [main] INFO dict.build.FastBuilder - extract words done: 77000
2017-05-30 15:15:00.602 [main] INFO dict.build.FastBuilder - extract words done: 78000
2017-05-30 15:15:00.606 [main] INFO dict.build.FastBuilder - extract words done: 79000
2017-05-30 15:15:00.610 [main] INFO dict.build.FastBuilder - extract words done: 80000
2017-05-30 15:15:00.614 [main] INFO dict.build.FastBuilder - extract words done: 81000
2017-05-30 15:15:00.618 [main] INFO dict.build.FastBuilder - extract words done: 82000
2017-05-30 15:15:00.622 [main] INFO dict.build.FastBuilder - extract words done: 83000
2017-05-30 15:15:00.626 [main] INFO dict.build.FastBuilder - extract words done: 84000
2017-05-30 15:15:00.630 [main] INFO dict.build.FastBuilder - extract words done: 85000
2017-05-30 15:15:00.635 [main] INFO dict.build.FastBuilder - extract words done: 86000
2017-05-30 15:15:00.639 [main] INFO dict.build.FastBuilder - extract words done: 87000
2017-05-30 15:15:00.643 [main] INFO dict.build.FastBuilder - extract words done: 88000
2017-05-30 15:15:00.646 [main] INFO dict.build.FastBuilder - extract words done: 89000
2017-05-30 15:15:00.651 [main] INFO dict.build.FastBuilder - extract words done: 90000
2017-05-30 15:15:00.654 [main] INFO dict.build.FastBuilder - extract words done: 91000
2017-05-30 15:15:00.658 [main] INFO dict.build.FastBuilder - extract words done: 92000
2017-05-30 15:15:00.662 [main] INFO dict.build.FastBuilder - extract words done: 93000
2017-05-30 15:15:00.666 [main] INFO dict.build.FastBuilder - extract words done: 94000
2017-05-30 15:15:00.671 [main] INFO dict.build.FastBuilder - extract words done: 95000
2017-05-30 15:15:00.675 [main] INFO dict.build.FastBuilder - extract words done: 96000
2017-05-30 15:15:00.678 [main] INFO dict.build.FastBuilder - extract words done: 97000
2017-05-30 15:15:00.682 [main] INFO dict.build.FastBuilder - extract words done: 98000
2017-05-30 15:15:00.686 [main] INFO dict.build.FastBuilder - extract words done: 99000
2017-05-30 15:15:00.690 [main] INFO dict.build.FastBuilder - extract words done: 100000
2017-05-30 15:15:00.694 [main] INFO dict.build.FastBuilder - extract words done: 101000
2017-05-30 15:15:00.698 [main] INFO dict.build.FastBuilder - extract words done: 102000
2017-05-30 15:15:00.702 [main] INFO dict.build.FastBuilder - extract words done: 103000
2017-05-30 15:15:00.706 [main] INFO dict.build.FastBuilder - extract words done: 104000
2017-05-30 15:15:00.710 [main] INFO dict.build.FastBuilder - extract words done: 105000
2017-05-30 15:15:00.716 [main] INFO dict.build.FastBuilder - extract words done: 106000
2017-05-30 15:15:00.722 [main] INFO dict.build.FastBuilder - extract words done: 107000
2017-05-30 15:15:00.729 [main] INFO dict.build.FastBuilder - extract words done: 108000
2017-05-30 15:15:00.734 [main] INFO dict.build.FastBuilder - extract words done: 109000
2017-05-30 15:15:00.740 [main] INFO dict.build.FastBuilder - extract words done: 110000
2017-05-30 15:15:00.745 [main] INFO dict.build.FastBuilder - extract words done: 111000
2017-05-30 15:15:00.750 [main] INFO dict.build.FastBuilder - extract words done: 112000
2017-05-30 15:15:00.756 [main] INFO dict.build.FastBuilder - extract words done: 113000
2017-05-30 15:15:00.762 [main] INFO dict.build.FastBuilder - extract words done: 114000
2017-05-30 15:15:00.767 [main] INFO dict.build.FastBuilder - extract words done: 115000
2017-05-30 15:15:00.773 [main] INFO dict.build.FastBuilder - extract words done: 116000
2017-05-30 15:15:00.779 [main] INFO dict.build.FastBuilder - extract words done: 117000
2017-05-30 15:15:00.784 [main] INFO dict.build.FastBuilder - extract words done: 118000
2017-05-30 15:15:00.790 [main] INFO dict.build.FastBuilder - extract words done: 119000
2017-05-30 15:15:00.797 [main] INFO dict.build.FastBuilder - extract words done: 120000
2017-05-30 15:15:00.803 [main] INFO dict.build.FastBuilder - extract words done: 121000
2017-05-30 15:15:00.828 [main] INFO dict.build.FastBuilder - extract words done: 122000
2017-05-30 15:15:00.836 [main] INFO dict.build.FastBuilder - extract words done: 123000
2017-05-30 15:15:00.842 [main] INFO dict.build.FastBuilder - extract words done: 124000
2017-05-30 15:15:00.848 [main] INFO dict.build.FastBuilder - extract words done: 125000
2017-05-30 15:15:00.854 [main] INFO dict.build.FastBuilder - extract words done: 126000
2017-05-30 15:15:00.860 [main] INFO dict.build.FastBuilder - extract words done: 127000
2017-05-30 15:15:00.866 [main] INFO dict.build.FastBuilder - extract words done: 128000
2017-05-30 15:15:00.872 [main] INFO dict.build.FastBuilder - extract words done: 129000
2017-05-30 15:15:00.878 [main] INFO dict.build.FastBuilder - extract words done: 130000
2017-05-30 15:15:00.884 [main] INFO dict.build.FastBuilder - extract words done: 131000
2017-05-30 15:15:00.889 [main] INFO dict.build.FastBuilder - extract words done: 132000
2017-05-30 15:15:00.895 [main] INFO dict.build.FastBuilder - extract words done: 133000
2017-05-30 15:15:00.900 [main] INFO dict.build.FastBuilder - extract words done: 134000
2017-05-30 15:15:00.905 [main] INFO dict.build.FastBuilder - extract words done: 135000
2017-05-30 15:15:00.912 [main] INFO dict.build.FastBuilder - extract words done: 136000
2017-05-30 15:15:00.919 [main] INFO dict.build.FastBuilder - extract words done: 137000
2017-05-30 15:15:00.925 [main] INFO dict.build.FastBuilder - extract words done: 138000
2017-05-30 15:15:00.931 [main] INFO dict.build.FastBuilder - extract words done: 139000
2017-05-30 15:15:00.936 [main] INFO dict.build.FastBuilder - extract words done: 140000
2017-05-30 15:15:00.942 [main] INFO dict.build.FastBuilder - extract words done: 141000
2017-05-30 15:15:00.949 [main] INFO dict.build.FastBuilder - extract words done: 142000
2017-05-30 15:15:00.954 [main] INFO dict.build.FastBuilder - extract words done: 143000
2017-05-30 15:15:00.960 [main] INFO dict.build.FastBuilder - extract words done: 144000
2017-05-30 15:15:00.966 [main] INFO dict.build.FastBuilder - extract words done: 145000
2017-05-30 15:15:00.971 [main] INFO dict.build.FastBuilder - extract words done: 146000
2017-05-30 15:15:00.977 [main] INFO dict.build.FastBuilder - extract words done: 147000
2017-05-30 15:15:00.982 [main] INFO dict.build.FastBuilder - extract words done: 148000
2017-05-30 15:15:00.988 [main] INFO dict.build.FastBuilder - extract words done: 149000
2017-05-30 15:15:00.993 [main] INFO dict.build.FastBuilder - extract words done: 150000
2017-05-30 15:15:00.998 [main] INFO dict.build.FastBuilder - extract words done: 151000
2017-05-30 15:15:01.004 [main] INFO dict.build.FastBuilder - extract words done: 152000
2017-05-30 15:15:01.010 [main] INFO dict.build.FastBuilder - extract words done: 153000
2017-05-30 15:15:01.016 [main] INFO dict.build.FastBuilder - extract words done: 154000
2017-05-30 15:15:01.022 [main] INFO dict.build.FastBuilder - extract words done: 155000
2017-05-30 15:15:01.028 [main] INFO dict.build.FastBuilder - extract words done: 156000
2017-05-30 15:15:01.033 [main] INFO dict.build.FastBuilder - extract words done: 157000
2017-05-30 15:15:01.040 [main] INFO dict.build.FastBuilder - extract words done: 158000
2017-05-30 15:15:01.046 [main] INFO dict.build.FastBuilder - extract words done: 159000
2017-05-30 15:15:01.051 [main] INFO dict.build.FastBuilder - extract words done: 160000
2017-05-30 15:15:01.057 [main] INFO dict.build.FastBuilder - extract words done: 161000
2017-05-30 15:15:01.063 [main] INFO dict.build.FastBuilder - extract words done: 162000
2017-05-30 15:15:01.070 [main] INFO dict.build.FastBuilder - extract words done: 163000
2017-05-30 15:15:01.076 [main] INFO dict.build.FastBuilder - extract words done: 164000
2017-05-30 15:15:01.082 [main] INFO dict.build.FastBuilder - extract words done: 165000
2017-05-30 15:15:01.089 [main] INFO dict.build.FastBuilder - extract words done: 166000
2017-05-30 15:15:01.096 [main] INFO dict.build.FastBuilder - extract words done: 167000
2017-05-30 15:15:01.101 [main] INFO dict.build.FastBuilder - extract words done: 168000
2017-05-30 15:15:01.106 [main] INFO dict.build.FastBuilder - extract words done: 169000
2017-05-30 15:15:01.112 [main] INFO dict.build.FastBuilder - extract words done: 170000
2017-05-30 15:15:01.117 [main] INFO dict.build.FastBuilder - extract words done: 171000
2017-05-30 15:15:01.123 [main] INFO dict.build.FastBuilder - extract words done: 172000
2017-05-30 15:15:01.129 [main] INFO dict.build.FastBuilder - extract words done: 173000
2017-05-30 15:15:01.135 [main] INFO dict.build.FastBuilder - extract words done: 174000
2017-05-30 15:15:01.138 [main] INFO dict.build.FastBuilder - extract words done: 175000
2017-05-30 15:15:01.143 [main] INFO dict.build.FastBuilder - extract words done: 176000
2017-05-30 15:15:01.147 [main] INFO dict.build.FastBuilder - extract words done: 177000
2017-05-30 15:15:01.151 [main] INFO dict.build.FastBuilder - extract words done: 178000
2017-05-30 15:15:01.155 [main] INFO dict.build.FastBuilder - extract words done: 179000
2017-05-30 15:15:01.159 [main] INFO dict.build.FastBuilder - extract words done: 180000
2017-05-30 15:15:01.163 [main] INFO dict.build.FastBuilder - extract words done: 181000
2017-05-30 15:15:01.166 [main] INFO dict.build.FastBuilder - extract words done: 182000
2017-05-30 15:15:01.170 [main] INFO dict.build.FastBuilder - extract words done: 183000
2017-05-30 15:15:01.174 [main] INFO dict.build.FastBuilder - extract words done: 184000
2017-05-30 15:15:01.177 [main] INFO dict.build.FastBuilder - extract words done: 185000
2017-05-30 15:15:01.181 [main] INFO dict.build.FastBuilder - extract words done: 186000
2017-05-30 15:15:01.184 [main] INFO dict.build.FastBuilder - extract words done: 187000
2017-05-30 15:15:01.189 [main] INFO dict.build.FastBuilder - extract words done: 188000
2017-05-30 15:15:01.193 [main] INFO dict.build.FastBuilder - extract words done: 189000
2017-05-30 15:15:01.196 [main] INFO dict.build.FastBuilder - extract words done: 190000
2017-05-30 15:15:01.200 [main] INFO dict.build.FastBuilder - extract words done: 191000
2017-05-30 15:15:01.203 [main] INFO dict.build.FastBuilder - extract words done: 192000
2017-05-30 15:15:01.207 [main] INFO dict.build.FastBuilder - extract words done: 193000
2017-05-30 15:15:01.210 [main] INFO dict.build.FastBuilder - extract words done: 194000
2017-05-30 15:15:01.214 [main] INFO dict.build.FastBuilder - extract words done: 195000
2017-05-30 15:15:01.218 [main] INFO dict.build.FastBuilder - extract words done: 196000
2017-05-30 15:15:01.222 [main] INFO dict.build.FastBuilder - extract words done: 197000
2017-05-30 15:15:01.226 [main] INFO dict.build.FastBuilder - extract words done: 198000
2017-05-30 15:15:01.229 [main] INFO dict.build.FastBuilder - extract words done: 199000
2017-05-30 15:15:01.233 [main] INFO dict.build.FastBuilder - extract words done: 200000
2017-05-30 15:15:01.236 [main] INFO dict.build.FastBuilder - extract words done: 201000
2017-05-30 15:15:01.240 [main] INFO dict.build.FastBuilder - extract words done: 202000
2017-05-30 15:15:01.243 [main] INFO dict.build.FastBuilder - extract words done: 203000
2017-05-30 15:15:01.247 [main] INFO dict.build.FastBuilder - extract words done: 204000
2017-05-30 15:15:01.252 [main] INFO dict.build.FastBuilder - extract words done: 205000
2017-05-30 15:15:01.256 [main] INFO dict.build.FastBuilder - extract words done: 206000
2017-05-30 15:15:01.259 [main] INFO dict.build.FastBuilder - extract words done: 207000
2017-05-30 15:15:01.263 [main] INFO dict.build.FastBuilder - extract words done: 208000
2017-05-30 15:15:01.267 [main] INFO dict.build.FastBuilder - extract words done: 209000
2017-05-30 15:15:01.271 [main] INFO dict.build.FastBuilder - extract words done: 210000
2017-05-30 15:15:01.274 [main] INFO dict.build.FastBuilder - extract words done: 211000
2017-05-30 15:15:01.279 [main] INFO dict.build.FastBuilder - extract words done: 212000
2017-05-30 15:15:01.284 [main] INFO dict.build.FastBuilder - extract words done: 213000
2017-05-30 15:15:01.288 [main] INFO dict.build.FastBuilder - extract words done: 214000
2017-05-30 15:15:01.291 [main] INFO dict.build.FastBuilder - extract words done: 215000
2017-05-30 15:15:01.295 [main] INFO dict.build.FastBuilder - extract words done: 216000
2017-05-30 15:15:01.299 [main] INFO dict.build.FastBuilder - extract words done: 217000
2017-05-30 15:15:01.303 [main] INFO dict.build.FastBuilder - extract words done: 218000
2017-05-30 15:15:01.306 [main] INFO dict.build.FastBuilder - extract words done: 219000
2017-05-30 15:15:01.311 [main] INFO dict.build.FastBuilder - extract words done: 220000
2017-05-30 15:15:01.316 [main] INFO dict.build.FastBuilder - extract words done: 221000
2017-05-30 15:15:01.320 [main] INFO dict.build.FastBuilder - extract words done: 222000
2017-05-30 15:15:01.324 [main] INFO dict.build.FastBuilder - extract words done: 223000
2017-05-30 15:15:01.328 [main] INFO dict.build.FastBuilder - extract words done: 224000
2017-05-30 15:15:01.331 [main] INFO dict.build.FastBuilder - extract words done: 225000
2017-05-30 15:15:01.335 [main] INFO dict.build.FastBuilder - extract words done: 226000
2017-05-30 15:15:01.341 [main] INFO dict.build.FastBuilder - extract words done: 227000
2017-05-30 15:15:01.346 [main] INFO dict.build.FastBuilder - extract words done: 228000
2017-05-30 15:15:01.350 [main] INFO dict.build.FastBuilder - extract words done: 229000
2017-05-30 15:15:01.355 [main] INFO dict.build.FastBuilder - extract words done: 230000
2017-05-30 15:15:01.359 [main] INFO dict.build.FastBuilder - extract words done: 231000
2017-05-30 15:15:01.363 [main] INFO dict.build.FastBuilder - extract words done: 232000
2017-05-30 15:15:01.366 [main] INFO dict.build.FastBuilder - extract words done: 233000
2017-05-30 15:15:01.371 [main] INFO dict.build.FastBuilder - extract words done: 234000
2017-05-30 15:15:01.375 [main] INFO dict.build.FastBuilder - extract words done: 235000
2017-05-30 15:15:01.378 [main] INFO dict.build.FastBuilder - extract words done: 236000
2017-05-30 15:15:01.381 [main] INFO dict.build.FastBuilder - extract words done: 237000
2017-05-30 15:15:01.385 [main] INFO dict.build.FastBuilder - extract words done: 238000
2017-05-30 15:15:01.389 [main] INFO dict.build.FastBuilder - extract words done: 239000
2017-05-30 15:15:01.393 [main] INFO dict.build.FastBuilder - extract words done: 240000
2017-05-30 15:15:01.397 [main] INFO dict.build.FastBuilder - extract words done: 241000
2017-05-30 15:15:01.401 [main] INFO dict.build.FastBuilder - extract words done: 242000
2017-05-30 15:15:01.406 [main] INFO dict.build.FastBuilder - extract words done: 243000
2017-05-30 15:15:01.409 [main] INFO dict.build.FastBuilder - extract words done: 244000
2017-05-30 15:15:01.413 [main] INFO dict.build.FastBuilder - extract words done: 245000
2017-05-30 15:15:01.416 [main] INFO dict.build.FastBuilder - extract words done: 246000
2017-05-30 15:15:01.419 [main] INFO dict.build.FastBuilder - extract words done: 247000
2017-05-30 15:15:01.422 [main] INFO dict.build.FastBuilder - extract words done: 248000
2017-05-30 15:15:01.425 [main] INFO dict.build.FastBuilder - extract words done: 249000
2017-05-30 15:15:01.429 [main] INFO dict.build.FastBuilder - extract words done: 250000
2017-05-30 15:15:01.434 [main] INFO dict.build.FastBuilder - extract words done: 251000
2017-05-30 15:15:01.437 [main] INFO dict.build.FastBuilder - extract words done: 252000
2017-05-30 15:15:01.441 [main] INFO dict.build.FastBuilder - extract words done: 253000
2017-05-30 15:15:01.444 [main] INFO dict.build.FastBuilder - extract words done: 254000
2017-05-30 15:15:01.447 [main] INFO dict.build.FastBuilder - extract words done: 255000
2017-05-30 15:15:01.451 [main] INFO dict.build.FastBuilder - extract words done: 256000
2017-05-30 15:15:01.454 [main] INFO dict.build.FastBuilder - extract words done: 257000
2017-05-30 15:15:01.458 [main] INFO dict.build.FastBuilder - extract words done: 258000
2017-05-30 15:15:01.463 [main] INFO dict.build.FastBuilder - extract words done: 259000
2017-05-30 15:15:01.467 [main] INFO dict.build.FastBuilder - extract words done: 260000
2017-05-30 15:15:01.471 [main] INFO dict.build.FastBuilder - extract words done: 261000
2017-05-30 15:15:01.474 [main] INFO dict.build.FastBuilder - extract words done: 262000
2017-05-30 15:15:01.477 [main] INFO dict.build.FastBuilder - extract words done: 263000
2017-05-30 15:15:01.481 [main] INFO dict.build.FastBuilder - extract words done: 264000
2017-05-30 15:15:01.486 [main] INFO dict.build.FastBuilder - extract words done: 265000
2017-05-30 15:15:01.490 [main] INFO dict.build.FastBuilder - extract words done: 266000
2017-05-30 15:15:01.496 [main] INFO dict.build.FastBuilder - extract words done: 267000
2017-05-30 15:15:01.500 [main] INFO dict.build.FastBuilder - extract words done: 268000
2017-05-30 15:15:01.503 [main] INFO dict.build.FastBuilder - extract words done: 269000
2017-05-30 15:15:01.507 [main] INFO dict.build.FastBuilder - extract words done: 270000
2017-05-30 15:15:01.510 [main] INFO dict.build.FastBuilder - extract words done: 271000
2017-05-30 15:15:01.514 [main] INFO dict.build.FastBuilder - extract words done: 272000
2017-05-30 15:15:01.517 [main] INFO dict.build.FastBuilder - extract words done: 273000
2017-05-30 15:15:01.521 [main] INFO dict.build.FastBuilder - extract words done: 274000
2017-05-30 15:15:01.524 [main] INFO dict.build.FastBuilder - extract words done: 275000
2017-05-30 15:15:01.528 [main] INFO dict.build.FastBuilder - extract words done: 276000
2017-05-30 15:15:01.532 [main] INFO dict.build.FastBuilder - extract words done: 277000
2017-05-30 15:15:01.535 [main] INFO dict.build.FastBuilder - extract words done: 278000
2017-05-30 15:15:01.538 [main] INFO dict.build.FastBuilder - extract words done: 279000
2017-05-30 15:15:01.542 [main] INFO dict.build.FastBuilder - extract words done: 280000
2017-05-30 15:15:01.545 [main] INFO dict.build.FastBuilder - extract words done: 281000
2017-05-30 15:15:01.549 [main] INFO dict.build.FastBuilder - extract words done: 282000
2017-05-30 15:15:01.552 [main] INFO dict.build.FastBuilder - extract words done: 283000
2017-05-30 15:15:01.556 [main] INFO dict.build.FastBuilder - extract words done: 284000
2017-05-30 15:15:01.559 [main] INFO dict.build.FastBuilder - extract words done: 285000
2017-05-30 15:15:01.563 [main] INFO dict.build.FastBuilder - extract words done: 286000
2017-05-30 15:15:01.566 [main] INFO dict.build.FastBuilder - extract words done: 287000
2017-05-30 15:15:01.569 [main] INFO dict.build.FastBuilder - extract words done: 288000
2017-05-30 15:15:01.572 [main] INFO dict.build.FastBuilder - extract words done: 289000
2017-05-30 15:15:01.576 [main] INFO dict.build.FastBuilder - extract words done: 290000
2017-05-30 15:15:01.579 [main] INFO dict.build.FastBuilder - extract words done: 291000
2017-05-30 15:15:01.583 [main] INFO dict.build.FastBuilder - extract words done: 292000
2017-05-30 15:15:01.586 [main] INFO dict.build.FastBuilder - extract words done: 293000
2017-05-30 15:15:01.590 [main] INFO dict.build.FastBuilder - extract words done: 294000
2017-05-30 15:15:01.594 [main] INFO dict.build.FastBuilder - extract words done: 295000
2017-05-30 15:15:01.597 [main] INFO dict.build.FastBuilder - extract words done: 296000
2017-05-30 15:15:01.600 [main] INFO dict.build.FastBuilder - extract words done: 297000
2017-05-30 15:15:01.604 [main] INFO dict.build.FastBuilder - extract words done: 298000
2017-05-30 15:15:01.607 [main] INFO dict.build.FastBuilder - extract words done: 299000
2017-05-30 15:15:01.610 [main] INFO dict.build.FastBuilder - extract words done: 300000
2017-05-30 15:15:01.614 [main] INFO dict.build.FastBuilder - extract words done: 301000
2017-05-30 15:15:01.618 [main] INFO dict.build.FastBuilder - extract words done: 302000
2017-05-30 15:15:01.621 [main] INFO dict.build.FastBuilder - extract words done: 303000
2017-05-30 15:15:01.624 [main] INFO dict.build.FastBuilder - extract words done: 304000
2017-05-30 15:15:01.627 [main] INFO dict.build.FastBuilder - extract words done: 305000
2017-05-30 15:15:01.630 [main] INFO dict.build.FastBuilder - extract words done: 306000
2017-05-30 15:15:01.634 [main] INFO dict.build.FastBuilder - extract words done: 307000
2017-05-30 15:15:01.637 [main] INFO dict.build.FastBuilder - extract words done: 308000
2017-05-30 15:15:01.641 [main] INFO dict.build.FastBuilder - extract words done: 309000
2017-05-30 15:15:01.645 [main] INFO dict.build.FastBuilder - extract words done: 310000
2017-05-30 15:15:01.650 [main] INFO dict.build.FastBuilder - extract words done: 311000
2017-05-30 15:15:01.655 [main] INFO dict.build.FastBuilder - extract words done: 312000
2017-05-30 15:15:01.659 [main] INFO dict.build.FastBuilder - extract words done: 313000
2017-05-30 15:15:01.664 [main] INFO dict.build.FastBuilder - extract words done: 314000
2017-05-30 15:15:01.668 [main] INFO dict.build.FastBuilder - extract words done: 315000
2017-05-30 15:15:01.673 [main] INFO dict.build.FastBuilder - extract words done: 316000
2017-05-30 15:15:01.678 [main] INFO dict.build.FastBuilder - extract words done: 317000
2017-05-30 15:15:01.682 [main] INFO dict.build.FastBuilder - extract words done: 318000
2017-05-30 15:15:01.687 [main] INFO dict.build.FastBuilder - extract words done: 319000
2017-05-30 15:15:01.692 [main] INFO dict.build.FastBuilder - extract words done: 320000
2017-05-30 15:15:01.697 [main] INFO dict.build.FastBuilder - extract words done: 321000
2017-05-30 15:15:01.701 [main] INFO dict.build.FastBuilder - extract words done: 322000
2017-05-30 15:15:01.705 [main] INFO dict.build.FastBuilder - extract words done: 323000
2017-05-30 15:15:01.711 [main] INFO dict.build.FastBuilder - extract words done: 324000
2017-05-30 15:15:01.716 [main] INFO dict.build.FastBuilder - extract words done: 325000
2017-05-30 15:15:01.721 [main] INFO dict.build.FastBuilder - extract words done: 326000
2017-05-30 15:15:01.727 [main] INFO dict.build.FastBuilder - extract words done: 327000
2017-05-30 15:15:01.732 [main] INFO dict.build.FastBuilder - extract words done: 328000
2017-05-30 15:15:01.739 [main] INFO dict.build.FastBuilder - extract words done: 329000
2017-05-30 15:15:01.744 [main] INFO dict.build.FastBuilder - extract words done: 330000
2017-05-30 15:15:01.749 [main] INFO dict.build.FastBuilder - extract words done: 331000
2017-05-30 15:15:01.753 [main] INFO dict.build.FastBuilder - extract words done: 332000
2017-05-30 15:15:01.758 [main] INFO dict.build.FastBuilder - extract words done: 333000
2017-05-30 15:15:01.763 [main] INFO dict.build.FastBuilder - extract words done: 334000
2017-05-30 15:15:01.769 [main] INFO dict.build.FastBuilder - extract words done: 335000
2017-05-30 15:15:01.773 [main] INFO dict.build.FastBuilder - extract words done: 336000
2017-05-30 15:15:01.778 [main] INFO dict.build.FastBuilder - extract words done: 337000
2017-05-30 15:15:01.782 [main] INFO dict.build.FastBuilder - extract words done: 338000
2017-05-30 15:15:01.786 [main] INFO dict.build.FastBuilder - extract words done: 339000
2017-05-30 15:15:01.791 [main] INFO dict.build.FastBuilder - extract words done: 340000
2017-05-30 15:15:01.795 [main] INFO dict.build.FastBuilder - extract words done: 341000
2017-05-30 15:15:01.800 [main] INFO dict.build.FastBuilder - extract words done: 342000
2017-05-30 15:15:01.805 [main] INFO dict.build.FastBuilder - extract words done: 343000
2017-05-30 15:15:01.810 [main] INFO dict.build.FastBuilder - extract words done: 344000
2017-05-30 15:15:01.815 [main] INFO dict.build.FastBuilder - extract words done: 345000
2017-05-30 15:15:01.819 [main] INFO dict.build.FastBuilder - extract words done: 346000
2017-05-30 15:15:01.824 [main] INFO dict.build.FastBuilder - extract words done: 347000
2017-05-30 15:15:01.828 [main] INFO dict.build.FastBuilder - extract words done: 348000
2017-05-30 15:15:01.833 [main] INFO dict.build.FastBuilder - extract words done: 349000
2017-05-30 15:15:01.838 [main] INFO dict.build.FastBuilder - extract words done: 350000
2017-05-30 15:15:01.843 [main] INFO dict.build.FastBuilder - extract words done: 351000
2017-05-30 15:15:01.847 [main] INFO dict.build.FastBuilder - extract words done: 352000
2017-05-30 15:15:01.852 [main] INFO dict.build.FastBuilder - extract words done: 353000
2017-05-30 15:15:01.856 [main] INFO dict.build.FastBuilder - extract words done: 354000
2017-05-30 15:15:01.861 [main] INFO dict.build.FastBuilder - extract words done: 355000
2017-05-30 15:15:01.866 [main] INFO dict.build.FastBuilder - extract words done: 356000
2017-05-30 15:15:01.871 [main] INFO dict.build.FastBuilder - extract words done: 357000
2017-05-30 15:15:01.876 [main] INFO dict.build.FastBuilder - extract words done: 358000
2017-05-30 15:15:01.880 [main] INFO dict.build.FastBuilder - extract words done: 359000
2017-05-30 15:15:01.884 [main] INFO dict.build.FastBuilder - extract words done: 360000
2017-05-30 15:15:01.890 [main] INFO dict.build.FastBuilder - extract words done: 361000
2017-05-30 15:15:01.896 [main] INFO dict.build.FastBuilder - extract words done: 362000
2017-05-30 15:15:01.900 [main] INFO dict.build.FastBuilder - extract words done: 363000
2017-05-30 15:15:01.904 [main] INFO dict.build.FastBuilder - extract words done: 364000
2017-05-30 15:15:01.909 [main] INFO dict.build.FastBuilder - extract words done: 365000
2017-05-30 15:15:01.914 [main] INFO dict.build.FastBuilder - extract words done: 366000
2017-05-30 15:15:01.918 [main] INFO dict.build.FastBuilder - extract words done: 367000
2017-05-30 15:15:01.923 [main] INFO dict.build.FastBuilder - extract words done: 368000
2017-05-30 15:15:01.928 [main] INFO dict.build.FastBuilder - extract words done: 369000
2017-05-30 15:15:01.932 [main] INFO dict.build.FastBuilder - extract words done: 370000
2017-05-30 15:15:01.936 [main] INFO dict.build.FastBuilder - extract words done: 371000
2017-05-30 15:15:01.940 [main] INFO dict.build.FastBuilder - extract words done: 372000
2017-05-30 15:15:01.945 [main] INFO dict.build.FastBuilder - extract words done: 373000
2017-05-30 15:15:01.949 [main] INFO dict.build.FastBuilder - extract words done: 374000
2017-05-30 15:15:01.953 [main] INFO dict.build.FastBuilder - extract words done: 375000
2017-05-30 15:15:01.958 [main] INFO dict.build.FastBuilder - extract words done: 376000
2017-05-30 15:15:01.963 [main] INFO dict.build.FastBuilder - extract words done: 377000
2017-05-30 15:15:01.967 [main] INFO dict.build.FastBuilder - extract words done: 378000
2017-05-30 15:15:01.971 [main] INFO dict.build.FastBuilder - extract words done: 379000
2017-05-30 15:15:01.976 [main] INFO dict.build.FastBuilder - extract words done: 380000
2017-05-30 15:15:01.982 [main] INFO dict.build.FastBuilder - extract words done: 381000
2017-05-30 15:15:01.987 [main] INFO dict.build.FastBuilder - extract words done: 382000
2017-05-30 15:15:01.991 [main] INFO dict.build.FastBuilder - extract words done: 383000
2017-05-30 15:15:01.996 [main] INFO dict.build.FastBuilder - extract words done: 384000
2017-05-30 15:15:02.001 [main] INFO dict.build.FastBuilder - extract words done: 385000
2017-05-30 15:15:02.006 [main] INFO dict.build.FastBuilder - extract words done: 386000
2017-05-30 15:15:02.010 [main] INFO dict.build.FastBuilder - extract words done: 387000
2017-05-30 15:15:02.016 [main] INFO dict.build.FastBuilder - extract words done: 388000
2017-05-30 15:15:02.020 [main] INFO dict.build.FastBuilder - extract words done: 389000
2017-05-30 15:15:02.024 [main] INFO dict.build.FastBuilder - extract words done: 390000
2017-05-30 15:15:02.029 [main] INFO dict.build.FastBuilder - extract words done: 391000
2017-05-30 15:15:02.034 [main] INFO dict.build.FastBuilder - extract words done: 392000
2017-05-30 15:15:02.038 [main] INFO dict.build.FastBuilder - extract words done: 393000
2017-05-30 15:15:02.043 [main] INFO dict.build.FastBuilder - extract words done: 394000
2017-05-30 15:15:02.048 [main] INFO dict.build.FastBuilder - extract words done: 395000
2017-05-30 15:15:02.053 [main] INFO dict.build.FastBuilder - extract words done: 396000
2017-05-30 15:15:02.057 [main] INFO dict.build.FastBuilder - extract words done: 397000
2017-05-30 15:15:02.061 [main] INFO dict.build.FastBuilder - extract words done: 398000
2017-05-30 15:15:02.066 [main] INFO dict.build.FastBuilder - extract words done: 399000
2017-05-30 15:15:02.071 [main] INFO dict.build.FastBuilder - extract words done: 400000
2017-05-30 15:15:02.076 [main] INFO dict.build.FastBuilder - extract words done: 401000
2017-05-30 15:15:02.080 [main] INFO dict.build.FastBuilder - extract words done: 402000
2017-05-30 15:15:02.084 [main] INFO dict.build.FastBuilder - extract words done: 403000
2017-05-30 15:15:02.089 [main] INFO dict.build.FastBuilder - extract words done: 404000
2017-05-30 15:15:02.094 [main] INFO dict.build.FastBuilder - extract words done: 405000
2017-05-30 15:15:02.099 [main] INFO dict.build.FastBuilder - extract words done: 406000
2017-05-30 15:15:02.104 [main] INFO dict.build.FastBuilder - extract words done: 407000
2017-05-30 15:15:02.108 [main] INFO dict.build.FastBuilder - extract words done: 408000
2017-05-30 15:15:02.113 [main] INFO dict.build.FastBuilder - extract words done: 409000
2017-05-30 15:15:02.118 [main] INFO dict.build.FastBuilder - extract words done: 410000
2017-05-30 15:15:02.122 [main] INFO dict.build.FastBuilder - extract words done: 411000
2017-05-30 15:15:02.127 [main] INFO dict.build.FastBuilder - extract words done: 412000
2017-05-30 15:15:02.133 [main] INFO dict.build.FastBuilder - extract words done: 413000
2017-05-30 15:15:02.138 [main] INFO dict.build.FastBuilder - extract words done: 414000
2017-05-30 15:15:02.144 [main] INFO dict.build.FastBuilder - extract words done: 415000
2017-05-30 15:15:02.148 [main] INFO dict.build.FastBuilder - extract words done: 416000
2017-05-30 15:15:02.152 [main] INFO dict.build.FastBuilder - extract words done: 417000
2017-05-30 15:15:02.157 [main] INFO dict.build.FastBuilder - extract words done: 418000
2017-05-30 15:15:02.162 [main] INFO dict.build.FastBuilder - extract words done: 419000
2017-05-30 15:15:02.168 [main] INFO dict.build.FastBuilder - extract words done: 420000
2017-05-30 15:15:02.172 [main] INFO dict.build.FastBuilder - extract words done: 421000
2017-05-30 15:15:02.175 [main] INFO dict.build.FastBuilder - extract words done: 422000
2017-05-30 15:15:02.178 [main] INFO dict.build.FastBuilder - extract words done: 423000
2017-05-30 15:15:02.181 [main] INFO dict.build.FastBuilder - extract words done: 424000
2017-05-30 15:15:02.185 [main] INFO dict.build.FastBuilder - extract words done: 425000
2017-05-30 15:15:02.188 [main] INFO dict.build.FastBuilder - extract words done: 426000
2017-05-30 15:15:02.191 [main] INFO dict.build.FastBuilder - extract words done: 427000
2017-05-30 15:15:02.195 [main] INFO dict.build.FastBuilder - extract words done: 428000
2017-05-30 15:15:02.198 [main] INFO dict.build.FastBuilder - extract words done: 429000
2017-05-30 15:15:02.202 [main] INFO dict.build.FastBuilder - extract words done: 430000
2017-05-30 15:15:02.205 [main] INFO dict.build.FastBuilder - extract words done: 431000
2017-05-30 15:15:02.208 [main] INFO dict.build.FastBuilder - extract words done: 432000
2017-05-30 15:15:02.212 [main] INFO dict.build.FastBuilder - extract words done: 433000
2017-05-30 15:15:02.215 [main] INFO dict.build.FastBuilder - extract words done: 434000
2017-05-30 15:15:02.219 [main] INFO dict.build.FastBuilder - extract words done: 435000
2017-05-30 15:15:02.222 [main] INFO dict.build.FastBuilder - extract words done: 436000
2017-05-30 15:15:02.225 [main] INFO dict.build.FastBuilder - extract words done: 437000
2017-05-30 15:15:02.229 [main] INFO dict.build.FastBuilder - extract words done: 438000
2017-05-30 15:15:02.232 [main] INFO dict.build.FastBuilder - extract words done: 439000
2017-05-30 15:15:02.236 [main] INFO dict.build.FastBuilder - extract words done: 440000
2017-05-30 15:15:02.239 [main] INFO dict.build.FastBuilder - extract words done: 441000
2017-05-30 15:15:02.242 [main] INFO dict.build.FastBuilder - extract words done: 442000
2017-05-30 15:15:02.245 [main] INFO dict.build.FastBuilder - extract words done: 443000
2017-05-30 15:15:02.248 [main] INFO dict.build.FastBuilder - start to sort extracted words
2017-05-30 15:15:02.277 [main] INFO dict.build.FastBuilder - all done

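Extraction results do not quite match the example

Testing with 《金瓶梅》 (Jin Ping Mei), I found that "西门庆" is not the most frequent word in the actual results. In fact, "西门庆" does not appear on its own at all;
instead, entries such as "见西门庆", "向西门庆", and "西门庆进" appear.
Could the author offer some guidance when time permits?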
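One way to narrow this down is to list every entry in the output that contains the expected word. The sketch below is a minimal diagnostic, not part of dict_build. It assumes words_sort.data is whitespace-separated with the word in the first column and the frequency in the second, as the README describes. The function name `find_entries` and the sample rows are hypothetical and exist only for illustration.

```python
from typing import Iterable, List, Tuple


def find_entries(lines: Iterable[str], target: str) -> List[Tuple[str, int]]:
    """Return (word, frequency) pairs whose word contains `target`,
    sorted by frequency in descending order."""
    hits = []
    for line in lines:
        fields = line.split()  # split on any run of whitespace
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        word, freq = fields[0], fields[1]
        if target in word:
            hits.append((word, int(freq)))
    hits.sort(key=lambda pair: pair[1], reverse=True)
    return hits


# Made-up rows in the assumed column layout; these are NOT real output.
sample = [
    "见西门庆\t120\t5.1\t2.0\t0.2",
    "月娘\t1829\t6.4\t2.3\t0.22",
    "西门庆进\t45\t4.9\t1.8\t0.3",
]

for word, freq in find_entries(sample, "西门庆"):
    print(word, freq)
```

To run it against a real result file, pass an open file handle instead of `sample`, for example `open("words_sort.data", encoding="utf-8")`. If the bare word never shows up while longer strings containing it do, that confirms the symptom described above.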

words_sort.data has no results

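When I run the program from the command line, words_sort.data is empty (see the attached screenshot).

However, when I clone the code with git and run it in an IDE, words_sort.data does contain results. The data file is the same in both cases.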
