Chinese Word Segmentation based on Bidirectional GRU-CRF Model
Che Jinli,Tang Liwei,Deng Shijie,Su Xujun
Table 1 Values of the hyper-parameters
Hyper-parameters Values
Character embedding dimension d=200
Dimension of hidden layer h =128
Dropout rate p =0.2
Initial learning rate lr=0.002