- According to the parameters listed in
utils/data.py, label_dim is equal to HP_hidden_dim, which is set to 200.
- The output of the last LAN layer is the concatenation
[HP_hidden_dim, label_dim], so its size is 2*HP_hidden_dim.
- After this, if we set
use_crf = False, does the concatenated size of the last layer equal the label_alpha size?
Or, if we set use_crf = True, how do we turn the output tensor of size (batch, seq_len, HP_hidden_dim) into the emission probabilities?
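
To make the shapes I am asking about concrete, here is a minimal NumPy sketch. It is not the repository's code: only the value 200 comes from utils/data.py, while batch, seq_len, num_labels, and the projection W, b are hypothetical names and sizes I made up for illustration. It assumes one possible approach, a linear projection from the concatenated features to one score per label.

```python
import numpy as np

# hidden_dim is the value quoted above; every other size is a made-up example.
batch, seq_len, hidden_dim, num_labels = 2, 5, 200, 7
label_dim = hidden_dim  # label_dim equals HP_hidden_dim

rng = np.random.default_rng(0)
hidden = rng.standard_normal((batch, seq_len, hidden_dim))    # stand-in for the BiLSTM output
label_ctx = rng.standard_normal((batch, seq_len, label_dim))  # stand-in for the label-attention output

# Concatenating along the feature axis gives 2 * hidden_dim features per token.
concat = np.concatenate([hidden, label_ctx], axis=-1)  # (batch, seq_len, 400)

# Hypothetical projection (an assumption, not taken from the repo):
# map each token's features to one score per label.
W = rng.standard_normal((2 * hidden_dim, num_labels)) * 0.01
b = np.zeros(num_labels)
emissions = concat @ W + b  # (batch, seq_len, num_labels)

# Without a CRF, the per-token prediction would be the highest-scoring label.
# With a CRF, `emissions` would be what gets passed to the CRF layer instead.
pred = emissions.argmax(axis=-1)  # (batch, seq_len)

print(concat.shape, emissions.shape, pred.shape)
```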
Could anyone give me some advice? Thanks!