How important is the grouped_topk?
#6
by
dzhulgakov
- opened
Hi,
I ran the model with just regular top_k softmax and got pretty sensible results. How important is it to include the hierchical grouped top_k?
dzhulgakov
changed discussion status to
closed