layer_type | N | M | Q | alpha | D | alpha-hat | log_SN | % Rand | num_traps | num_fingers | rank_loss | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
layer_id | ||||||||||||
2 | EMBEDDING | 30000 | 128 | 234.38 | 4.91 | 0.05 | 15.61 | 3.18 | 79.32 | 1 | 0 | 0 |
3 | EMBEDDING | 512 | 128 | 4.00 | 1.35 | 0.09 | 1.86 | 1.38 | 41.55 | 0 | 0 | 0 |
8 | DENSE | 4096 | 128 | 32.00 | 10.11 | 0.08 | 14.63 | 1.45 | 95.48 | 0 | 0 | 0 |
15 | DENSE | 4096 | 4096 | 1.00 | 3.09 | 0.01 | 8.09 | 2.61 | 83.29 | 0 | 0 | 2 |
16 | DENSE | 4096 | 4096 | 1.00 | 3.14 | 0.01 | 8.21 | 2.61 | 83.62 | 0 | 0 | 2 |
17 | DENSE | 4096 | 4096 | 1.00 | 3.09 | 0.07 | 5.76 | 1.87 | 88.84 | 0 | 1 | 2 |
20 | DENSE | 4096 | 4096 | 1.00 | 3.28 | 0.02 | 7.88 | 2.40 | 85.85 | 0 | 0 | 2 |
22 | DENSE | 16384 | 4096 | 4.00 | 3.52 | 0.01 | 12.10 | 3.44 | 88.87 | 0 | 0 | 0 |
23 | DENSE | 16384 | 4096 | 4.00 | 3.62 | 0.02 | 12.71 | 3.51 | 88.44 | 1 | 0 | 0 |
26 | DENSE | 4096 | 4096 | 1.00 | 1.75 | 0.02 | 6.77 | 3.87 | 33.72 | 0 | 0 | 7 |