layer_id | layer_type | N | M | Q | alpha | D | alpha-hat | log_SN | % Rand | num_traps | num_fingers | rank_loss |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2 | EMBEDDING | 30000 | 128 | 234.38 | 4.18 | 0.05 | 13.91 | 3.33 | 75.20 | 1 | 0 | 0 |
3 | EMBEDDING | 512 | 128 | 4.00 | 1.26 | 0.12 | 1.60 | 1.27 | 34.09 | 0 | 0 | 0 |
8 | DENSE | 1024 | 128 | 8.00 | 6.87 | 0.07 | 2.84 | 0.41 | 95.01 | 0 | 0 | 0 |
15 | DENSE | 1024 | 1024 | 1.00 | 3.50 | 0.05 | 5.27 | 1.50 | 84.34 | 0 | 0 | 1 |
16 | DENSE | 1024 | 1024 | 1.00 | 3.33 | 0.05 | 5.38 | 1.62 | 82.38 | 0 | 0 | 1 |
17 | DENSE | 1024 | 1024 | 1.00 | 3.72 | 0.07 | 4.43 | 1.19 | 89.75 | 0 | 0 | 1 |
20 | DENSE | 1024 | 1024 | 1.00 | 3.48 | 0.04 | 4.87 | 1.40 | 87.97 | 0 | 0 | 1 |
22 | DENSE | 4096 | 1024 | 4.00 | 3.71 | 0.03 | 7.83 | 2.11 | 84.57 | 0 | 0 | 0 |
23 | DENSE | 4096 | 1024 | 4.00 | 4.21 | 0.03 | 9.17 | 2.18 | 85.75 | 0 | 0 | 0 |
26 | DENSE | 1024 | 1024 | 1.00 | 2.13 | 0.04 | 5.11 | 2.40 | 47.10 | 0 | 0 | 4 |