Llama-3.1-70B-Instruct


Find this model in the Llama3.1 model summary


Llama-3.1-70B-Instruct Model Set Plots


Llama3.1 Compared to Base Model Plots



Llama-3.1-70B-Instruct Model Selected Details
id layer_type N M Q alpha D alpha-hat num_spikes warning
1 dense 28672 8192 3.5 2.847089 0.014772 0.258336 207
2 dense 28672 8192 3.5 2.395193 0.016911 3.523310 774
3 dense 28672 8192 3.5 2.398095 0.017280 3.533453 692
4 dense 8192 1024 8.0 1.443948 0.014846 0.946591 444 over-trained
5 dense 8192 8192 1.0 2.728213 0.022633 -1.006799 71
6 dense 8192 8192 1.0 1.542435 0.017179 1.684866 109 over-trained
7 dense 8192 1024 8.0 1.644491 0.019264 1.716480 614 over-trained
8 dense 28672 8192 3.5 2.664352 0.036356 -0.954124 686
9 dense 28672 8192 3.5 4.088873 0.012324 -3.589805 183
10 dense 8192 1024 8.0 2.465904 0.033707 -1.723975 275
11 dense 8192 1024 8.0 1.826099 0.024280 -2.093052 131 over-trained
12 dense 8192 8192 1.0 3.008950 0.015346 -2.412067 106
13 dense 28672 8192 3.5 2.637243 0.031983 -0.860541 782
14 dense 8192 8192 1.0 1.725814 0.025926 -1.081069 512 over-trained
15 dense 8192 8192 1.0 1.611045 0.030209 -1.491812 767 over-trained
16 dense 8192 1024 8.0 3.537094 0.028147 -5.772153 63
17 dense 8192 8192 1.0 2.310075 0.063768 -2.645216 1270
18 dense 28672 8192 3.5 3.611190 0.031648 -1.276009 111
19 dense 28672 8192 3.5 2.724818 0.043687 -1.061400 926
20 dense 28672 8192 3.5 4.182672 0.012894 -3.694862 275
21 dense 8192 1024 8.0 1.551158 0.023962 -1.663335 554 over-trained
22 dense 28672 8192 3.5 4.053825 0.012109 -3.414007 290
23 dense 28672 8192 3.5 2.952383 0.028681 -0.868402 657
24 dense 28672 8192 3.5 2.984758 0.030782 -0.944223 576
25 dense 8192 1024 8.0 1.674320 0.047764 -1.570661 339 over-trained
26 dense 8192 8192 1.0 3.269727 0.018157 -3.286497 109
27 dense 8192 8192 1.0 1.789880 0.030267 -0.884196 631 over-trained
28 dense 8192 1024 8.0 3.118933 0.017163 -3.058375 155
29 dense 28672 8192 3.5 3.156817 0.021633 -0.990004 547
30 dense 28672 8192 3.5 3.163455 0.022653 -1.047056 542
31 dense 8192 1024 8.0 3.295536 0.041545 -5.727541 83
32 dense 28672 8192 3.5 4.064000 0.013271 -3.370659 275
33 dense 8192 8192 1.0 3.140139 0.038546 -3.280115 90
34 dense 8192 1024 8.0 1.954624 0.034578 -2.149639 163 over-trained
35 dense 8192 8192 1.0 1.901065 0.025449 -1.202491 658 over-trained
36 dense 28672 8192 3.5 3.803488 0.010671 -2.890434 269
37 dense 28672 8192 3.5 2.955049 0.014705 1.295188 480
38 dense 28672 8192 3.5 2.960103 0.014448 1.266839 457
39 dense 8192 1024 8.0 1.919164 0.033987 -2.450268 213 over-trained
40 dense 8192 8192 1.0 3.067724 0.026059 -3.162264 132
41 dense 8192 8192 1.0 1.941019 0.024078 -1.308911 735 over-trained
42 dense 8192 1024 8.0 3.178266 0.031208 -5.544776 99
43 dense 8192 1024 8.0 3.219708 0.044553 -5.794322 78
44 dense 8192 8192 1.0 3.218915 0.033085 -3.489942 86
45 dense 8192 1024 8.0 2.197873 0.031405 -2.925638 174
46 dense 8192 8192 1.0 2.070221 0.023912 -1.326872 593
47 dense 28672 8192 3.5 3.265577 0.012501 -0.811278 142
48 dense 28672 8192 3.5 3.720944 0.012357 -2.435661 231
49 dense 28672 8192 3.5 3.290462 0.011246 -0.911684 130
50 dense 8192 8192 1.0 2.773582 0.043150 -2.503404 159
51 dense 28672 8192 3.5 3.130394 0.011145 -0.631547 136
52 dense 28672 8192 3.5 3.646511 0.009723 -2.366342 234
53 dense 28672 8192 3.5 3.149994 0.012736 -0.743668 143
54 dense 8192 8192 1.0 2.059728 0.021287 -1.238136 541
55 dense 8192 1024 8.0 2.239289 0.023914 -2.927209 138
56 dense 8192 1024 8.0 2.767065 0.052168 -4.747082 159
57 dense 8192 8192 1.0 1.970844 0.024290 -0.707356 657 over-trained
58 dense 28672 8192 3.5 3.687125 0.015751 -2.485140 216
59 dense 28672 8192 3.5 2.955877 0.009781 -0.513578 199
60 dense 28672 8192 3.5 2.999615 0.011046 -0.597765 182
61 dense 8192 1024 8.0 1.902343 0.028094 -1.938462 241 over-trained
62 dense 8192 8192 1.0 3.226518 0.038388 -2.924755 62
63 dense 8192 1024 8.0 3.063262 0.039687 -5.116494 97
64 dense 8192 8192 1.0 1.949546 0.024748 -0.910612 577 over-trained
65 dense 8192 1024 8.0 3.386256 0.043149 -6.017173 61
66 dense 8192 1024 8.0 1.843893 0.045023 -2.038805 259 over-trained
67 dense 8192 8192 1.0 3.522540 0.038606 -3.559896 40
68 dense 28672 8192 3.5 2.927559 0.012045 1.039297 245
69 dense 28672 8192 3.5 2.915103 0.011211 1.103817 236
70 dense 28672 8192 3.5 3.646050 0.010998 -2.111575 168
71 dense 8192 8192 1.0 2.203656 0.024217 -1.238878 437
72 dense 8192 8192 1.0 3.270643 0.039800 -3.470845 96
73 dense 8192 1024 8.0 2.293426 0.026434 -2.924108 180
74 dense 8192 1024 8.0 3.314864 0.054611 -5.942568 88
75 dense 28672 8192 3.5 3.201046 0.015218 -0.747981 91
76 dense 28672 8192 3.5 3.848217 0.010198 -2.656146 175
77 dense 28672 8192 3.5 3.181913 0.015721 -0.848578 152
78 dense 28672 8192 3.5 3.926211 0.018351 -2.633785 127
79 dense 28672 8192 3.5 3.080913 0.017222 -0.803860 222
80 dense 28672 8192 3.5 3.113347 0.015459 -0.927862 208
81 dense 8192 1024 8.0 1.989985 0.029666 -2.458876 306 over-trained
82 dense 8192 8192 1.0 3.234112 0.061351 -3.938728 269
83 dense 8192 8192 1.0 2.094751 0.025742 -1.181424 585
84 dense 8192 1024 8.0 3.443246 0.036619 -6.247240 74
85 dense 8192 1024 8.0 3.545720 0.040610 -5.878966 34
86 dense 28672 8192 3.5 3.609412 0.020382 -2.330764 153
87 dense 8192 8192 1.0 2.048716 0.018400 -0.851036 599
88 dense 28672 8192 3.5 2.961243 0.013667 -0.489164 161
89 dense 8192 1024 8.0 2.078820 0.018543 -2.246921 197
90 dense 28672 8192 3.5 2.941296 0.015138 -0.582331 255
91 dense 8192 8192 1.0 2.270671 0.062973 -2.061734 768
92 dense 28672 8192 3.5 3.571202 0.014274 -2.098549 104
93 dense 28672 8192 3.5 2.995271 0.023523 -0.485839 119
94 dense 28672 8192 3.5 3.035329 0.026371 -0.581818 115
95 dense 8192 1024 8.0 2.237124 0.022655 -2.576702 176
96 dense 8192 8192 1.0 2.139771 0.068193 -2.443838 1108
97 dense 8192 8192 1.0 2.191813 0.024917 -0.991607 387
98 dense 8192 1024 8.0 3.515690 0.052480 -6.436075 73
99 dense 8192 8192 1.0 3.550825 0.040146 -3.867964 70
100 dense 8192 1024 8.0 3.387304 0.054583 -6.372783 90
101 dense 8192 8192 1.0 2.169234 0.036647 -1.080842 532
102 dense 8192 1024 8.0 2.413607 0.030500 -2.791156 105
103 dense 28672 8192 3.5 3.085754 0.020337 -0.582596 102
104 dense 28672 8192 3.5 3.418734 0.018135 -2.167624 210
105 dense 28672 8192 3.5 3.239353 0.026496 -0.756718 53
106 dense 28672 8192 3.5 3.000709 0.024252 -0.167268 76
107 dense 8192 1024 8.0 2.268538 0.031978 -2.705623 84
108 dense 8192 8192 1.0 3.092914 0.033732 -2.619211 77
109 dense 8192 8192 1.0 2.142500 0.024317 -0.847563 409
110 dense 8192 1024 8.0 2.868838 0.044836 -4.844330 140
111 dense 28672 8192 3.5 3.328596 0.020579 -2.370277 255
112 dense 28672 8192 3.5 2.992892 0.024024 -0.308818 112
113 dense 8192 8192 1.0 2.098963 0.032648 -0.947570 548
114 dense 8192 1024 8.0 3.152004 0.058463 -5.403837 93
115 dense 8192 8192 1.0 3.395401 0.044749 -3.118236 59
116 dense 8192 1024 8.0 2.068496 0.040816 -2.626513 280
117 dense 28672 8192 3.5 3.044428 0.017312 -0.277112 85
118 dense 28672 8192 3.5 2.972994 0.020699 -0.135159 79
119 dense 28672 8192 3.5 3.528193 0.027783 -2.609856 159
120 dense 28672 8192 3.5 3.546921 0.023157 -2.689988 228
121 dense 28672 8192 3.5 2.926362 0.021374 -0.296597 146
122 dense 28672 8192 3.5 3.093863 0.013373 -0.471078 98
123 dense 8192 1024 8.0 2.394445 0.036393 -3.205927 108
124 dense 8192 8192 1.0 3.780573 0.033927 -3.973233 64
125 dense 8192 8192 1.0 2.140692 0.039420 -1.127839 662
126 dense 8192 1024 8.0 4.086722 0.042352 -7.167544 40
127 dense 28672 8192 3.5 3.808953 0.029937 -2.746521 99
128 dense 8192 1024 8.0 2.955638 0.083102 -5.729538 299
129 dense 28672 8192 3.5 2.983726 0.019182 -0.391951 111
130 dense 8192 8192 1.0 3.686614 0.048137 -4.279323 101
131 dense 8192 1024 8.0 2.506474 0.033328 -3.399683 91
132 dense 8192 8192 1.0 2.174403 0.038414 -1.148670 615
133 dense 28672 8192 3.5 3.225030 0.018124 -0.742943 77
134 dense 8192 8192 1.0 2.019079 0.033650 -0.501818 795
135 dense 8192 1024 8.0 3.616992 0.027988 -5.958693 57
136 dense 8192 1024 8.0 2.030568 0.044099 -2.268237 263
137 dense 8192 8192 1.0 2.890189 0.021999 -2.040329 147
138 dense 28672 8192 3.5 3.194724 0.017385 -0.787146 96
139 dense 28672 8192 3.5 2.714184 0.017113 -0.344144 650
140 dense 28672 8192 3.5 3.749668 0.019288 -2.805807 261
141 dense 28672 8192 3.5 4.181120 0.019934 -3.082013 85
142 dense 28672 8192 3.5 2.777627 0.019271 -0.343686 496
143 dense 28672 8192 3.5 3.158620 0.019417 -0.616407 103
144 dense 8192 1024 8.0 2.155278 0.026678 -2.012008 217
145 dense 8192 8192 1.0 2.958194 0.020530 -2.684793 168
146 dense 8192 8192 1.0 2.170789 0.036362 -0.801573 490
147 dense 8192 1024 8.0 3.502699 0.028920 -5.716601 65
148 dense 28672 8192 3.5 3.289901 0.019694 -0.951778 91
149 dense 28672 8192 3.5 2.863327 0.017698 -0.534209 478
150 dense 28672 8192 3.5 4.090873 0.013780 -3.419004 284
151 dense 8192 8192 1.0 3.876544 0.028694 -4.411082 98
152 dense 8192 8192 1.0 2.238836 0.042498 -1.029022 654
153 dense 8192 1024 8.0 2.539626 0.032303 -3.169353 97
154 dense 8192 1024 8.0 4.809173 0.034764 -8.960625 39
155 dense 8192 8192 1.0 2.177406 0.024922 -0.956146 547
156 dense 8192 8192 1.0 3.504172 0.020714 -3.180128 196
157 dense 8192 1024 8.0 2.326527 0.022993 -2.834764 143
158 dense 8192 1024 8.0 4.142978 0.023494 -7.839177 76
159 dense 28672 8192 3.5 2.934326 0.015379 -0.880652 588
160 dense 28672 8192 3.5 4.490763 0.011061 -3.920757 251
161 dense 28672 8192 3.5 3.094318 0.020053 -1.264972 466
162 dense 28672 8192 3.5 4.703445 0.010857 -4.275132 263
163 dense 28672 8192 3.5 2.893335 0.017075 -0.782149 589
164 dense 28672 8192 3.5 3.048157 0.024476 -1.151097 470
165 dense 8192 1024 8.0 2.344324 0.020916 -2.532036 137
166 dense 8192 8192 1.0 3.328882 0.024875 -3.125984 193
167 dense 8192 8192 1.0 3.141844 0.026079 -1.010950 56
168 dense 8192 1024 8.0 4.323788 0.028338 -7.824680 60
169 dense 8192 8192 1.0 2.098660 0.027455 -0.872009 555
170 dense 8192 1024 8.0 4.447533 0.030648 -8.225598 61
171 dense 8192 8192 1.0 4.371331 0.022799 -5.197546 106
172 dense 8192 1024 8.0 2.102909 0.037163 -2.293385 179
173 dense 28672 8192 3.5 3.026146 0.019855 -1.176211 547
174 dense 28672 8192 3.5 2.935052 0.017491 -0.899951 520
175 dense 28672 8192 3.5 4.733797 0.011962 -4.113036 179
176 dense 8192 8192 1.0 2.217079 0.028319 -1.222992 536
177 dense 8192 8192 1.0 3.973396 0.025768 -4.444556 154
178 dense 8192 1024 8.0 2.357915 0.017599 -2.939555 187
179 dense 8192 1024 8.0 4.176836 0.022563 -7.421676 95
180 dense 28672 8192 3.5 2.915970 0.015079 -0.875336 659
181 dense 28672 8192 3.5 4.718619 0.024133 -4.167494 130
182 dense 28672 8192 3.5 3.088836 0.020921 -1.228221 469
183 dense 28672 8192 3.5 2.920718 0.015984 -0.861208 646
184 dense 28672 8192 3.5 3.541551 0.022205 -1.400874 79
185 dense 8192 1024 8.0 2.242426 0.022240 -1.852009 140
186 dense 8192 8192 1.0 3.126366 0.019703 -2.921210 242
187 dense 8192 8192 1.0 2.151974 0.034812 -0.813418 593
188 dense 8192 1024 8.0 3.568929 0.019575 -6.136211 103
189 dense 28672 8192 3.5 3.814485 0.055182 -3.322812 645
190 dense 8192 8192 1.0 3.125327 0.024883 -0.865232 47
191 dense 8192 1024 8.0 3.927556 0.024903 -6.811179 76
192 dense 8192 8192 1.0 3.307619 0.024895 -2.762456 198
193 dense 28672 8192 3.5 4.362756 0.027770 -3.505052 154
194 dense 28672 8192 3.5 3.452938 0.021212 -1.098278 64
195 dense 28672 8192 3.5 2.763216 0.019754 -0.652836 818
196 dense 8192 1024 8.0 2.319711 0.028885 -2.585005 150
197 dense 28672 8192 3.5 4.114547 0.028776 -3.251660 235
198 dense 28672 8192 3.5 2.797391 0.021725 -0.586141 538
199 dense 28672 8192 3.5 3.479047 0.022031 -1.062620 56
200 dense 8192 1024 8.0 2.487249 0.021848 -2.852917 160
201 dense 8192 8192 1.0 3.912754 0.014813 -4.205880 115
202 dense 8192 8192 1.0 2.382971 0.048000 -1.260169 406
203 dense 8192 1024 8.0 3.994866 0.025932 -6.999458 67
204 dense 8192 1024 8.0 3.401216 0.014607 -5.472364 102
205 dense 28672 8192 3.5 3.256231 0.019469 -0.936962 165
206 dense 28672 8192 3.5 3.912026 0.027321 -2.963983 242
207 dense 8192 8192 1.0 2.339020 0.038515 -1.042180 420
208 dense 8192 1024 8.0 2.407189 0.022297 -2.807443 169
209 dense 8192 8192 1.0 3.205949 0.019937 -2.514130 203
210 dense 28672 8192 3.5 3.092038 0.023574 -0.478614 145
211 dense 28672 8192 3.5 3.717275 0.032620 -2.821167 256
212 dense 28672 8192 3.5 2.755468 0.020054 -0.339889 552
213 dense 28672 8192 3.5 3.252187 0.016110 -0.779827 92
214 dense 8192 1024 8.0 2.255562 0.030053 -2.561632 168
215 dense 8192 8192 1.0 3.291771 0.019451 -2.518575 189
216 dense 8192 8192 1.0 2.133807 0.040033 -0.644341 702
217 dense 8192 1024 8.0 4.122044 0.032565 -7.421369 48
218 dense 8192 1024 8.0 3.835818 0.082841 -7.261631 183
219 dense 8192 8192 1.0 3.160455 0.030068 -0.998223 63
220 dense 8192 8192 1.0 3.667346 0.014945 -3.098463 117
221 dense 28672 8192 3.5 3.541609 0.032547 -2.712804 374
222 dense 28672 8192 3.5 3.280626 0.014081 -0.753295 100
223 dense 28672 8192 3.5 3.105223 0.021378 -0.386781 109
224 dense 8192 1024 8.0 2.489116 0.037148 -2.929891 150
225 dense 28672 8192 3.5 3.095189 0.021603 -0.330099 93
226 dense 28672 8192 3.5 3.170976 0.019288 -0.626267 116
227 dense 28672 8192 3.5 3.482910 0.032976 -2.399847 311
228 dense 8192 1024 8.0 2.140880 0.021883 -2.244451 237
229 dense 8192 8192 1.0 2.062316 0.038535 -0.658394 655
230 dense 8192 8192 1.0 3.232993 0.016095 -2.158856 143
231 dense 8192 1024 8.0 3.531916 0.031108 -5.825471 117
232 dense 28672 8192 3.5 3.257243 0.035886 -2.393737 439
233 dense 28672 8192 3.5 2.915137 0.015492 1.982432 136
234 dense 28672 8192 3.5 2.950290 0.014128 2.020529 297
235 dense 8192 1024 8.0 2.551386 0.035371 -2.956301 113
236 dense 8192 8192 1.0 3.597819 0.020743 -3.553896 140
237 dense 8192 8192 1.0 2.333467 0.043053 -0.765445 447
238 dense 8192 1024 8.0 4.499475 0.053014 -8.521016 61
239 dense 8192 8192 1.0 3.681446 0.017338 -3.130472 134
240 dense 8192 1024 8.0 4.275251 0.075365 -8.371229 125
241 dense 8192 8192 1.0 2.250086 0.051268 -0.691927 646
242 dense 28672 8192 3.5 3.267885 0.017748 -0.716621 106
243 dense 28672 8192 3.5 2.713481 0.020740 -0.202060 695
244 dense 28672 8192 3.5 3.537809 0.035009 -2.470217 227
245 dense 8192 1024 8.0 2.453574 0.035536 -3.003362 164
246 dense 8192 1024 8.0 2.686972 0.027303 -2.753542 108
247 dense 8192 8192 1.0 2.156285 0.059561 -0.728338 907
248 dense 8192 1024 8.0 4.784517 0.049188 -9.520306 63
249 dense 28672 8192 3.5 3.251246 0.032423 -2.543317 577
250 dense 28672 8192 3.5 2.747570 0.026438 -0.383920 719
251 dense 28672 8192 3.5 3.363754 0.023654 -0.949121 124
252 dense 8192 8192 1.0 4.242652 0.021536 -4.566352 108
253 dense 8192 1024 8.0 4.606709 0.039339 -8.634651 48
254 dense 8192 8192 1.0 2.266573 0.036690 -0.769328 385
255 dense 8192 8192 1.0 3.876937 0.017838 -3.358951 107
256 dense 28672 8192 3.5 2.969057 0.025110 -0.782208 497
257 dense 28672 8192 3.5 3.018545 0.024395 -0.507542 226
258 dense 28672 8192 3.5 3.195724 0.030302 -2.229734 598
259 dense 8192 1024 8.0 2.328059 0.028381 -2.612089 147
260 dense 28672 8192 3.5 3.161304 0.027248 -2.342486 692
261 dense 28672 8192 3.5 3.148617 0.025712 -0.494801 137
262 dense 28672 8192 3.5 3.241530 0.022370 -0.847090 184
263 dense 8192 1024 8.0 2.455256 0.031705 -2.709107 117
264 dense 8192 8192 1.0 4.073487 0.021199 -3.708096 125
265 dense 8192 8192 1.0 2.312474 0.045963 -0.749831 528
266 dense 8192 1024 8.0 4.823405 0.036232 -9.177838 56
267 dense 8192 8192 1.0 4.533341 0.020138 -4.423318 70
268 dense 8192 1024 8.0 2.362609 0.024074 -2.586181 167
269 dense 8192 8192 1.0 2.338164 0.038925 -0.958845 429
270 dense 28672 8192 3.5 3.059330 0.023494 -0.520163 247
271 dense 28672 8192 3.5 3.363175 0.024572 -2.531677 465
272 dense 28672 8192 3.5 3.297426 0.019394 -0.972658 191
273 dense 8192 1024 8.0 4.097419 0.028796 -6.868563 43
274 dense 8192 8192 1.0 4.194192 0.019910 -4.676861 78
275 dense 8192 8192 1.0 2.360286 0.039815 -1.232514 345
276 dense 8192 1024 8.0 2.479609 0.021301 -2.873528 147
277 dense 8192 1024 8.0 4.349502 0.037948 -7.901832 70
278 dense 28672 8192 3.5 2.999171 0.023489 -0.612284 366
279 dense 28672 8192 3.5 3.365122 0.019129 -2.564756 503
280 dense 28672 8192 3.5 3.257031 0.022108 -1.025221 214
281 dense 28672 8192 3.5 3.129831 0.021447 -0.627606 218
282 dense 28672 8192 3.5 3.270522 0.018539 -0.942306 189
283 dense 8192 1024 8.0 2.449766 0.027766 -2.867858 118
284 dense 8192 8192 1.0 3.581383 0.033286 -3.689494 176
285 dense 8192 8192 1.0 2.204533 0.030676 -0.857807 469
286 dense 8192 1024 8.0 4.523665 0.050411 -8.599172 64
287 dense 28672 8192 3.5 3.494614 0.018901 -2.653728 447
288 dense 8192 8192 1.0 2.228657 0.025667 -0.985065 520
289 dense 8192 8192 1.0 4.142613 0.033803 -3.160306 136
290 dense 8192 1024 8.0 2.286382 0.021910 -2.181052 181
291 dense 8192 1024 8.0 4.623235 0.027749 -8.526490 58
292 dense 28672 8192 3.5 3.137649 0.022411 -0.701445 262
293 dense 28672 8192 3.5 3.516974 0.019470 -2.768360 549
294 dense 28672 8192 3.5 3.320424 0.021405 -1.005321 182
295 dense 8192 8192 1.0 2.085450 0.030633 -1.165086 570
296 dense 8192 8192 1.0 4.927228 0.033210 -6.020898 89
297 dense 8192 1024 8.0 2.277734 0.031609 -2.574855 130
298 dense 8192 1024 8.0 5.210773 0.030187 -9.789193 45
299 dense 28672 8192 3.5 3.270020 0.021497 -0.793506 146
300 dense 28672 8192 3.5 3.589040 0.015976 -2.832545 516
301 dense 28672 8192 3.5 3.375808 0.022102 -1.085007 156
302 dense 28672 8192 3.5 3.253594 0.019455 -0.759485 187
303 dense 8192 1024 8.0 2.233870 0.023895 -2.649132 163
304 dense 8192 8192 1.0 4.412213 0.022027 -4.724061 87
305 dense 8192 8192 1.0 2.111482 0.029589 -1.225146 577
306 dense 8192 1024 8.0 4.012535 0.039284 -7.186870 110
307 dense 28672 8192 3.5 3.530169 0.015748 -2.662017 637
308 dense 28672 8192 3.5 3.381582 0.018404 -1.064913 161
309 dense 8192 8192 1.0 2.287056 0.031881 -0.939625 496
310 dense 8192 1024 8.0 4.264519 0.043781 -7.781678 54
311 dense 8192 8192 1.0 3.469447 0.019244 -2.697200 107
312 dense 8192 1024 8.0 2.591455 0.023091 -3.131402 93
313 dense 28672 8192 3.5 3.288766 0.018193 -0.948151 270
314 dense 28672 8192 3.5 3.156208 0.017530 -0.633350 314
315 dense 28672 8192 3.5 3.595899 0.015802 -2.800880 540
316 dense 28672 8192 3.5 3.680107 0.013709 -3.014700 516
317 dense 28672 8192 3.5 3.174604 0.016230 -0.659754 271
318 dense 28672 8192 3.5 3.378210 0.016470 -0.972502 134
319 dense 8192 1024 8.0 2.272124 0.036242 -2.489521 122
320 dense 8192 8192 1.0 3.591675 0.017337 -3.247561 167
321 dense 8192 8192 1.0 2.066692 0.027051 -1.064110 602
322 dense 8192 1024 8.0 3.819671 0.028874 -7.096052 130
323 dense 28672 8192 3.5 3.210177 0.015288 -0.763760 281
324 dense 28672 8192 3.5 3.677593 0.011201 -2.967848 600
325 dense 8192 1024 8.0 4.507748 0.025561 -8.151765 81
326 dense 28672 8192 3.5 3.415323 0.015470 -1.048195 135
327 dense 8192 8192 1.0 4.514115 0.067662 -6.206637 230
328 dense 8192 1024 8.0 1.499648 0.027419 -1.543658 723 over-trained
329 dense 8192 8192 1.0 1.747745 0.046379 -1.488792 581 over-trained
330 dense 8192 8192 1.0 2.018293 0.026860 -1.478713 746
331 dense 8192 8192 1.0 3.186658 0.011600 -2.558560 328
332 dense 8192 1024 8.0 2.254558 0.031907 -2.635995 136
333 dense 8192 1024 8.0 3.380134 0.015556 -5.444599 147
334 dense 28672 8192 3.5 3.225118 0.012457 -0.712430 269
335 dense 28672 8192 3.5 3.706513 0.013696 -3.034345 545
336 dense 28672 8192 3.5 3.364296 0.014012 -1.000216 168
337 dense 28672 8192 3.5 3.716904 0.012044 -3.027958 534
338 dense 28672 8192 3.5 3.186951 0.014602 -0.661277 334
339 dense 28672 8192 3.5 3.377483 0.014181 -0.976105 166
340 dense 8192 1024 8.0 2.220775 0.028658 -2.701405 125
341 dense 8192 8192 1.0 3.500892 0.017545 -3.295877 130
342 dense 8192 8192 1.0 2.067930 0.027676 -1.418602 608
343 dense 8192 1024 8.0 3.414643 0.025710 -5.564629 148
344 dense 28672 8192 3.5 3.374930 0.013429 -0.986118 158
345 dense 8192 8192 1.0 3.732862 0.025341 -3.541723 187
346 dense 28672 8192 3.5 3.251604 0.013057 -0.720824 232
347 dense 28672 8192 3.5 3.733019 0.012196 -3.152863 563
348 dense 8192 1024 8.0 3.329996 0.021271 -5.159179 145
349 dense 8192 8192 1.0 2.098586 0.026808 -1.555782 598
350 dense 8192 1024 8.0 2.275712 0.027278 -2.699711 137
351 dense 28672 8192 3.5 3.737051 0.011962 -3.182908 567
352 dense 28672 8192 3.5 3.231756 0.012014 -0.750328 343
353 dense 28672 8192 3.5 3.310655 0.012816 -0.964817 312
354 dense 8192 1024 8.0 2.256064 0.050811 -2.600931 96
355 dense 8192 8192 1.0 3.773609 0.067553 -4.590205 434
356 dense 8192 8192 1.0 2.028915 0.033725 -1.760984 602
357 dense 8192 1024 8.0 4.449748 0.019110 -8.005379 70
358 dense 8192 8192 1.0 2.079144 0.027976 -1.562438 617
359 dense 8192 8192 1.0 3.321028 0.011134 -2.669486 207
360 dense 8192 1024 8.0 2.199865 0.029212 -2.596747 117
361 dense 8192 1024 8.0 3.457595 0.014684 -5.292376 168
362 dense 28672 8192 3.5 3.218530 0.012964 -0.720576 389
363 dense 28672 8192 3.5 3.744312 0.011257 -3.098467 588
364 dense 28672 8192 3.5 3.311297 0.013165 -0.981898 338
365 dense 8192 8192 1.0 2.235788 0.027672 -1.099144 591
366 dense 8192 8192 1.0 3.245729 0.029975 -3.573360 208
367 dense 8192 1024 8.0 2.547601 0.027423 -2.982109 134
368 dense 8192 1024 8.0 3.052529 0.052769 -5.511845 272
369 dense 28672 8192 3.5 3.223751 0.010346 -0.661501 355
370 dense 28672 8192 3.5 3.806598 0.013094 -3.351722 521
371 dense 28672 8192 3.5 3.310785 0.013939 -0.940741 345
372 dense 28672 8192 3.5 3.856060 0.010917 -3.456056 490
373 dense 28672 8192 3.5 3.238125 0.012047 -0.658978 336
374 dense 28672 8192 3.5 3.317417 0.013158 -0.927372 329
375 dense 8192 1024 8.0 2.430257 0.036372 -2.956415 125
376 dense 8192 8192 1.0 3.842722 0.028962 -4.638030 175
377 dense 8192 8192 1.0 2.193304 0.028982 -1.889630 581
378 dense 8192 1024 8.0 4.507887 0.032588 -7.906383 53
379 dense 8192 1024 8.0 4.716728 0.029296 -8.744224 62
380 dense 28672 8192 3.5 3.824699 0.011059 -3.400342 509
381 dense 8192 8192 1.0 1.971109 0.030094 -1.406523 708 over-trained
382 dense 8192 8192 1.0 3.841306 0.067255 -5.356056 402
383 dense 8192 1024 8.0 2.170643 0.036995 -2.528430 114
384 dense 28672 8192 3.5 3.327142 0.011977 -0.975097 358
385 dense 28672 8192 3.5 3.255605 0.011661 -0.741660 343
386 dense 28672 8192 3.5 3.882425 0.012233 -3.160528 479
387 dense 28672 8192 3.5 3.258228 0.008359 -0.641791 302
388 dense 28672 8192 3.5 3.332584 0.008782 -0.885307 269
389 dense 8192 1024 8.0 2.035359 0.016712 -2.058018 176
390 dense 8192 8192 1.0 3.193591 0.015516 -2.611888 275
391 dense 8192 8192 1.0 1.974850 0.023162 -0.872908 708 over-trained
392 dense 8192 1024 8.0 3.253658 0.012260 -5.009265 187
393 dense 8192 8192 1.0 2.303404 0.025982 -1.014969 538
394 dense 8192 8192 1.0 3.432867 0.029440 -3.466317 143
395 dense 8192 1024 8.0 2.443124 0.022677 -2.451465 149
396 dense 8192 1024 8.0 3.566541 0.036202 -6.024920 144
397 dense 28672 8192 3.5 3.226151 0.008202 -0.588902 305
398 dense 28672 8192 3.5 3.957116 0.011125 -3.534790 399
399 dense 28672 8192 3.5 3.291566 0.010154 -0.822200 324
400 dense 28672 8192 3.5 3.902116 0.014072 -3.461375 486
401 dense 8192 1024 8.0 4.252722 0.021875 -7.597794 65
402 dense 8192 8192 1.0 2.219999 0.024246 -1.511459 597
403 dense 28672 8192 3.5 3.296169 0.010176 -0.827507 373
404 dense 8192 1024 8.0 2.330364 0.025805 -2.731609 182
405 dense 28672 8192 3.5 3.203575 0.010272 -0.577102 419
406 dense 8192 8192 1.0 3.568421 0.058433 -3.826683 387
407 dense 28672 8192 3.5 3.896400 0.014334 -3.424644 511
408 dense 28672 8192 3.5 3.243699 0.007913 -0.634449 324
409 dense 28672 8192 3.5 3.337745 0.010619 -0.865602 225
410 dense 8192 1024 8.0 2.244002 0.033833 -2.668373 139
411 dense 8192 8192 1.0 4.838222 0.021820 -6.009667 77
412 dense 8192 8192 1.0 2.057385 0.029337 -1.694718 647
413 dense 8192 1024 8.0 4.096755 0.022972 -6.808580 92
414 dense 8192 8192 1.0 3.357494 0.015256 -3.277794 335
415 dense 8192 1024 8.0 3.466981 0.019888 -4.894464 226
416 dense 8192 1024 8.0 1.965245 0.024222 -1.253541 128 over-trained
417 dense 8192 8192 1.0 1.912764 0.024186 -0.658458 766 over-trained
418 dense 28672 8192 3.5 3.246994 0.008041 -0.433809 305
419 dense 28672 8192 3.5 3.968997 0.014816 -3.530414 337
420 dense 28672 8192 3.5 3.339014 0.009542 -0.682840 261
421 dense 8192 1024 8.0 2.455302 0.028524 -2.995178 122
422 dense 8192 8192 1.0 3.472414 0.039338 -3.688393 187
423 dense 8192 8192 1.0 2.316625 0.029344 -1.153130 515
424 dense 8192 1024 8.0 3.685369 0.031821 -6.437639 154
425 dense 28672 8192 3.5 3.948562 0.016537 -3.558220 376
426 dense 28672 8192 3.5 3.248658 0.007791 -0.396396 267
427 dense 28672 8192 3.5 3.310446 0.009829 -0.641528 349
428 dense 8192 8192 1.0 2.067972 0.024711 -1.334147 691
429 dense 8192 8192 1.0 4.131205 0.017415 -4.807783 144
430 dense 8192 1024 8.0 2.175460 0.025683 -2.517225 170
431 dense 8192 1024 8.0 3.951748 0.021876 -6.247713 140
432 dense 28672 8192 3.5 3.273080 0.007542 -0.429633 273
433 dense 28672 8192 3.5 3.985891 0.019994 -3.628594 363
434 dense 28672 8192 3.5 3.362461 0.009568 -0.720521 266
435 dense 8192 8192 1.0 1.802826 0.030655 -1.368770 687 over-trained
436 dense 8192 8192 1.0 3.766423 0.013665 -3.692308 215
437 dense 8192 1024 8.0 1.843618 0.037480 -2.014059 93 over-trained
438 dense 8192 1024 8.0 3.605447 0.015386 -5.065058 187
439 dense 28672 8192 3.5 3.273895 0.006629 -0.463930 339
440 dense 28672 8192 3.5 4.034328 0.018765 -3.432632 307
441 dense 28672 8192 3.5 3.331497 0.007134 -0.690444 365
442 dense 8192 1024 8.0 2.055052 0.031441 -2.102582 167
443 dense 8192 8192 1.0 1.980585 0.021827 -1.289621 667 over-trained
444 dense 8192 1024 8.0 3.032115 0.011506 -3.831171 229
445 dense 28672 8192 3.5 3.959695 0.017949 -3.400963 340
446 dense 28672 8192 3.5 3.245656 0.004998 -0.439615 356
447 dense 28672 8192 3.5 3.328561 0.006602 -0.654214 309
448 dense 8192 8192 1.0 2.869139 0.008847 -1.637469 302
449 dense 8192 8192 1.0 2.242368 0.023484 -0.923128 602
450 dense 8192 1024 8.0 3.823705 0.024145 -6.013782 77
451 dense 8192 8192 1.0 4.230629 0.021981 -3.843577 82
452 dense 8192 1024 8.0 2.328327 0.018510 -2.494623 167
453 dense 28672 8192 3.5 3.315384 0.005117 -0.637171 309
454 dense 28672 8192 3.5 3.217068 0.005816 -0.331389 352
455 dense 28672 8192 3.5 3.926747 0.022974 -3.399995 313
456 dense 28672 8192 3.5 3.940867 0.022331 -3.431284 301
457 dense 28672 8192 3.5 3.220639 0.006886 -0.378615 369
458 dense 28672 8192 3.5 3.303393 0.005404 -0.690377 365
459 dense 8192 1024 8.0 1.953848 0.035471 -1.787030 108 over-trained
460 dense 8192 8192 1.0 3.462296 0.012989 -3.026035 183
461 dense 8192 8192 1.0 1.934893 0.025265 -0.896493 639 over-trained
462 dense 8192 1024 8.0 3.302124 0.018214 -4.887244 188
463 dense 8192 8192 1.0 1.968282 0.020899 -1.080016 688 over-trained
464 dense 8192 8192 1.0 3.030438 0.013209 -1.636538 344
465 dense 8192 1024 8.0 2.004338 0.021947 -2.326752 149
466 dense 8192 1024 8.0 3.171034 0.013017 -4.037949 230
467 dense 28672 8192 3.5 3.218800 0.006175 -0.426258 397
468 dense 28672 8192 3.5 3.926416 0.025351 -3.344573 286
469 dense 28672 8192 3.5 3.295540 0.006356 -0.675064 361
470 dense 8192 8192 1.0 2.118905 0.020486 -0.968923 668
471 dense 8192 8192 1.0 2.926344 0.008896 -1.509767 243
472 dense 8192 1024 8.0 2.212292 0.015184 -2.337906 189
473 dense 8192 1024 8.0 2.975968 0.014293 -3.764063 227
474 dense 28672 8192 3.5 3.185982 0.005299 -0.335350 379
475 dense 28672 8192 3.5 3.898437 0.024655 -3.163455 290
476 dense 28672 8192 3.5 3.249604 0.006719 -0.616250 425
477 dense 28672 8192 3.5 3.157111 0.006353 -0.249368 365
478 dense 28672 8192 3.5 3.244001 0.005225 -0.595896 425
479 dense 8192 1024 8.0 2.319589 0.016069 -2.695653 180
480 dense 8192 8192 1.0 3.965799 0.026314 -3.419253 63
481 dense 8192 8192 1.0 2.262722 0.027922 -0.890049 562
482 dense 8192 1024 8.0 3.369859 0.030138 -5.194896 131
483 dense 28672 8192 3.5 3.885441 0.027078 -3.023420 258
484 dense 8192 8192 1.0 2.811494 0.012730 -1.253217 191
485 dense 8192 1024 8.0 2.909391 0.018252 -3.622552 124
486 dense 8192 1024 8.0 2.151515 0.019692 -1.576954 155
487 dense 8192 8192 1.0 2.090095 0.023020 -0.661865 540
488 dense 28672 8192 3.5 3.138157 0.006091 -0.103607 427
489 dense 28672 8192 3.5 3.786434 0.027174 -3.060488 290
490 dense 28672 8192 3.5 3.227999 0.004776 -0.444586 380
491 dense 28672 8192 3.5 3.661705 0.025299 -2.397763 351
492 dense 28672 8192 3.5 3.115231 0.005853 -0.025634 472
493 dense 28672 8192 3.5 3.203371 0.007152 -0.396653 416
494 dense 8192 1024 8.0 2.270085 0.020186 -1.839584 175
495 dense 8192 8192 1.0 2.657540 0.008661 -1.487331 283
496 dense 8192 8192 1.0 2.136328 0.016590 -0.526671 584
497 dense 8192 1024 8.0 2.763082 0.016322 -3.793959 260
498 dense 28672 8192 3.5 3.625151 0.018588 -2.521702 352
499 dense 8192 1024 8.0 3.280064 0.037570 -5.375137 151
500 dense 8192 8192 1.0 2.244634 0.027905 -0.139108 446
501 dense 28672 8192 3.5 3.170029 0.006779 -0.320023 382
502 dense 8192 1024 8.0 2.404513 0.023444 -2.458549 130
503 dense 28672 8192 3.5 3.095033 0.006729 -0.012901 420
504 dense 8192 8192 1.0 3.487283 0.013940 -2.105473 142
505 dense 8192 8192 1.0 2.108466 0.030803 -0.414600 803
506 dense 8192 8192 1.0 3.097805 0.023352 -2.045650 69
507 dense 8192 1024 8.0 2.317891 0.023797 -2.475843 205
508 dense 8192 1024 8.0 2.631491 0.034656 -3.870198 196
509 dense 28672 8192 3.5 3.058062 0.008509 0.106831 360
510 dense 28672 8192 3.5 3.586090 0.021279 -2.457458 324
511 dense 28672 8192 3.5 3.141827 0.008807 -0.196297 367
512 dense 28672 8192 3.5 3.481238 0.023527 -2.152496 449
513 dense 28672 8192 3.5 2.994833 0.009406 0.235740 393
514 dense 28672 8192 3.5 3.097725 0.007540 -0.161761 434
515 dense 8192 1024 8.0 2.265615 0.025875 -1.912689 192
516 dense 8192 8192 1.0 3.020880 0.023099 -1.467787 223
517 dense 8192 8192 1.0 2.285918 0.023581 0.074872 467
518 dense 8192 1024 8.0 3.042783 0.033176 -4.577494 147
519 dense 28672 8192 3.5 4.072062 0.020995 -2.295258 70
520 dense 28672 8192 3.5 3.111387 0.006904 -0.155343 381
521 dense 8192 1024 8.0 2.820100 0.029647 -4.026318 187
522 dense 28672 8192 3.5 2.972443 0.006673 0.384017 359
523 dense 8192 8192 1.0 3.236858 0.024853 -2.122262 103
524 dense 8192 1024 8.0 2.351722 0.024986 -2.280048 145
525 dense 8192 8192 1.0 2.273438 0.026693 0.069983 412
526 dense 8192 8192 1.0 2.893498 0.018750 0.181294 47
527 dense 8192 8192 1.0 3.568808 0.018928 -2.884219 181
528 dense 8192 1024 8.0 2.385138 0.020795 -2.252223 160
529 dense 8192 1024 8.0 3.299133 0.017708 -4.979527 102
530 dense 28672 8192 3.5 2.940231 0.007842 0.426164 338
531 dense 28672 8192 3.5 3.740890 0.020327 -1.890942 101
532 dense 28672 8192 3.5 3.042834 0.007589 0.048964 393
533 dense 28672 8192 3.5 3.391012 0.020212 -1.604465 338
534 dense 28672 8192 3.5 2.898897 0.009563 0.626730 444
535 dense 28672 8192 3.5 3.058961 0.010614 -0.034737 485
536 dense 8192 1024 8.0 2.163786 0.022966 -1.344376 191
537 dense 8192 8192 1.0 2.457222 0.012183 -0.633864 210
538 dense 8192 8192 1.0 2.058818 0.022736 0.282996 565
539 dense 8192 1024 8.0 2.390782 0.028485 -3.198007 349
540 dense 8192 8192 1.0 3.191492 0.019094 -1.579806 210
541 dense 28672 8192 3.5 3.005559 0.011947 -0.011763 452
542 dense 28672 8192 3.5 2.893316 0.010248 0.485640 386
543 dense 8192 8192 1.0 2.693504 0.019490 0.203065 81
544 dense 8192 1024 8.0 3.119585 0.036841 -4.847216 174
545 dense 8192 1024 8.0 2.180747 0.025580 -1.863745 228
546 dense 28672 8192 3.5 3.331776 0.020826 -1.398805 363
547 dense 8192 1024 8.0 2.814691 0.017555 -4.026953 194
548 dense 8192 8192 1.0 2.755096 0.023895 -1.005493 335
549 dense 8192 1024 8.0 2.067323 0.032851 -1.302894 247
550 dense 8192 8192 1.0 2.125895 0.029921 0.152200 472
551 dense 28672 8192 3.5 2.865756 0.010691 0.501464 291
552 dense 28672 8192 3.5 3.312314 0.013017 -1.391830 266
553 dense 28672 8192 3.5 2.921597 0.009142 0.118275 367
554 dense 8192 8192 1.0 1.889902 0.033678 0.315312 620 over-trained
555 dense 28672 8192 3.5 2.841103 0.014167 -1.113705 494
556 dense 28672 8192 3.5 2.903742 0.010638 0.496889 275
557 dense 28672 8192 3.5 2.933339 0.012241 0.383523 399
558 dense 8192 1024 8.0 1.877083 0.029895 -1.270419 285 over-trained
559 dense 8192 8192 1.0 2.586775 0.012803 -1.089840 296
560 dense 8192 1024 8.0 2.947151 0.013139 -4.090791 208