WeightWatcher Analysis of SmolLM
"SmolLM is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters. They are capable of solving a wide range of tasks while being lightweight enough to run on-device." github
Here, see the SmolLM-Instruct base models. And as with other well trained models, as the model gets larger, both the average alpha and the average Dks values systematically decrease.