Microsoft's small language models showing that smaller models can match larger ones with better training.