Dummy LM Just an old dummy experiment to build a small language model based in the transformer (decoder-only) architecture.