A true 'hello world' LLM pipeline

(alganet.github.io)

2 points | by gaigalas 4 hours ago

1 comment

  • storystarling 3 hours ago
    Fast feedback is key, but I'm skeptical of the $100 figure for training nanoGPT. If you use spot instances on Lambda or RunPod you can train a model that size for less than a dollar. I've been running similar experiments recently and the compute cost is basically a rounding error.
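
A back-of-the-envelope way to check a cost claim like the one above is to multiply the hourly GPU rate by the number of GPUs and the hours of training. The sketch below assumes exactly that simple model. The function names and the example rate and duration are hypothetical placeholders, not quoted Lambda or RunPod prices and not measured nanoGPT training times, so substitute real numbers before drawing conclusions.

```python
def training_cost_usd(gpu_hourly_rate_usd: float, gpu_count: int, hours: float) -> float:
    """Estimate cost as hourly rate per GPU x number of GPUs x wall-clock hours."""
    if gpu_hourly_rate_usd < 0 or gpu_count < 0 or hours < 0:
        raise ValueError("inputs must be non-negative")
    return gpu_hourly_rate_usd * gpu_count * hours


def hours_within_budget(budget_usd: float, gpu_hourly_rate_usd: float, gpu_count: int = 1) -> float:
    """Return how many wall-clock hours a budget buys at a given per-GPU rate."""
    if gpu_hourly_rate_usd <= 0 or gpu_count <= 0:
        raise ValueError("rate and GPU count must be positive")
    return budget_usd / (gpu_hourly_rate_usd * gpu_count)


if __name__ == "__main__":
    # Hypothetical inputs for illustration only: $0.50 per GPU-hour, 1 GPU, 1.5 hours.
    print(training_cost_usd(0.50, 1, 1.5))
    # How long a $1.00 budget lasts at the same hypothetical rate.
    print(hours_within_budget(1.00, 0.50))
```

Under this model, whether a run costs under a dollar or closer to $100 depends entirely on the rate and the duration plugged in, which is why the two figures in the thread can differ so much.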