Sector 6 | The Newsletter of AIM

Sector 6 | The Newsletter of AIM

Share this post

Sector 6 | The Newsletter of AIM
Sector 6 | The Newsletter of AIM
LLMs Getting Cheaper πŸ€— πŸ”₯
Copy link
Facebook
Email
Notes
More

LLMs Getting Cheaper πŸ€— πŸ”₯

Analytics India Magazine's avatar
Analytics India Magazine
Jul 17, 2024
βˆ™ Paid

Share this post

Sector 6 | The Newsletter of AIM
Sector 6 | The Newsletter of AIM
LLMs Getting Cheaper πŸ€— πŸ”₯
Copy link
Facebook
Email
Notes
More
Share

In a recent post, renowned computer scientist Andrej Karpathy demonstrated how the cost of training large language models (LLMs) has significantly decreased over the past five years, making it feasible to train models like GPT-2 for approximately $672 on β€œone 8XH100 GPU node in 24 hours”.

β€œIncredibly, the costs have come down dramatically over the past five years due to improvements in compute hardware (H100 GPUs), software (CUDA, cuBLAS, cuDNN, FlashAttention) and data quality (e.g., the FineWeb-Edu dataset),” said Karpathy.Β 

Keep reading with a 7-day free trial

Subscribe to Sector 6 | The Newsletter of AIM to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
Β© 2025 Analytics India Magazine
Privacy βˆ™ Terms βˆ™ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More