<p>🎉 1T or bust my dudes 🎉 An implementation of model &amp; data parallel GPT2 &amp; GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library. Training and inference supported</p>

Breakdown

🎉 1T or bust my dudes 🎉

An implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.

Training and inference supported

Curated

Mar 23, 8:45 AM

Source

Tags

Tomorrow's news, today

AI-driven updates, curated by humans and hand-edited for the Prototypr community