Flash Attention 2 Integration Reduces Lit-GPT Runtime by 11%

My colleagues have already added Flash Attention 2 to our Lit-GPT repo! So if you are working on the NeurIPS LLM Efficiency Challenge (for which Lit-GPT is the official starter kit, https://llm-efficiency-challenge.github.io), you can shave roughly 11% off your total runtime.
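For context, PyTorch 2.x exposes fused attention through `torch.nn.functional.scaled_dot_product_attention`, which dispatches to a FlashAttention kernel when one is available (CUDA with fp16/bf16 tensors) and falls back to the standard math implementation otherwise. Here is a minimal sketch, assuming PyTorch >= 2.0; the tensor shapes are illustrative, not Lit-GPT's actual configuration:

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: (batch, num_heads, seq_len, head_dim)
q = torch.randn(1, 8, 128, 64)
k = torch.randn(1, 8, 128, 64)
v = torch.randn(1, 8, 128, 64)

# On CUDA with half-precision inputs, this call can dispatch to a
# FlashAttention kernel; on CPU it uses the math fallback, so the
# result is the same either way, only the speed differs.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)

# The output keeps the query's shape.
print(out.shape)  # torch.Size([1, 8, 128, 64])
```

Because the dispatch happens inside PyTorch, a model that already calls `scaled_dot_product_attention` picks up kernel improvements like Flash Attention 2 from a PyTorch upgrade without any model-code changes.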