Do you see an arithmetic operation that could help us calculate this layernorm standard deviation? pic.twitter.com/nV4wwJRCLB
— Andrej Karpathy (@karpathy) July 11, 2024