Most important breakthrough this month: Differential Transformer vastly improves attention ⇒ better retrieval and fewer hallucinations! Thought that self-attention could not be improved anymore? Researchers at @MSFTResearch and @Tsinghua_Uni have dropped a novel
Differential Transformer improves attention and reduces hallucinations
By
–
