We are happy to announce that we have brought up support for Llama-3.1-70B inference on Tenstorrent’s 8-chip systems, the TT-QuietBox and the TT-LoudBox. The source code for Llama-3.1-70B and other models that are supported is on our GitHub —>
Tenstorrent Brings Llama-3.1-70B Inference Support to 8-Chip Systems
By
–
Leave a Reply