Summary of Gemini's 60-page technical report. 1. Written in Jax and trained using TPUs. The architecture, while not explained in details, seems similar to Flamigo's. 2. Gemini Pro's performance is similar to GPT-3.5 and Gemini Ultra is reported to be better than GPT-4. Nano-1
Gemini Technical Report: TPU Training, Performance vs GPT Models
By
–
Leave a Reply