Exploring Accelerated Compound AI Systems with SambaNova & Llama 3.3-70B
OpenAI o1 is approximately 30 times slower than GPT-4o. Similarly, the o1 mini version is around 16 times slower than GPT-4o mini.
Frontier models are getting slower! Yup, you heard that right. A few days ago Google released Gemini 2.0 Flash Thinking with experimental reasoning to compete with OpenAI&