The AI (infe)race that will shape 2024

The AI industry is experiencing a pivotal shift as inference costs plummet. The trend shows in providers' competitive pricing: Mistral charges $0.65 per million input tokens and $1.96 per million output tokens, OpenAI charges $1.00 and $2.00, respectively, Fireworks.ai comes in at $1.60 per million output tokens, and Deepinfra offers a low $0.27 rate.

Dylan Patel and Daniel Nishball ask whether it's a "race to the bottom". I wouldn't call it that. The race is to cut prices, yet quality remains impressive. The market is now teeming with GPT-3.5-calibre models, both open-source and proprietary, and these are plenty good enough, especially at these prices, to be built into useful applications.
