
Google’s Gemini AI hints at the next great leap for the technology: analysing real-time information
Google has launched Gemini, a new artificial intelligence (AI) system that can seemingly understand and talk intelligently about almost any kind of prompt – pictures, text, speech, music, computer code and much more.
This type of AI system is known as a multimodal model. It’s a step beyond earlier systems, which could typically handle only text or only images. And it provides a strong hint of where AI may be going next: being able to analyse and respond to real-time information coming from the outside world.
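To make this concrete, here is a minimal sketch of what a multimodal prompt looks like in practice, assuming Google’s google-generativeai Python SDK as it shipped at Gemini’s launch; the image file name and API key placeholder are illustrative, not taken from the article:

```python
import google.generativeai as genai
import PIL.Image

# Illustrative placeholder: a real key is issued through Google AI Studio.
genai.configure(api_key="YOUR_API_KEY")

# A multimodal model accepts mixed inputs (here, an image plus text)
# in a single prompt, rather than text-only or image-only input.
model = genai.GenerativeModel("gemini-pro-vision")
image = PIL.Image.open("photo.jpg")  # hypothetical local image

response = model.generate_content(
    [image, "Describe what is happening in this picture."]
)
print(response.text)
```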
Although Gemini’s capabilities might not be quite as advanced as they seemed in a viral video, which was edited together from carefully curated text and still-image prompts, it is clear that AI systems are rapidly advancing. They are heading towards an ability to handle more and more complex inputs and outputs.
To develop new capabilities, AI systems are highly dependent on the kind of “training” data they have access to. They are exposed to this data to help them improve at what they do, including making inferences such as recognising a face in a picture, or writing an essay.
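The same principle can be shown at toy scale: a model improves simply by being fitted to labelled examples, and is then judged on data it has never seen. The sketch below uses scikit-learn’s bundled digits dataset as a stand-in; it is nothing like Gemini’s actual training pipeline, just the same underlying idea in miniature:

```python
# Toy illustration of "training": expose a model to labelled data,
# then measure how well it infers labels for unseen examples.
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

digits = load_digits()  # 8x8 greyscale images of handwritten digits, with labels

X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0
)

model = LogisticRegression(max_iter=2000)
model.fit(X_train, y_train)  # "exposure" to the training data

# Accuracy on images the model was never trained on: the inference step,
# analogous to recognising a face in a new picture.
print(model.score(X_test, y_test))
```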