Google rolls out Gemini 1.5 Pro to join the AI wars

Google’s pre-generative large language model Gemini is growing up a bit with a .5 upgrade rolling out to users in over 180 countries now.

Google Gemini 1.5 Pro is available now through Google AI Studio for developers and introduces new audio and video modalities, API improvements, and a new embedded model with additional performance optimizations.

According to the Google for Developers blog the new Audio and Video Modalities include “audio (speech) understanding in both the Gemini API and Google AI Studio.”

Developers can also tap into the ability for Gemini 1.5 Pro’s support for reasoning across images (frames) and audio (speech) when videos are uploaded into the studio soon.

As for the API improvements, there are System Instructions that help to define roles, formats, goals, and rules for AI behavior in different instances. There is a JSON Mode that helps instruct models to output into JSON objects. Google also notes there are improvements to function calling so developers can assign text, function calls, or simply the function altogether.

Lastly, there is a new text embedding model present in Gemini 1.5 Pro thanks to API support that aids in stronger retrival performance, while also outperforming the previous models, according to Google.

Google is promising more Gemini API and AI Studio improvements over the coming weeks.

With OpenAI set to debut its GPT-5 model soon, we should see how Google’s latest model improvements stack up against OpenAI and subsequently Microsoft, soon.

Subscribe

Related articles

Xbox Game Pass introduces more in game benefits

Starting this week, Microsoft is rolling out more value...

ThinkPad X9 14 Aura Edition: A Fresh Take on a Classic

The ThinkPad X9 14 Aura Edition is a stunner. Gone are the days of matte black finishes and the iconic red TrackPoint nub. Instead, Lenovo has opted for a sleek, recycled aluminum chassis in a classy gray finish. It’s lightweight (just 2.74 pounds) and ultra-thin at 0.51 inches, making it a breeze to carry around.

Say Goodbye to DALL-E: ChatGPT Now Offers Built-In Image Generation

The new GPT-4o model integrates native image generation directly into ChatGPT, eliminating the need for external tools like DALL-E or Midjourney. This seamless integration marks a significant leap forward in multimodal AI capabilities, blending text and visuals into a unified experience.

Game Hubs bring achievements, stats, and events together in Xbox’s latest update.”

Microsoft is shaking things up for Xbox users with the introduction of Game Hubs, a new feature currently being tested in the Alpha Skip-Ahead Ring of the Xbox Insider Program. This update, part of the latest Xbox Update Preview, promises to enhance the gaming experience by centralizing game-related information in one convenient location. Let’s break down what’s being tested and how it might change the way you interact with your games.

Microsoft’s Photos App Just Got Smarter with Copilot – Here’s What’s New!

Remember when we speculated about when the Photos app would finally get some Copilot love? Well, the wait is officially over! Microsoft has rolled out a shiny new update for the Photos app on Windows 11 (and even Windows 10 for Insiders in the Release Preview Channel), and it’s packed with AI-powered goodness. Let’s dive into the details and see what’s new.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

WP Twitter Auto Publish Powered By : XYZScripts.com