Gemini vs Chatgpt: Google claims that its new AI model is better than GPT-4

“We introduce Gemini, the most capable and general model we’ve ever built,” says Demis Hassabis, CEO and Co-Founder of Google DeepMind

On December 6, Alphabet Inc., the parent company of Google, introduced Gemini AI, its most extensive and advanced AI model to date. This move positions the tech giant to compete with other leading players in the evolving field of artificial intelligence (AI), such as OpenAI’s GPT-4 and Meta’s Llama 2.

Gemini marks the inaugural AI model developed by Alphabet Inc. following the consolidation of its AI research entities, DeepMind, and Google Brain. The integration of these units into a unified division named Google DeepMind is under the leadership of DeepMind CEO Demis Hassabis.

“This is incredible momentum, and yet, we’re only beginning to scratch the surface of what’s possible,” Sundar Pichai, the chief executive officer of Google states in the release.

He further adds, “This new era of models represents one of the biggest science and engineering efforts we’ve undertaken as a company.”

During a briefing, Eli Collins, the Vice President of Product at Google DeepMind, highlighted that Gemini stands out as the inaugural AI model to surpass human experts in specific benchmarks related to problem-solving across diverse domains such as mathematics, physics, history, law, medicine, and ethics.

“Gemini can understand the world around us in the way that we do,” Demis Hassabis, the founder of DeepMind, Google’s AI laboratory responsible for crafting Gemini, confidently stated adding that Gemini outshines all other existing models, underscoring its superiority in the realm of artificial intelligence.

The demonstration showcased Gemini’s remarkable ability to accurately recognize and comprehend various stimuli, ranging from a person reenacting a scene from the “Matrix” movie to interpreting drawings, such as identifying a hand-drawn depiction of a duck and correlating it with a physical rubber duck.

Gemini AI is a groundbreaking creation, constructed from the ground up with a “multimodal” architecture. This distinctive feature enables Gemini to seamlessly comprehend and manipulate various forms of information concurrently, encompassing text, code, audio, images, and video.

The availability of Gemini AI comes in three distinct sizes, each tailored for specific purposes. The Ultra version is designed for handling highly intricate tasks, the Pro version is adept at scaling across a diverse array of tasks, and the Nano version is optimised for on-device tasks. This tiered approach ensures versatility and efficiency across a spectrum of applications.

Disclaimer: The views expressed in this article are those of the author and do not necessarily reflect the views of ET Edge Insights, its management, or its members

Scroll to Top