Friday

14-03-2025

Category: AI

FACTS Grounding: A new benchmark for evaluating the factuality of large language models

Our comprehensive benchmark and online leaderboard offer a much-needed measure of how accurately LLMs ground their responses in provided source…

Updating the Frontier Safety Framework

Our next iteration of the FSF sets out stronger security protocols on the path to AGI

Gemini 2.0 is now available to everyone

We’re announcing new updates to Gemini 2.0 Flash, plus introducing Gemini 2.0 Flash-Lite and Gemini 2.0 Pro Experimental.

Start building with Gemini 2.0 Flash and Flash-Lite

Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for…

The “AI Agent As Coworker” Narrative Is Nonsense

In this two-part blog series, Principal Analysts Anthony McPartlin and Seth Marrs debate the idea of AI agents as coworkers.…

Why The “AI Agent As Coworker” Narrative Is The Future

AI agents have a promising future as innovative coworkers, and their presence is already being felt despite the inflated hype…

AI Product Managers: The Role Of The Future Or Another Tool In Your Toolkit?

Though “AI PMs” might ultimately go the way of “internet product manager” from the early days of the web, it’s…

The UK Government Is Ready To Embrace AI, But Without Trust, It Risks Disaster

A commitment to trustworthy AI is paramount to keep the enthusiasm going and avoid backlash — particularly as safety takes…