Deeper cut on the Deloitte Legal and Legora UK strategic partnership. Both sides argue the point is to rewire legal operating models, not just speed up tasks.
Big-client GCs are starting to demand AI usage in RFPs and to ask for measurable cost savings. An early sign that client-side pressure is converting from anecdote to procurement language.
Sobering piece on hallucination rates (69-88% on some legal tasks), professional responsibility, and why even specialised tools need heavy verification. Good risk-side briefing.
Furlong on the widening gap between the interests of law firms and the interests of individual lawyers as AI reshapes the economics of practice. Strong read on where the two start to pull apart.
Phase 2 study pitting 13 AI platforms against in-house lawyers on contract drafting. AI models now match or beat the human baseline on several reliability metrics.
LawNext's write-up of the LegalBenchmarks study. Gemini 2.5 Pro hit 73.3% reliability vs 56.7% for human lawyers, and specialised legal AI flagged compliance risks lawyers missed entirely.
Databricks shows open-source gpt-oss-120b plus automated prompt optimisation (GEPA) beating Claude Sonnet 4 and Opus 4.1 at ~90x lower serving cost. Meaningful signal on the economics of open-model enterprise agents.
Propel tested 45 models on a complex SNAP benefits eligibility question. Latest frontier models now give accurate state-specific answers where older models confidently misled users.
Wordsmith's argument that regulatory and administrative load is now a primary drag on UK growth, and legal AI is part of the answer. Political piece but a useful read for UK-focused conversations.
Argues the comparison market is stuck between reliable-but-limited mechanistic tools and scalable-but-hallucinating GenAI. SuperComparer pitches many-to-many mechanistic comparison as the way out.
Annual pricing report. 50% of firms see rising transparency demands from clients, only 34% have updated pricing to reflect AI efficiencies, and structured matter budgeting delivers ~9% higher realisation.
FT's annual ranking and case studies. Useful view of which firms and projects the FT is crediting this year.
HBR study finding 95% of organisations see no measurable ROI on AI despite heavy adoption. "Workslop" - AI output that looks useful but isn't - is the hidden productivity tax.
Sonnet 4.5 pitched as the best coding model in the world. Significant gains on agentic tasks and computer use, same $3/$15 per million tokens as Sonnet 4.
GDPval - Evaluating AI on Real-World Tasks (PDF)
[Internal AG resource] OpenAI's GDPval benchmark covering 44 occupations across 9 sectors with realistic deliverables. Close to the R&D benchmarking approach and worth learning from.
How People Use ChatGPT (PDF)
[Internal AG resource] Research on real-world ChatGPT usage patterns - task types, length, sector breakdown. Useful for calibrating internal training and adoption programmes.
ILTA 2025 Technology Survey - Executive Summary (PDF)
[Internal AG resource] ILTA's annual tech survey exec summary. Reliable cross-firm reference for technology spend, adoption and priorities.
$150M Insight-led round plus $260M Accel/Halo-led round. Targeting AI talent and brand with a "Legal Operating Intelligence System" pitch.
SimpleDocs' AI contracting stack plus Law Insider's database of 5M+ contracts and 20M+ clauses across 50+ languages. A big contract-intelligence combination.
Covenant 2.0 unlocks historical legal data for private investors with natural-language querying, portfolio analysis and visualisation. Targets funds, not firms.
TR showcases Westlaw's iterative agentic Deep Research plus a Litigation Document Analyzer, including a feature to spot unverified citations from other parties' work product.
Legal Innovation through Frontier Technology Lab, led by Julian Nyarko and Megan Ma. Combines academic research with firm-vendor collaboration to explore how AI reshapes legal practice and access to justice.