News You Can Use

Edition 5 · 1st - 15th Nov 2024

Deep Dives

Three stories worth sitting with

OpenAI launches ChatGPT Search

What
This update combines conversational capabilities with real-time search functions. Users can query live web content and cross-check answers directly, making the tool more reliable for dynamic, up-to-date tasks like research and analysis.
So what
The possibility of integrating similar search capabilities within AGPT could elevate its potential as a real-time legal assistant, enabling on-demand access to up-to-date laws, regulations, and case law. It would also enhance stakeholder-facing interactions by allowing instant responses backed by verified data.

Prompt Improvement Template

What
The template is a structured framework to craft better prompts for AI in educational settings. It focuses on maximising AI's ability to explain complex concepts, suggest ideas, and even provide constructive feedback on student work.
So what
This is relevant to AG's internal training initiatives. Incorporating structured prompts like these could improve how we train our lawyers in prompt engineering, how we teach new legal standards, and how we give feedback on drafted contracts or reports.
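
For anyone who wants to try this in practice, here is a minimal sketch of how a structured prompt could be assembled from named parts. The five part names (role, goal, step-by-step, personalisation, constraints) follow the template as summarised under Worth Reading; the `PromptParts` class, the `build_prompt` function and the example wording are hypothetical illustrations, not part of the template itself.

```python
from dataclasses import dataclass, fields


@dataclass
class PromptParts:
    """Five sections of a structured prompt (hypothetical container)."""
    role: str
    goal: str
    step_by_step: str
    personalisation: str
    constraints: str


def build_prompt(parts: PromptParts) -> str:
    """Join the non-empty sections into one prompt, each under a label."""
    sections = []
    for f in fields(parts):
        text = getattr(parts, f.name).strip()
        if text:
            # Turn the field name into a readable label, e.g. "Step-by-step".
            label = f.name.replace("_", "-").capitalize()
            sections.append(f"{label}: {text}")
    return "\n\n".join(sections)


# Illustrative content only; adapt the wording to the task at hand.
example = PromptParts(
    role="You are a senior commercial lawyer coaching a trainee.",
    goal="Give constructive feedback on a draft limitation of liability clause.",
    step_by_step="First ask for the draft, then comment on one issue at a time.",
    personalisation="Pitch explanations at a first-year trainee.",
    constraints="Do not rewrite the clause; do not cite case law you cannot verify.",
)

print(build_prompt(example))
```

Keeping each part as a separate field makes it easy to see at a glance which section of a prompt is missing or thin before it is sent to a model.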

Harvard Study into use of GenAI by programmers

What
The findings reveal that access to Copilot leads developers to shift their focus towards core coding activities, reducing time spent on non-core project management tasks. This shift is attributed to an increase in autonomous work and a higher engagement in exploratory activities, with the most significant benefits observed among developers of relatively lower ability. The paper suggests that generative AI has the potential to flatten organisational hierarchies by enabling workers to concentrate on creative and technical tasks, thereby enhancing productivity and innovation in knowledge-intensive industries.
So what
The findings reaffirm the importance of giving people a tech stack to work alongside. Software development is a similar role built on technical expertise, and the report shows that freeing up time leads to greater engagement in activities we would class as higher value. Removing admin and low-value work from lawyers' day-to-day should have a similar effect, letting teams focus on strategic, high-value legal tasks while GenAI handles the basic work. Interestingly, the evidence that lower-ability developers improved more could indicate that there is more value in upskilling our trainees and juniors.

The Scaling Debate for LLMs

What
These articles argue that LLMs may have hit a plateau in their effectiveness and scalability due to issues like resource consumption, accuracy, and data limits. They call for alternative methods and frameworks to push AI innovation beyond current boundaries. Some of these include modular AI systems that focus on specialised tasks rather than general capabilities, integration of symbolic AI to enhance reasoning and interpretability, and low-resource models designed to reduce costs and energy demands.
So what
This presents a cautionary note as we develop AG's AI tools. It highlights the importance of not relying solely on large, generalist LLMs but also exploring hybrid approaches like combining modular AI for focused legal tasks or incorporating symbolic AI to improve reasoning in niche applications.

DLA Piper and Copilot

What
DLA Piper is leveraging Microsoft 365 Copilot to automate routine document tasks, enhance collaboration, and reduce time spent on administrative work.
So what
This example underscores the value of integrating productivity-focused AI like Copilot at AG, ensuring we stay competitive while maintaining efficiency in handling legal matters. It is worth being aware of in light of Clifford Chance's continued push recently: some firms are going "all in" with Microsoft Copilot.

Worth Reading

Everything else worth a click

OpenAI Search

OpenAI's launch of ChatGPT Search, bringing real-time web results with citations into the chat interface and putting Google directly in its sights.

Harvard Prompting Template

Ethan Mollick and Lilach Mollick's structured prompting template for teaching tasks - five-part framework (role, goal, step-by-step, personalisation, constraints) that transfers beyond education.

Anthropic Prompt Improver

Anthropic's built-in Prompt Improver in the Console - adds chain-of-thought, standardises examples, and claims a 30% accuracy uplift on a multilabel classification task in their testing.

Harvard Study - GenAI and the Nature of Work (PDF)

[Internal AG resource] Harvard Business School study on how GenAI changes the nature of knowledge work - task composition, skill premiums, and where productivity lands.

Scaling LLMs - Gary Marcus

Gary Marcus argues scaling has hit diminishing returns, pointing to Andreessen's GPU comments and The Information's reporting on GPT slowdown - the bear case in concentrated form.

So AI won't scale, what next?

James Ravenscroft's thoughtful response to the scaling-wall debate - a pragmatist's view on what practitioners should do if raw scale stops delivering and we need smarter architectures.

DLA Piper Copilot case study

Microsoft's DLA Piper Copilot case study - "coalition of the willing" rollout, up to 36 hours a week saved on content generation, with a heavy emphasis on data governance and Purview.

The Enigma of Big Law Innovation

Mark Cohen on why Big Law struggles to innovate despite its resources - partnership incentives, hourly billing, and short-termism keep firms locked into legacy models.

The psychology of viral technologies

Every's case study on why ChatGPT went viral when the underlying GPT tech had existed for years - interface and psychological accessibility beat raw capability.

Engineer Legal Newsletter

Engineer Legal's newsletter covering their HighQ plugin updates - relevant if you run HighQ and want to push it beyond out-of-the-box capability with task dashboards and reporting.

Law Professor Gives Lexis AI a Failing Grade

Professor Benjamin Perrin's scathing review of Lexis+ AI - fabricated citations, copy-pasted headnotes passed off as summaries, and criminal-vs-tort law confusion, with a recommendation not to roll it out to students.

Macfarlanes on using GenAI in DD

Macfarlanes on using GenAI for DD - summary tables across hundreds of documents in an hour versus days, with a clear insistence that expert oversight is non-negotiable.

So Much Hate for Harvey

A thoughtful essay on why Harvey generates so much schadenfreude despite comparable valuations to Casetext - brand positioning, founder optics, and the legal tech tall-poppy instinct.

Ironclad launches Jurist

Ironclad's launch of Jurist, a conversational AI legal assistant built on their open-source Rivet platform, operating inside Word with transparent multi-agent reasoning and source citations.

New Harvey Benchmark - retrieval focus

Harvey's retrieval-focused benchmark claims their system finds up to 30% more relevant content than standard embedding-based retrieval, using metadata, feature engineering, and LLM-based relevancy judgements.