Edition 3 · 1st - 13th Oct 2024

News You Can Use

Deep Dives

Three stories worth your time

OpenAI Dev Day

What

OpenAI ran a day for developers and shared what they have been working on. This includes real time voice APIs, prompt caching (which means that elements of prompts used before can be used again), model distillation, to allow fine tuning of smaller models.

So what

Better voice, prompt caching and small model fine tuning all adds up to give us better tools for building out complex agents. Prompt caching and the ability to distill models means you could set up specific agents using this framework and fine tune them to a task - like negotiating a document as per a set of rules with fine-tuned knowledge beyond that which you include in the prompt - you could then dial your agent into a Teams call and run it in real time!

Google NotebookLM

What

NotebookLM is a tool from Google that allows you to upload documents to a notebook style LLM powered window and generate AI notes, briefings or text. You can now also create podcasts!

So what

This is a good way of storing notes together and getting decent summaries, the RAG Report briefing and podcast came from NotebookLM. The podcast feature could be a good way of condensing a lot of information into a relatively easy to digest 10 minute clip. It shows progress being made and could be used to create training snippets from detailed training information, or provide real time voice Q&A as it develops. It is similar to OpenAI 'canvas' which lets you work alongside a document, mainly for writing code.

Everlaw Survey

What

A report from the Association of Corporate Counsel and Everlaw based on a survey of 475 in-house lawyers in the US. They showed that there is an appetite for GenAI, ongoing adoption and in-house seem to be leading the charge, however that most of them expect legal services to get cheaper. Whilst there was a lot of excitement about the potential on careers and work, a large group thought that their legal team is unprepared to actually address changes from GenAI.

So what

There are some good findings within this, even though it's US only a lot of the messages we hear are the same. It gives us some validation on pushing out our workshop offerings as there is a mix of excitement with a lack of confidence and understanding which some more focused training would help with. Some of the stories and feedback indicates that in-house professionals are really keen to skill up on AI, with a belief that this will support them in their role and help to bring more tasks in-house rather than outsourcing. This seems contrary to the private practice culture around GenAI at the moment.

Apple Research into LLMs

What

Research undertaken by Apple shows that LLMs do not perform as well as expected on reasoning problems. This may explain why Apple have not been as gung-ho around GenAI, but it also focuses mainly on logic and reasoning in the pure sense rather than applying to specific tasks. This research was done with open source models and some smaller models, so may not be representative of the more common / larger models.

So what

An interesting perspective for the market, one of the big tech companies slightly putting the brakes on, whilst also going all in on things like Apple Intelligence in their consumer devices. However, they saw some similar results to us, around complexity increases in input leading to decreases in quality of output, and maths not being a strong point. But where it comes to hiding information / incorrect answers within documents, this was something testing in the Deal Clarity research and GPT-4 performed well.

Worth Reading

Everything else worth a click

Our RAG Report

[Internal AG resource] Addleshaw Goddard's RAG Report - findings on retrieval-augmented generation, prompt engineering effort, and why most firms should partner with vendors rather than build in-house.

Legal Tech Trends Newsletter #33

Peter Duffy's scan - adoption data from Lexis, Stephenson Harwood's time savings, Linklaters' GenAI training, and a round of startup funding including Qura and Fynk.

Fuller look at the OpenAI Dev Day

OpenAI Dev Day 2024 hub - Realtime API for voice agents, prompt caching, vision fine-tuning, and Model Distillation, aimed squarely at developers building on the platform.

OpenAI Prompt Generator

OpenAI's built-in Prompt Generator - you describe the task, it produces a structured prompt for you, useful if you want to see how OpenAI itself recommends prompts be written.

Detail on OpenAI Canvas

OpenAI Canvas - a side-by-side editing workspace for writing and coding with ChatGPT, letting you highlight specific passages to revise rather than regenerate whole outputs.

ACC and Everlaw Survey (PDF)

[Internal AG resource] ACC and Everlaw survey on GenAI and the future of corporate legal work - adoption rates, budget expectations, and use case priorities from in-house teams.

Jeff Langlands at BT on their AI journey

Jeff Langlands at BT sets out a four-pillar framework for legal GenAI: productivity (Copilot), self-service, knowledge management, and risk - grounded in real usage data.

NotebookLM

Google's NotebookLM - upload your own documents and NotebookLM grounds its answers in them, including the viral audio-overview podcast feature.

Emotive Prompting Podcast

[Internal AG resource] A NotebookLM-generated audio overview based on our Emotive Prompting research - a working example of the podcast output format.

Apple Report details

Ars Technica's write-up of Apple's GSM-Symbolic paper arguing that LLMs are pattern-matching rather than reasoning - performance collapses when you change the variables in maths problems.