Vals AI Benchmarking Report
What
The Vals Report evaluates the performance of 4 legal AI tools - CoCounsel (Thomson Reuters), Vincent AI (vLex), Harvey Assistant (Harvey), and Oliver (Vecflow) across 7 common legal tasks. These tasks include Data Extraction, Document Q&A, Document Summarisation, Redlining, Transcript Analysis, Chronology Generation, and EDGAR Research. The study benchmarks each tool's performance against a control group of lawyers (the Lawyer Baseline). Key findings: - Harvey Assistant: Participated in six tasks, achieving top scores in five and second place in one. Outperforming the Lawyer Baseline in four tasks. - CoCounsel: Evaluated in four tasks, it consistently ranked among the top performers, with scores ranging from 73.2% to 89.6%. Highest score for Document Summarisation (77.2%). - Lawyer Baseline: Outperformed AI tools in two tasks and matched in one. In four tasks, AI surpassed the Lawyer Baseline. Task-specific insights: - Document Q&A: Harvey Assistant achieved a high score of 94.8%. - Chronology Generation: Both Harvey Assistant and the Lawyer Baseline scored 80.2%. The study concludes that legal AI tools provide significant value in legal work, particularly in tasks involving document analysis, information retrieval, and data extraction. Certain areas such as complex research tasks still require further development to meet law firm expectations.
So what
This study provides a useful benchmark for evaluating our own AI tools such as Deal Clarity. Some key areas relevant to our workflows are document analysis, Q&A and extraction that we are actively trying to deal with. Our clients will be interested in this report, where the focus on refining human-AI collaboration is key in assessing AI strengths and gaps, and may help select suitable vendors.