The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

“We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models, Claude Opus 4.5 and Gemini 3 Pro, exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 each received zero overconfidence flags across their 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.
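The overlap figures above can be checked against the 52 incorrect outputs reported earlier. The short sketch below does that arithmetic in Python. It assumes that "two or more" includes the one question missed by all three models, and that the 52 incorrect outputs refer to the three general-purpose models only. The variable names are illustrative and do not come from the white paper.

```python
# Counts reported in the release (200 questions, three general-purpose models).
missed_by_at_least_one = 40
missed_by_at_least_two = 11  # assumed to include the all-three case
missed_by_all_three = 1

# Split the cumulative figures into exact buckets.
missed_by_exactly_one = missed_by_at_least_one - missed_by_at_least_two
missed_by_exactly_two = missed_by_at_least_two - missed_by_all_three

# Each question contributes one incorrect output per model that missed it.
total_incorrect_outputs = (
    1 * missed_by_exactly_one
    + 2 * missed_by_exactly_two
    + 3 * missed_by_all_three
)

# Share of missed questions that only a single model got wrong.
single_model_share = missed_by_exactly_one / missed_by_at_least_one

print(missed_by_exactly_one, missed_by_exactly_two, total_incorrect_outputs)
print(f"{single_model_share:.1%}")
```

Under those assumptions, 29 questions were missed by exactly one model and 10 by exactly two, and the total comes to 52 incorrect outputs, which matches the figure given above.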

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.

“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years, but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed, please contact pressreleases@xpr.media.
