ARTICLE
29 January 2024

Stanford Study Finds High Percentage Of Errors Using Large Language Models In Legal Contexts

FL
Foley & Lardner
Contributor
Foley & Lardner LLP looks beyond the law to focus on the constantly evolving demands facing our clients and their industries. With over 1,100 lawyers in 24 offices across the United States, Mexico, Europe and Asia, Foley approaches client service by first understanding our clients’ priorities, objectives and challenges. We work hard to understand our clients’ issues and forge long-term relationships with them to help achieve successful outcomes and solve their legal issues through practical business advice and cutting-edge legal insight. Our clients view us as trusted business advisors because we understand that great legal service is only valuable if it is relevant, practical and beneficial to their businesses.
The study by Stanford University found that hallucinations, or the tendency of large language models (LLMs) to produce content that deviates from actual facts or well-established legal principles and precedents...
United States Technology
To print this article, all you need is to be registered or login on Mondaq.com.

The study by Stanford University found that hallucinations, or the tendency of large language models (LLMs) to produce content that deviates from actual facts or well-established legal principles and precedents, occurred from 69% to 88% of the time in response to specific legal queries.

The study applied 200,000 queries against each of GPT 3.5, Llama 2, and PaLM 2 models. Although these generative AI programs have supposedly passed bar exams, they failed at some basic tasks performed by junior attorneys. For example, in a task measuring the precedential relationship between two different cases, most LLMs do no better than random guessing. In answering queries about a court's core ruling (or holding), models were found to hallucinate at least 75% of the time.

The risks of using LLMs for legal research are especially high for:

  • Litigants in lower courts or in less prominent jurisdictions
  • Individuals seeking detailed or complex legal information
  • Users formulating questions based on incorrect premises
  • Those uncertain about the reliability of LLM responses

The findings of this study are particularly concerning given there are dozens of legal tech startups and law firms saying that they are using AI to provide better, more efficient legal services. However, given such poor performance in these tests, anyone using AI or LLMs should exercise extreme caution. The law appears to require more intelligence than artificial intelligence currently offers.

... hallucination rates range from 69% to 88% in response to specific legal queries for state-of-the-art language models. Moreover, these models often lack self-awareness about their errors and tend to reinforce incorrect legal assumptions and beliefs. These findings raise significant concerns about the reliability of LLMs in legal contexts, underscoring the importance of careful, supervised integration of these AI technologies into legal practice.

hai.stanford.edu/...

The content of this article is intended to provide a general guide to the subject matter. Specialist advice should be sought about your specific circumstances.

ARTICLE
29 January 2024

Stanford Study Finds High Percentage Of Errors Using Large Language Models In Legal Contexts

United States Technology
Contributor
Foley & Lardner LLP looks beyond the law to focus on the constantly evolving demands facing our clients and their industries. With over 1,100 lawyers in 24 offices across the United States, Mexico, Europe and Asia, Foley approaches client service by first understanding our clients’ priorities, objectives and challenges. We work hard to understand our clients’ issues and forge long-term relationships with them to help achieve successful outcomes and solve their legal issues through practical business advice and cutting-edge legal insight. Our clients view us as trusted business advisors because we understand that great legal service is only valuable if it is relevant, practical and beneficial to their businesses.
See More Popular Content From

Mondaq uses cookies on this website. By using our website you agree to our use of cookies as set out in our Privacy Policy.

Learn More