Thinking the Unthinkable:
AI in the Service of Justice
By Yonathan Arbel
Professor of Law
Director, Artificial Intelligence Initiative
A case involving a simple trampoline installation in the quiet town of Fairhope, Alabama, may unexpectedly become the catalyst for a profound shift in legal interpretation. In 2020, James Snell, owner of Outdoor Expressions, installed an in-ground trampoline for a local family. When a young girl later injured herself on the unprotected trampoline, Snell found himself embroiled in a lawsuit. His insurance company refused coverage, claiming that trampoline installation wasn’t part of the “landscaping” his policy covered.
Though seemingly straightforward, this dispute has brought to light a fundamental challenge in legal practice — the interpretation of language — potentially marking the dawn of a new era where artificial intelligence (AI) assists in the pursuit of justice and revolutionizes how we understand and apply the law.
The Challenge of Legal Interpretation
For centuries, courts have relied on a limited toolkit to interpret the meaning of words in legal contexts. This toolkit primarily consists of dictionaries, precedents, and Latin maxims like “expressio unius” (the expression of one thing implies the exclusion of others) and “contra proferentem” (ambiguous terms should be interpreted against the drafter). Judges, as native speakers, often rely on their intuition about language. However, these traditional methods have clear limitations.
Dictionaries, while useful, can be outdated and miss nuanced meanings. They cannot account for context, which is crucial in understanding language use. Precedents, while valuable for consistency, often lag behind societal changes, potentially leading to interpretations that are out of touch with current language use. Judges, despite their expertise, are frequently criticized for being disconnected from modern or vernacular language.
These limitations can have significant consequences. Misinterpretation of contract terms, statutes, or other legal documents can lead to unjust outcomes, protracted litigation, and inconsistent application of the law. The need for a more robust, context-aware method of interpretation has long been apparent, but, until recently, no viable solution has emerged.
Enter Generative Interpretation
The trampoline case was the first time that a judge used the novel approach of “generative interpretation” as a form of legal interpretation.
The essence of this approach is elegantly simple yet profound. Large language models (LLMs), trained on vast amounts of internet text data, can provide insights into how language is actually used and understood in real-world contexts. These models are exposed to more textual data during their training than any single human could read in a hundred lifetimes. Through this process, they develop a statistical understanding of language so comprehensive that some models can even pass the bar exam.
This vast reservoir of language knowledge can be tapped to generate a more comprehensive and nuanced understanding of ordinary meaning, potentially revolutionizing how we interpret legal texts. Unlike traditional methods, LLMs can provide context-aware interpretations that reflect current language use across various domains and communities.
This approach is based on my work with Professor David Hoffman of the University of Pennsylvania Law School. Soon to be published in the New York University Law Review, it develops the principles for harnessing the power of LLMs like ChatGPT to inform legal interpretation.
Judicial Recognition and Future Prospects
Back to the trampoline. In a groundbreaking concurring opinion, Judge Kevin Newsom of the U.S. Court of Appeals for the Eleventh Circuit boldly proposed what he calls a “heresy” — that AI-powered language models might inform legal interpretation of ordinary meaning. He relied extensively on our work in informing his analysis of the case.
Judge Newsom posed the pivotal question to various models like ChatGPT: “Is installing an in-ground trampoline ‘landscaping’?” The answer, by all models, was “yes.” They interpreted that in common usage with context provided by the judge, installing an in-ground trampoline would indeed be considered part of a landscaping job.
The affirmative response, while not determinative in this case due to other factors, led Judge Newsom to a profound realization: “At the very least, it no longer strikes me as ridiculous to think that an LLM like ChatGPT might have something useful to say about the common, everyday meaning of the words and phrases used in legal texts.”
This judicial recognition marks a significant milestone in the acceptance of AI-assisted legal interpretation. It suggests that the legal community is beginning to seriously grapple with the potential of these technologies to enhance, and perhaps, transform traditional legal reasoning.
Practical Applications and Best Practices
In our work, Hoffman and I sought to go beyond theory and offer practical guidance for implementing generative interpretation in legal practice. We developed best practices for using LLMs, addressing potential pitfalls like prompt bias and artificial intelligence hallucinations. We also demonstrated the method’s potential through case studies, showing how LLMs could have aided courts in resolving complex interpretative puzzles.
For instance, in cases involving the interpretation of insurance policies after Hurricane Katrina, LLMs could have provided insights into the common understanding of terms like “flood” and “storm surge.” In family court disputes over prenuptial agreements, they could offer nuanced interpretations of potentially ambiguous terms. In commercial agreements, LLMs could help clarify industry-specific jargon that might be misunderstood by generalist judges.
Importantly, generative interpretation doesn’t seek to replace traditional methods but to complement them. Where dictionaries offer static definitions and precedents may lag behind societal changes, LLMs can provide real-time insights into language usage across diverse contexts. For instance, in a case involving technical jargon, an LLM could offer nuanced context-aware interpretations based on usage in specific industries, outperforming general-purpose dictionaries.
The implementation of this approach, however, requires careful consideration. We suggest four best practices:
- Use multiple LLMs to cross-verify results, mitigating the risk of relying on a single model’s potential biases or errors.
- Clearly document prompts and responses for transparency, allowing for scrutiny and replication of the interpretation process.
- Always view LLM outputs as advisory rather than determinative, preserving the crucial role of human judgment in legal decision-making.
- Emphasize the importance of human oversight to catch potential biases or errors in AI responses.
Impact on the Legal System
This paradigm shift could significantly impact various stakeholders in the legal system. Lawyers might need to develop new skills in prompt engineering and AI interpretation, potentially changing how they prepare and argue cases. Judges could have access to more comprehensive language data, potentially leading to more informed and consistent decisions. Clients might also benefit from more predictable and consistent legal interpretations, potentially reducing litigation costs and uncertainty.
However, the adoption of generative interpretation also raises important questions about the evolving role of human judgment in law and the potential for over-reliance on AI tools. These questions will likely occupy legal philosophers and ethicists for years to come. How do we balance the benefits of AI-assisted interpretation with the need for human discretion and empathy in the legal process? How can we ensure that the use of LLMs doesn’t inadvertently perpetuate or amplify existing biases in the legal system?
Broader Implications and Future Research
This work on generative interpretation is part of a broader initiative, which I am leading as the director of AI studies at The University of Alabama School of Law. This new initiative seeks to position the Law School at the frontier of legal pedagogy in the nation. As part of this initiative, the school offers an innovative “AI for Lawyers” course, where law students build practical AI tools to assist in legal tasks.
My broader research agenda spans various aspects of AI in law, from managing systemic risks from AI, to integrating AI into the judiciary. I explore offering consumer AI tools to deal with complex legal documents and building new AI tools to simulate how juries and ordinary people determine who is the “reasonable person.” These efforts represent a comprehensive approach to understanding and shaping the role of AI in the legal system and mark the Law School's commitment to innovation in pedagogy and research.
As we stand on the brink of this new era, the potential of AI in legal interpretation is both exciting and challenging. It promises more accurate, consistent, and accessible legal interpretations, potentially reducing litigation and improving access to justice. However, it also raises important questions about the role of human judgment in law and the ethical implications of AI in legal decision-making.
Conclusion
In the end, the story of a trampoline in Fairhope, Alabama, may be remembered not for the legal dispute it sparked, but for the transformative ideas it helped to propel into the mainstream of legal thought. As we move forward, frameworks developed by scholars, and forward-thinking jurists like Judge Newsom, will undoubtedly play a crucial role in shaping the future of legal interpretation in the age of artificial intelligence.
The integration of AI into legal interpretation represents a significant shift in how we approach the law. It offers the potential to make legal interpretation more accurate, consistent, and responsive to real-world language use. At the same time, it challenges us to rethink fundamental aspects of legal reasoning and decision-making.
As this new approach gains traction, it will be crucial to monitor its impacts carefully, refine the methodologies, and ensure that the use of AI in legal interpretation serves the ultimate goal of a fair and just legal system. The symbiotic relationship, with scholarly and judicial work developing in tandem, gives us reason to be optimistic about the future of AI in law.
In the coming years, we can expect to see further developments in this field, with more courts potentially adopting AI-assisted interpretation methods, law schools incorporating AI into their curricula, and legal technology companies developing specialized tools for generative interpretation. The legal profession, known for its traditionalism, stands at the cusp of a technological revolution that could fundamentally change how we understand and apply the law.
As we navigate this new frontier, it will be essential to maintain a balance between embracing the potential of AI and preserving the human elements that are central to the practice of law. The story of generative interpretation is not just about technology; it’s about reimagining how we can make the legal system more accurate, efficient, and ultimately, more just.
