Chapter 3: Generative AI Models and Tools
Chapter Overview
In Chapter 2, we dove into the inner workings of generative AI to see how neural networks, large language models, and the transformer architecture collaborate to produce human-like text. We discovered how attention mechanisms enable these models to handle context, and why data quality, along with thoughtful human oversight, matters so much. We also explored where generative AI excels, such as piecing together known patterns, and where it falls short, like truly novel scenarios. Ultimately, we learned that understanding the “nuts and bolts” of AI helps us gauge when and how to rely on these tools responsibly in legal practice.
In this chapter, we will focus more concretely on how to apply generative AI in the real world and what’s out there in the world of generative AI tools. You will:
Compare proprietary large language models (LLMs) like ChatGPT and Claude with open-source models.
Explore the distinctions between predictive AI models and reasoning AI models, terms that you will hear often in tech and legal circles.
Discover how AI copilots differ from AI agents, and what that means for you as a future legal professional.
Examine popular AI tools, including ChatGPT 4o, Anthropic Sonnet 3.5, Google Gemini, and Perplexity, so you can recognize their unique benefits and challenges.
By the end of this chapter, you should be able to confidently explain what each of these generative AI tools does, decide when a particular solution is best suited for the task at hand, and identify the practical considerations, like cost, security, and ease of integration, that should influence your choice.
Proprietary vs. Open-Source Large Language Models (LLMs)
Imagine you’re choosing between using a commercial legal research platform that has all the bells and whistles or building a custom research tool from scratch. On one hand, the commercial platform might be easier to adopt and offer ready-made features. On the other hand, building your own system could give you greater flexibility and customization. This same dynamic plays out in the world of large language models.
Proprietary LLMs
Proprietary LLMs (like ChatGPT or Anthropic Claude) are owned and maintained by a single organization. These companies manage every aspect of the model, from how it’s trained to how users connect with it, often with user-friendly interfaces that many people can use without deep technical knowledge.
- Convenience: They usually come with robust interfaces, good customer support, and polished features.
- Accessibility: You can start using them quickly. You don’t need to manage the system on your own servers.
- Ongoing Updates: The company will release improvements, maintain the software, and handle bug fixes.
However, proprietary models do have some limitations:
- Less Transparency: Companies often don’t disclose full details about how the model was trained or what data it was fed.
- Vendor Lock-In: You might be dependent on a single vendor. If they raise prices or change policies, you have limited recourse.
- Limited Customization: Although some proprietary tools are flexible, you cannot access the underlying model weights or extensively modify the model’s behavior.
Open-Source LLMs
Open-source LLMs (like certain versions of LLaMA or other community-supported projects) make their code or model weights publicly available. This transparency allows anyone with sufficient resources and expertise to adapt the model for their needs.
- Flexibility: You can tailor the model to specific tasks or domains, like building a specialized legal research assistant that understands your firm’s style of drafting.
- Community Support: A large community of developers often contributes improvements, extensions, and bug fixes.
- Cost Control: There’s no monthly subscription if you run the model yourself, though you may incur cloud or hardware expenses.
Open-source models do have disadvantages:
- Technical Complexity: Installing, fine-tuning, and maintaining an open-source model may require in-house technical talent.
- Ongoing Maintenance: You (or your team) become responsible for updates, security patches, and bug fixes.
- Hardware Requirements: Depending on the size of the model, you might need powerful computing resources to run it effectively.
Callout: Key Term – LLM (Large Language Model)
A Large Language Model is a type of AI system trained on vast amounts of text. It learns statistical patterns of language and can generate coherent, contextually relevant responses to prompts. Think of it as a supercharged version of your phone’s autocomplete, but capable of much more nuanced understanding and creation of text.
Predictive AI Models vs. Reasoning Models
As AI continues to evolve, you may hear terms like “predictive model” or “reasoning model.” These terms help describe how an AI system processes information and the kinds of tasks it performs well. Let’s break down what each type of model does and introduce three specific examples you may encounter: GPT-4o, o1, and o3.
GPT-4o: A Predictive AI Model
GPT-4o is an example of a predictive AI model. Introduced in May 2024, it can process and generate text, images, audio, and video. That’s what we mean when we say it’s “multimodal.” Because it works with so many media types, GPT-4o can:
- Draft legal documents or motions (text).
- Transcribe depositions (audio).
- Analyze and interpret video footage (video).
- Suggest quick edits or summaries for code, legal content, or marketing materials.
Many lawyers appreciate GPT-4o for its speed. It offers faster response times and reduced operational costs compared to earlier versions. This speed is especially useful for tasks such as real-time content updates or code editing. If you need quick turnarounds, say, drafting a motion in court under time pressure, GPT-4o might be the right choice.
o1: A Reasoning AI Model
Released in September 2024, o1 shifts focus from pure prediction to reasoning. Think of it as an AI designed to “think aloud” (internally) before providing an answer. This feature, often described as a “chain of thought,” helps it tackle complex tasks more accurately than a purely predictive model.
Because o1 excels at analytical thinking, you might use it for:
- Complex coding tasks: If your firm automates document generation with custom scripts, o1 might debug or optimize that code more effectively.
- Mathematical or scientific problems: For example, analyzing detailed economic data in an antitrust case.
- Deep legal analysis: For tasks that require methodical reasoning, such as interpreting complicated statutory language or comparing multiple cases to derive legal arguments.
o3: Advanced Reasoning Model
o3, introduced in December 2024, is the successor to o1. It refines the reasoning approach even further, using an internal “private chain of thought” that allows it to map out a plan and reason through tasks before arriving at an answer.
In testing, o3 demonstrated impressive performance in challenging settings:
- Scored 87.7% on the GPQA Diamond benchmark, which includes expert-level science questions not publicly available online.
- Achieved 71.7% on the SWE-bench Verified software engineering benchmark, surpassing o1’s 48.9%.
The key takeaway? o3 is built for sophisticated problem-solving. If your legal work demands advanced reasoning, such as analyzing complex procedural rules or dissecting intricate regulatory frameworks, o3 might be the ideal tool.
Example Scenario
Context: You’re working on a patent infringement case involving cutting-edge biotech.
Predictive Approach: GPT-4o might quickly summarize the patents and relevant court rulings, or generate a draft argument.
Reasoning Approach: o1 or o3 could delve deeper into the technical details, offering a line-by-line analysis that highlights nuances in the claim language or prior art references.
In practice, you might use both. GPT-4o for speed and summarization, and o3 for deep analytical insights.
Key Differences
- GPT-4o: Multimodal, fast, and cost-effective for generating content in various formats.
- o1 and o3: Built for complex reasoning, with an internal chain-of-thought approach that helps them tackle advanced analytical tasks.
In sum, if you need quick content generation, across text, audio, and video, GPT-4o is often your best bet. If your task demands careful, methodical thinking (e.g., analyzing complicated legal issues or performing detailed code debugging), then a reasoning model like o1 or o3 is a stronger choice.
Practice Pointer
When you’re deciding between a predictive or reasoning model, consider the complexity of the legal question. Does it demand in-depth analysis (reasoning) or a rapid overview (prediction)? Think about the time constraints you have, the degree of risk in getting an answer wrong, and whether the project involves tasks that benefit from the AI “thinking” through multiple steps.
Copilots vs. Agents
Generative AI tools generally fall into two categories: copilots and agents. While these terms aren’t always used consistently in every product description, they represent helpful concepts for understanding how much autonomy the AI has in carrying out tasks.
What Is a Copilot?
Copilots function as assistants or extensions of a human operator. They enhance your workflow but still require human oversight. For example, a copilot might:
- Generate the first draft of a contract or legal memorandum.
- Summarize discovery documents or deposition transcripts.
- Check grammar and spelling or suggest clarifications in your writing.
A copilot is like a junior associate who comes prepared with a rough draft or background research for your review. You remain in control and bear responsibility for the final work product. This oversight is crucial, especially in the legal profession, where your name (and potentially your license) is on the line.
What Is an Agent?
Agents are designed to work semi-autonomously or sometimes even fully autonomously. Agents combine an LLM with a goal and instructions, knowledge, and access to tools. They can perform tasks end-to-end with minimal human intervention. In legal settings, an agent might:
- Automate client intake, collecting key information and generating initial conflict checks.
- Handle e-discovery by scanning, categorizing, and summarizing thousands of documents.
- Draft full legal documents or fill out forms based on client data.
While agents can save significant amounts of time, they also raise ethical considerations. If your agent is drafting a legal document for filing with a court, do you trust it to get everything right? If it makes a critical error, are you at risk of malpractice? For these reasons, many law firms still require a final human review, even if an agent does 90% of the legwork.
Callout: Key Term – Autonomous
When we say an AI tool is autonomous, we mean it can act on its own without constant human input. In law, this autonomy can be helpful for routine tasks but also risky if the AI makes flawed decisions. Always assess the appropriate level of human supervision.
Practical Considerations Before Choosing an AI Tool
Picking the right AI tool for your legal work isn’t just about which has the best performance metrics or the coolest features. You also need to account for the real-world practicalities of practicing law.
1. Usability
- Non-Technical Staff: In many law offices, paralegals, legal secretaries, or attorneys themselves may not have advanced technical skills. A tool with a steep learning curve might go unused.
- User Interface: Is it intuitive and easy to navigate? Does it require complex prompts or is it straightforward?
2. Integration
- Practice Management Systems: If your firm uses Clio, MyCase, or another platform, can this AI tool integrate seamlessly to avoid duplicating efforts?
- Research Platforms: Tools like Westlaw or Lexis may already have AI features. Will adopting a separate tool create redundancy or synergy?
3. Cost-Effectiveness
- Budget Constraints: Not every firm can pay high subscription fees for advanced AI features.
- Value Proposition: Is the tool going to save you enough time or reduce errors enough to justify its cost?
4. Security and Privacy
- Client Confidentiality: Lawyers must safeguard sensitive client data. Some AI tools log user queries or store data on servers you don’t control.
- Legal Compliance: Consider data protection laws, such as GDPR or HIPAA, and attorney-client privilege requirements.
Practice Pointer
Before integrating a new AI tool, draft a concise checklist of must-have features and constraints. Include factors like data security, ease of learning, and cost. Having a clear list ensures you choose a tool that aligns with your firm’s unique needs.
Popular Generative AI Tools
With the fundamentals in mind, predictive vs. reasoning, copilots vs. agents, proprietary vs. open-source, let’s now explore some specific generative AI tools making waves in the legal industry. Each tool has its strengths, weaknesses, and ideal use cases.
Benefits of AI Tools for Lawyers
Regardless of the tool you choose, here are some broad benefits you can expect from generative AI:
- Increased Productivity: Automated document review, legal research, and initial drafting free you to focus on higher-value work.
- Improved Accuracy: Consistency in language and formatting, and fewer “oops” moments where an outdated clause sneaks into a contract.
- Enhanced Client Communication: Summaries in plain language can help clients understand their cases better, and swift drafting improves responsiveness.
- Cost Savings: Reduced hours on repetitive tasks can drive down operational costs for both you and the client.
- Better Access to Information: Quick parsing of vast databases or case law can uncover critical insights faster than manual research.
ChatGPT 4o (and o1)
ChatGPT 4o, developed by OpenAI, is a multimodal AI model capable of handling text, audio, and visual inputs. Its speed is a standout feature: it can respond in near-real time, facilitating natural back-and-forth conversations.
Key Features
Multimodal Input and Output
- Process and respond to text, audio, and images.
- Potential to transform client interaction by allowing multiple input types (e.g., voice commands).
- Process and respond to text, audio, and images.
Fast Response Time
- Responds to audio inputs in as little as a few hundred milliseconds, close to human conversational speed.
Vision Mode
- Interprets visual content, which can be applied to analyzing images, diagrams, or even videos.
Reasoning Capability with o1
- o1 can “think before speaking,” adding a layer of deeper analysis for complex tasks.
Advanced Data Analysis
- Handles spreadsheets, helping lawyers track case details or financial records more efficiently.
Text Analysis
- Can review legal documents, check grammar, extract key information, and even translate content.
File Compatibility
- Supports multiple file types (PDF, .docx, .txt, .rtf, .xlsx) commonly used in legal practice.
Additional OpenAI Features
- Advanced Voice: Real-time, spoken-language interaction.
- Tasks: Schedule future actions or recurring updates, like daily docket reminders.
- GPTs: Create custom versions of ChatGPT for specialized tasks (e.g., a custom GPT for trademark law).
- Canvas: A collaborative editing space adjacent to the chat window for real-time drafting and editing.
- Projects: Organize and manage multiple files, chats, and custom instructions in a single workspace.
- Search: Access and retrieve real-time information from the internet, with source citations for transparency.
Pricing
- ChatGPT Plus: $20 per month for priority access, faster response times, and new features.
- ChatGPT Pro: $200 per month, offering expanded usage and rate limits.
Technical Aspects
- 2x Faster processing than GPT-4 Turbo.
- 5x Higher rate limits compared to GPT-4 Turbo.
Pros and Cons of Implementation
Pros | Cons |
---|---|
Increased Efficiency: Automates tasks like document review. | Potential for Bias: AI models can mirror biases in training. |
Improved Accuracy: Reduces errors in contracts and memos. | Dependence on Technology: Over-reliance can impede skill growth. |
Enhanced Client Communication: Summaries in plain language. | Data Security: Sharing sensitive info may raise confidentiality issues. |
Accessibility: Text-to-speech can help users with disabilities. | Limited Creativity: AI lacks the strategic nuance of human insight. |
Cost-Effectiveness: Long-term savings by freeing attorney hours. | Need for Human Oversight: AI-generated work still requires review. |
Anthropic Sonnet 3.5 (Claude)
Anthropic Sonnet 3.5, often referred to simply as Claude, aims to be helpful, harmless, and honest. As an ethical AI system, Claude incorporates an approach called Constitutional AI to align the model with human values.
Key Features
Constitutional AI
- A technique to reduce harmful or biased outputs by aligning the model’s decisions with a set of agreed-upon principles.
Strong Reasoning and Comprehension
- Useful for analyzing legal documents, writing research memos, and summarizing nuanced information.
Natural Language Processing
- Helps produce clearer, more concise text, benefiting legal drafting and client communications.
Context Window (200,000 tokens)
- Can handle lengthy documents without losing track, making it suitable for e-discovery or large contract reviews.
Unique Anthropic Features
- Computer Control: Claude can operate a computer autonomously, such as opening applications, browsing websites, and entering text.
- Artifacts: Creates dynamic outputs like code previews, animations, and interactive visualizations in a separate panel.
- Projects: Manage multiple chats under one umbrella with shared background info, making it ideal for complex, multi-document projects.
Pricing
- Claude Pro: $20 per month.
Technical Aspects
- Context Window: 200,000 tokens, which is larger than many other models.
- Parameters: Not publicly disclosed.
Pros and Cons of Implementation
Pros | Cons |
---|---|
Ethical AI: Reduces biased or harmful outputs. | Limited Transparency: Exact model details are undisclosed. |
Strong Reasoning and Comprehension | Cost: Still requires paid plans for extended features. |
Natural Language Processing | Potential for Over-Reliance: Always verify AI outputs. |
Large Context Window |
Google Gemini
Google Gemini is a multimodal AI model developed by Google AI. Like GPT-4o, it can process and generate text, images, audio, and even code.
Key Features
Multimodal Capabilities
- Can handle various information types, useful for analyzing scanned PDFs, images from case evidence, or audio interviews.
Long Context Window
- Gemini 1.5 Pro can handle up to 2 million tokens in one go, making it ideal for large-scale e-discovery.
Advanced Reasoning
- Built to excel at complex analysis and problem-solving tasks.
Integration with Google Services
- Syncs with Google Search, Google Docs, and other G-Suite tools, streamlining collaboration and research.
Pricing
- Google One AI Premium Plan: $19.99 per month for Gemini Advanced.
Technical Aspects
- Context Window: Up to 2 million tokens for Gemini 1.5 Pro.
- Number of Parameters: Varies by model; smaller “Nano” versions exist for lighter tasks.
Pros and Cons of Implementation
Pros | Cons |
---|---|
Multimodal Capabilities: Handles text, images, audio. | Cost: Premium plans may be prohibitive for some smaller firms. |
Long Context Window: Suitable for massive document sets. | Integration Challenges: May need technical support to integrate fully. |
Advanced Reasoning: Good for complex legal questions. | Dependence on Google: Not everyone prefers or trusts deep Google integration. |
Google NotebookLM
Google NotebookLM is an AI-powered research and note-taking tool. By uploading PDFs, Google Docs, websites, or Google Slides, NotebookLM helps you synthesize and summarize complex information. It provides inline citations so you can verify exactly where the answers are coming from.
- Automatic Summaries: Quickly distills large documents into key bullet points.
- Question Generation: Helps create study guides or discussion prompts.
- Audio Overviews: Transforms text into podcast-like audio summaries.
For law students, NotebookLM can be a handy tool to break down long cases or research papers into more digestible chunks.
Google Advanced with Deep Research
Google’s Gemini Advanced with Deep Research is designed to gather and synthesize information from the web autonomously. You input a research question, and it:
- Plans its approach to researching that question.
- Conducts iterative searches, refining results as it goes.
- Compiles a detailed report with source citations.
This is helpful when performing preliminary research on emerging legal issues. It can produce a well-organized overview, which you can then refine for briefs or internal memos.
Google Learn About
Google Learn About is an experimental AI-driven platform that provides deep, interactive learning on a wide range of topics. Built on the LearnLM model, it tailors responses to different learning levels and includes educational aids such as images, videos, and quizzes.
While not exclusively focused on law, it’s easy to imagine “Learn About Torts” or “Learn About Contract Drafting” modules that help new attorneys or clients grasp fundamental legal concepts in an engaging, interactive way.
Perplexity
Perplexity functions more like an AI-powered search engine with conversational abilities. It’s useful for research-driven tasks where you want quick answers plus source citations.
Key Features
- AI-Powered Search: Finds and summarizes relevant web pages, academic papers, and news articles.
- Source Attribution: Clearly cites its information so you can verify accuracy.
- Conversational Interface: Allows back-and-forth queries, refining your search as you go.
Because of its emphasis on credible sources, Perplexity can help lawyers or law students gather references. However, it doesn’t specialize in drafting or analyzing legal text to the same extent as GPT-4o or Claude.
Benefits for Lawyers
- Efficient Legal Research: Quick scanning of case law, statutes, and articles.
- Diverse Sources: Aggregates from multiple platforms and databases.
- Credibility Checks: Easily verify the reliability of each source.
Pricing
- Free tier with basic features.
- Perplexity Pro: $20 per month for faster responses and larger context windows.
Generative AI Tools: A Comparative Analysis
Below is a side-by-side comparison of some of the tools we’ve discussed. While these details might change as the tools evolve, this table offers a snapshot of their unique strengths.
Feature | ChatGPT 4o | o1 | Anthropic Claude 3.5 | Google Gemini | Perplexity |
---|---|---|---|---|---|
Input Modalities | Text, audio, visual | Text, visual | Text | Text, images, audio, code | Text |
Context Window | 128,000 tokens | 128,000 tokens | 200,000 tokens | Up to 2 million tokens | Up to 32,000 (Pro) |
Number of Parameters | 1.76 trillion | Not publicly disclosed | Not publicly disclosed | Varies (Nano to Pro models) | Not publicly disclosed |
Key Strengths | Multimodal, fast | Deep reasoning, complex problem-solving | Ethical AI, strong reasoning | Multimodal, large context | Efficient search, citations |
Pricing | $20/mo (Plus) / $200/mo (Pro) | Included with ChatGPT subscription | $20/mo | ~$19.99/mo + varied plans | Free or $20/mo (Pro) |
Example Scenario
Situation: A mid-sized law firm wants to adopt an AI tool to assist with e-discovery. They handle large litigation cases requiring review of millions of documents.
- Gemini with its 2 million token context window might be the most appealing for reviewing that huge volume of text.
- Claude is also attractive due to its extensive context window and ethical alignment.
- ChatGPT 4o could help draft quick summaries of findings in a more user-friendly way.
Ultimately, the firm might combine the strengths of multiple models, using Gemini or Claude for the heavy-lifting e-discovery and ChatGPT for drafting short client summaries.
Opportunities for Law Students
For law students, mastering at least a working knowledge of these tools is not optional, it’s increasingly expected. Clients today want efficient, cost-effective, and accurate legal services. When you can say, “I know how to use generative AI ethically and effectively,” you stand out in job interviews and on the job.
- Internships: Impress your supervising attorney by recommending an AI tool for a research project, or by quickly generating a well-polished first draft of an agreement.
- Career Growth: Law firms are actively seeking associates who can navigate new technologies. Demonstrating proficiency can open doors for leadership on tech-related projects.
- Practice Innovation: As you advance in your career, you can spearhead technology initiatives at your firm or legal department, improving efficiency and potentially generating new revenue streams.
Callout: Key Term – Legal Tech Literacy
“Legal tech literacy” is becoming as fundamental as knowing how to format a brief or file a motion. It’s about understanding what tools exist, how they can help, and knowing enough to ask the right questions, even if you’re not a programmer.
Chapter Recap
We covered a lot in this chapter:
Proprietary vs. Open-Source LLMs
- Proprietary models offer convenience and user support, but come with closed systems and potential vendor lock-in.
- Open-source models offer flexibility and transparency, but can be more challenging to maintain.
- Proprietary models offer convenience and user support, but come with closed systems and potential vendor lock-in.
Predictive vs. Reasoning Models
- Predictive models (like GPT-4o) generate quick, versatile responses across multiple media types.
- Reasoning models (like o1 and o3) engage in a deeper “chain-of-thought,” excelling in complex problem-solving.
- Predictive models (like GPT-4o) generate quick, versatile responses across multiple media types.
Copilots vs. Agents
- Copilots assist humans with tasks but require oversight, ideal for drafting, summarizing, and editing.
- Agents can handle tasks autonomously, which can be powerful but also raises ethical and accountability concerns.
- Copilots assist humans with tasks but require oversight, ideal for drafting, summarizing, and editing.
Practical Considerations
- Usability, integration, cost, security, and privacy are crucial when selecting an AI tool.
Popular Generative AI Tools
- ChatGPT 4o (and o1): Multimodal, fast, with specialized reasoning capabilities in o1.
- Anthropic Sonnet 3.5 (Claude): Ethical focus, large context window, strong reasoning.
- Google Gemini: Multimodal with an extremely large context window, integrated into the Google ecosystem.
- NotebookLM, Advanced with Deep Research, Learn About: Google’s suite of AI research and note-taking tools.
- Perplexity: Search-centric model with conversational and citation features.
- ChatGPT 4o (and o1): Multimodal, fast, with specialized reasoning capabilities in o1.
You’re now equipped to weigh the pros and cons of each major AI player in the legal space and have a framework for deciding whether you need a predictive or reasoning model, a copilot or an agent, and a proprietary or open-source solution.
Final Thoughts
Chapter 3 explored how generative AI models, both predictive and reasoning, are reshaping legal workflows. We looked at proprietary tools that offer ready-to-use solutions and open-source models that provide deeper customization. We also examined the difference between AI copilots, which enhance your work under human oversight, and more autonomous AI agents, which can tackle tasks with minimal input but present greater ethical and accountability concerns.
As you move forward, consider the following:
- No Model Is Infallible: Even advanced systems like GPT-4o or o3 can produce errors or biased outputs. Human review remains critical in legal contexts.
- Quality Data, Quality Results: The datasets used to train or fine-tune a model directly impact its accuracy and relevance, especially for specialized legal tasks.
- Model Size Isn’t Everything: A bigger model may handle more data, but it doesn’t always translate to better performance on nuanced legal questions.
- Ethics and Oversight: Transparency in how an AI system arrives at its results is essential, and ethical considerations, such as client confidentiality, must guide every adoption decision.
Generative AI offers the promise of faster drafting, more comprehensive research, and innovative client communications. Yet, caution is warranted: reliance on these systems without proper checks can jeopardize client trust and professional standards. By maintaining a balanced perspective, embracing AI’s efficiencies while upholding legal ethics, you can harness these tools to elevate your practice in a responsible way.
What’s Next?
In Chapter 4, we’ll look at legal-specific AI tools such as CoCounsel, Spellbook, AI.law, and Alexi. These tools directly integrate into law office tasks, like drafting briefs, analyzing contracts for risk, or synthesizing case law, often providing ready-made solutions fine-tuned for the legal profession. We’ll dive deeper into their specific features, how they integrate with existing workflows, and relevant ethical considerations. Stay tuned to learn how these specialized tools can streamline your day-to-day legal practice and transform how you approach client service.