Technical · May 12, 2026 · 10 min read

ChatGPT vs Local LLM for Sensitive Business Data: How to Decide

Most Canadian business owners I work with have asked a version of the same question: "Can I just paste this client document into ChatGPT?" Here's the framework I use with them — and why the honest answer is usually no.

The default move when generative AI lands in a small business is to open ChatGPT and paste in whatever document is on the desk. It's fast, it's free, and it produces real value. The trouble is that the documents on the desks of most Canadian SMBs — patient charts, lease agreements, financial statements, grant applications — are exactly the kinds of documents that shouldn't be pasted into a public AI service in the first place.

This isn't a hypothetical risk. It's already happening on every team I've worked with. The question isn't whether ChatGPT and tools like it have a place in your business — they do. The question is which kinds of work belong in the cloud, and which kinds need to stay on your own hardware. This is the framework I walk clients through.

The trade-off in one sentence

Cloud AI — ChatGPT, Claude, Gemini, Copilot — gives you the best raw capability at the cost of sending your data to someone else's machine. A local LLM running on your own hardware gives you full data control at the cost of running a slightly less capable model. Everything else in this debate is a footnote to that sentence.

Because the trade-off is so clean, the decision tends to be clean too. Once you accept that some categories of information must never leave your network, the question stops being "which is better?" and becomes "which tool do I use for this specific document?"

What actually happens when you paste a client document into ChatGPT

Three things happen the moment that document leaves your network:

It travels across the public internet to a third-party data centre, usually in the United States.
It is stored, at minimum temporarily, by the AI provider. Their policies tell you what they say they do with it. Their actual practice is harder to verify.
It becomes subject to the legal jurisdiction of wherever the servers live — including subpoenas, government data requests, and any future breach at the vendor.

For Canadian businesses, this matters under PIPEDA. Personal information about clients, patients, tenants, or employees is regulated. PIPEDA doesn't forbid you from sending it across a border — but it requires that you handle it with care, get appropriate consent, and remain accountable for what happens to it. "I pasted it into ChatGPT and now I have no idea what they did with it" is not a defensible position if a privacy commissioner ever asks.

Most enterprise versions of these tools (ChatGPT Enterprise, Claude for Work, Microsoft Copilot for Business) have policies that say your data is not used for model training and is retained only briefly. Those policies are real and they matter. They don't change the fact that your data has left your network, that you're trusting a vendor to honour the policy, and that you are still exposed to breaches at their end and policy changes you don't control.

When ChatGPT is the right tool

I want to be clear: cloud AI is not the villain. For the right work, it is the best tool available. Use it for:

Public information — research, learning, summarizing news articles.
Drafting work that will be public anyway — blog posts, marketing copy, social media drafts.
Brainstorming, outlining, generic templates.
Code that doesn't reveal proprietary business logic.
Anything you would be fine seeing leaked.

The simplest test I give teams: "Would you be comfortable if this exact text appeared on your competitor's blog tomorrow?" If yes, cloud AI is fine. If no, you have a different problem.

When you need a local LLM instead

Local AI is the right answer when the data itself is the asset you're trying to protect. The clearest cases I see in my Canadian client work:

Patient records, intake forms, clinical notes — anything covered by health information custodian rules in your province.
Tenant data, lease drafts, building incident reports — anything where individuals trusted you with personal information.
Legal documents, client correspondence, contracts under negotiation.
Financial records, internal memos, board materials.
Anything you're under contract not to disclose to third parties — which often quietly includes "and not to AI vendors."

For all of these, a local LLM — a language model that runs entirely on your own hardware — is the architectural answer. The model lives on your machine. Documents go in, output comes out, and nothing crosses your network boundary. There is no vendor between you and the AI.

"But how good are local LLMs, really?"

This is the question every client asks, and it deserves a straight answer.

Open-weights models like Gemma (Google's open model family) and Llama (Meta's) have made dramatic progress over the past two years. For most office work — drafting, rewriting, formatting, summarizing, answering questions about documents — modern small and mid-sized open models are very close to ChatGPT-class quality. The gap is real, but it's narrower than most people assume.

Where cloud models still win clearly: long-context reasoning across many documents, complex multi-step problem solving, and the most demanding coding tasks. Where local models are now genuinely good enough: most of what a small business actually does with AI in a given week.

The way I frame it for clients: a well-chosen local LLM running on appropriate hardware will do about 80–90% of what your team uses ChatGPT for today, just as well. The remaining 10–20% — the truly demanding work — you can still send to cloud AI, with the data discipline to make sure nothing sensitive goes with it.

The decision framework

Four questions, in order, for any document or task:

Question	If yes
Would this leak harm a client, patient, employee, or your business?	Stop. Local AI only.
Is this personal information under PIPEDA?	Stop. Local AI only.
Are you contractually forbidden from sharing it with third parties?	Stop. Local AI only.
Can you redact or abstract the sensitive parts before using cloud AI?	Cloud AI is fine, with discipline.

If the answer to all four is "no harm, not personal, not contractually restricted, no redaction needed" — go ahead and use ChatGPT. If any of the first three is yes, local AI is your default. The fourth question is the practical escape valve: most teams can do useful work in the cloud by redacting names, numbers, and identifiers before pasting. It just has to be a habit, not an afterthought.

What "running a local LLM" actually looks like

The mental picture most business owners have is a row of black servers humming in a closet. That's not what a small-business local AI setup looks like today.

For most Canadian SMBs I work with, local AI starts with a single dedicated machine — sometimes a well-spec'd laptop, sometimes a small desktop with a consumer GPU. Models in the 4-billion to 12-billion parameter range run comfortably on this kind of hardware and handle the bulk of office work. Larger models exist if you need them, but you can get a long way before you need to think about a dedicated server.

The software side has also improved dramatically. Tools like Ollama, LM Studio, and a growing crop of small-business-friendly local AI apps mean you don't need a data engineer to get started. You install something, point it at a folder of documents, and start working. For people who want a turnkey local AI tool specifically for documents, I'm building DocBee — a local AI document generator and formatter running on Gemma, designed for Canadian SMBs that need privacy without engineering overhead.

The cost reality

One of the surprises in this conversation: local AI is often cheaper, not more expensive, once you cross a small team size.

ChatGPT Enterprise and similar cloud AI plans are billed per user, per month. For a 20-person team, the annual cost runs into five figures. A capable local AI setup is a one-time hardware purchase — typically a few thousand dollars total — plus minor ongoing maintenance. The crossover point comes fast.

For very small teams (under five users), cloud AI may still be cheaper, especially if the work is light. For everyone else, the financial argument tends to favour local once you've accounted for a full year of subscriptions across the whole team.

A practical hybrid approach

The teams I see succeed with this don't pick one tool and stick to it. They run a hybrid:

Local LLM as the default for any work involving real client information, internal documents, or anything sensitive.
Cloud AI explicitly approved for marketing copy, research, brainstorming, and any work that's already public-facing.
A one-page team policy that names which tool to use for which kinds of work — so people aren't making the judgment call alone every time.

This is the architecture I recommend to almost every Canadian small business, clinic, and non-profit I work with. It captures the productivity wins of generative AI without forcing your team to choose between speed and privacy.

The verdict

If your business handles regulated, personal, or otherwise sensitive information — and almost every Canadian SMB does — your default tool for AI work should be a local LLM, not ChatGPT. Cloud AI has a place in your stack, but it should be the exception, used deliberately for work that doesn't carry privacy risk.

The good news: the technology to run local AI is now mature, affordable, and accessible to teams that don't have a dedicated IT department. The privacy guarantee you get from a model running on your own hardware is technical and architectural — not a vendor promise — and that's the kind of guarantee Canadian privacy laws were built to favour.

The harder news: figuring out which workflows belong where, sizing the hardware, picking the right model, and getting your team to actually use it requires either time you don't have or someone who's done it before. That's the work.

If this resonates and you're thinking about local AI for your team, two things you can do today: try DocBee, the local AI document tool I'm building for Canadian SMBs, or book a 30-minute call to talk through what a local AI setup would look like for your specific situation.