Unstructured Data 1: Why It Is the Risk We Can’t Ignore

We call ourselves data-driven. Dashboards glow, KPIs tick upwards, and AI pilots are everywhere.
Yet there's an elephant in the room: most of our data isn't in those dashboards at all.
Analysts estimate that 80–90% of new enterprise data is unstructured — sitting in emails, chats, files, videos, and collaboration tools (Felicis, citing Gartner & IDC).
It's large, messy, and growing fast. And when it comes to managing it, most of us would rather look away - are looking away.
I am calling out this elephant, we need to look at it.
Why It Matters — Different Viewpoints
Business & Finance: Cost Without Control
Unstructured data eats into budgets. Surveys show that 62% of organisations see opportunity in unstructured data for operational efficiency, yet nearly 70% admit they're not well equipped to leverage GenAI on it (Qlik/ETR Survey, 2024). That means spend without return, investment without clear value.
ICT & Operations: Scale Without Visibility
Industry reports highlight growth rates of 55–65% annually for unstructured data (Nexus Frontier - 2021, mind). At that pace, yesterday's manageable file share becomes today's multi-petabyte problem. ICT teams feel the pressure in backups, migrations, and infrastructure planning — but often lack clarity about what the data contains, and importantly, lack the attention and resourcing to address this.
Risk, Legal & Compliance: Exposure Without Awareness
Unstructured stores are where sensitive information lurks. A Komprise 2024 survey found that 47% of organisations rank governance and security of unstructured data as their top capability need (Komprise Report 2024). Regulation doesn't forgive ignorance: GDPR, CCPA, Privacy Act 2020 (NZ), and sector laws expect clarity and control. Without visibility, compliance is a gamble.
Strategy & AI Leaders: Promise Without Readiness
We're told AI will be transformative. But "dark data" — unstructured and underutilised — makes up around 90% of enterprise information (TechRadar, 2025). If that dark data is messy, duplicative, or unclassified, then Copilot or Gemini risk amplifying noise as much as insight. Instead of accelerating progress, AI may shine a spotlight on risks we'd prefer remained hidden.
The Blind Sages and the Elephant
An ancient parable tells of blind sages encountering an elephant for the first time.
- One touches the trunk and insists it's a snake.
- Another, the tusk — clearly a spear.
- Another, the leg — surely a tree.
Each perspective was valid. Yet without context, each was incomplete, and none grasped the full truth. That's us with unstructured data.
- Finance touches the budget and feels only the drain of cost.
- ICT touches the petabytes and feels only the complexity of scale.
- Risk & Compliance touch the regulations and feel only the exposure.
- Strategy touches AI pilots and feels only the promise wasted.
Each view is real, but partial. No one sees the whole elephant. And as long as we act separately, we misunderstand both the scale of the problem and the path forward.
Setting the Stage
This is a theme I want to lean into and discuss here. Over the coming posts, I'll explore different parts of this elephant:
- How the unstructured tide has grown so large.
- What risks it hides in plain sight.
- Why GenAI is both a curse and a blessing.
- And how all parts of the business and ICT can face the elephant together.
Step by step, we'll build a fuller picture.
Hope: A Way Forward
I am not all doom and gloom, in-fact that is why I am writing this. We can get control, we can get context, we can turn this elephant to value — together. Yes, the elephant is big. But it isn't immovable.
There's another saying: the only way to eat an elephant is one bite at a time.
That's how we must approach unstructured data. Start small:
- Tag and classify what matters most for AI readiness.
- Dispose of obvious ROT.
- Protect the highest-risk data first.
Each bite reduces risk, frees cost, and builds trust. And as business, ICT, risk, and finance align, the elephant stops being invisible. It becomes visible, manageable — and finally, valuable.
Call to Action
What part of the elephant does your organisation feel most strongly today — cost, risk, complexity, or lost opportunity?
In the next post, we'll look at where the unstructured soup comes from — and why it keeps growing faster than we can manage.
Photo by Francesco Ungaro: Pexels
