Unstructured Data 4: The Double-Edged Sword

23/11/2025

In this series, we've walked around the elephant in the room: unstructured data.

  • In the first post, we named it – this massive, invisible problem that different leaders see only in part, like the blind sages.
  • We watched the "rising tide", the relentless flood of content from the very collaboration tools we use to be productive.
  • And last week, we opened the closet and found the "skeletons" – the sensitive, outdated, and risky data lurking in the unstructured soup.

The Veil has been Cut Away

For decades, we've relied on "security by obscurity". We hoped that as long as this data was buried deep enough in old file shares and forgotten email threads, it was effectively safe. The problem was way to hard to look at, let alone address.

Now, that assumption is dead. Generative AI is here, and it's a double-edged sword.

The Curse: The Blade We Can't Ignore

The "curse" of GenAI is simple: it will find everything.

Tools like Microsoft Copilot and Google's Gemini are designed to read, summarise, and connect all the information you give them. They are not looking for key words, but semantic inference, they understand the context of the content. Importantly, they will not distinguish between a current, approved policy and an obsolete draft from 2015. They will not recognise that a spreadsheet of customer data saved in a personal chat is a privacy breach waiting to happen.

When your AI surfaces this "dark data," it doesn't just create a security risk; it creates a strategy-corrupting risk.

Your shiny new AI tool, fed on a diet of unstructured soup, will:

  • Contradict current strategy with obsolete decisions.
  • Pollute insights with outdated standards.
  • Expose sensitive IP, financial records, and personal data.

This is the sharp edge of the sword. It's the risk from Post 3, now unveiled by AI.

The Blessing: The Handle We Can Finally Grip

Here is the strategic pivot: this curse is also a profound blessing.

For years, ICT, Risk, and Compliance leaders have tried to get traction on data governance. We've used the "blind sages" parable, pointing to cost, risk, and complexity. But it's been a hard sell - an infrastructure problem, cost with no business value, not a business-critical priority.

GenAI has changed the conversation overnight.

When I talk with the C-suite they are no longer asking if we should use AI; they are asking how fast we can deploy it. They see the elephant now and they understand it is getting in their way.

Suddenly, governance is no longer a cost centre. It is the essential prerequisite for innovation. You cannot safely and effectively use Copilot or Gemini if your data estate is a minefield of skeletons.

This is the blessing: GenAI provides the single most powerful business case for information governance we have ever had. It unlocks executive focus and, crucially, executive budget. It's the forcing function that finally makes the invisible elephant visible to everyone.

The Takeaway: The Best Chance We've Ever Had

The threat of GenAI surfacing our messy, unstructured data is the best chance we've had to face the elephant and address it.

It reframes the entire problem.

  • Before: "We need to clean up our data to reduce risk." (A cost)
  • Now: "We need to govern our data to unlock AI value." (An enabler)

The sword is in our hand.

Wielded badly, it will expose every risk we've ignored.

Wielded correctly, it's the tool we can use to cut through the complexity, tame the elephant, and finally turn decades of dark data into a strategic asset.

Call to Action

The time for ignoring the elephant is over. The sword is here - do we move the metaphor to Damocles? 🤔

If you had to make your unstructured data estate AI-ready today, where would you even begin?

Photo by Zain Abba: Pexel 

I'm always on LinkedIn. Let's korero there. 

© 2025 Kefyn's Digital Life blog. All rights reserved.
Powered by Webnode Cookies
Create your website for free! This website was made with Webnode. Create your own for free today! Get started