Products

Problems
we solve

We can help your business

Request a Free Demo / trial

Insights

Insights | From a different perspective
10 March, 2025

Software Testers: Humanity’s Best Chance Against Rogue AI

Software Testers v Rogue AI

How ethical QA practices could prevent our HAL 9000 moment.

In the race to protect us against rogue AI, our best defence might not be scientists or politicians, but the often-overlooked heroes of the tech world: software testers. As AI systems increasingly mediate healthcare, criminal justice, and military decisions, this unlikely profession could hold the key to preventing existential catastrophe.

You might think this is far-fetched, but we’re at an inflection point in society and the world is poised to change more dramatically than ever.

I doubt that people living at the turn of the 19th Century had any concept of what would come over the next hundred years. And while the industrial revolution was indeed a biggy, the coming AI revolution will be more significant, impactful, and potentially more dangerous.

Just as the Industrial Revolution brought unforeseen challenges that required new safety measures, the AI revolution will demand unprecedented ethical safeguards to protect humanity.

Where engineers came to dominate the Industrial Revolution and protect people against the dangers of machines, software engineers—and testers in particular—will be uniquely positioned to safeguard humanity as AI takes greater control of our daily lives.

The Potential Threats of Rogue AI

The potential threats posed by AI are multifaceted; ranging from digital risks like large-scale cyberattacks and fraud, to societal and political risks such as the proliferation of synthetic media and deepfakes that could erode public trust and manipulate populations.

Physical risks loom as AI becomes embedded in critical infrastructure, while economic disruption through job displacement and the development of autonomous weapons raise serious ethical and security concerns.

Perhaps most alarming are the long-term existential risks posed by advanced AI systems. Some researchers warn that AI could act unpredictably or pursue goals misaligned with human values.

As AI capabilities advance rapidly, the need for ethical considerations and safeguards becomes increasingly urgent to ensure that AI development benefits humanity without inadvertently leading to catastrophic outcomes.

Current P1 Incidents Will Seem Trivial

Software defects are already a significant issue for many, but as far as I’m aware, we’ve not seen any that have come close to extinction-level events.

We’ve seen countless banks, retailers, space agencies, and game developers lose millions, if not billions, of pounds because of dodgy code. According to a report from CISQ, “For the year 2020, we determined the total Cost of Poor Software Quality (CPSQ) in the US is $2.08 trillion”.

While critical in a software sense, these costly live issues will pale into significance compared to defective AI solutions that could legitimately lead to the end of the world as we know it, because of lack of adequate controls for the AI in the decisions it can make.

The Limits of Traditional Testing

While traditional software testing ensures systems function as intended, AI introduces unpredictable variables that demand a paradigm shift. Current practices focus narrowly on validating predefined rules, leaving dangerous blind spots when applied to self-learning systems.

Generally speaking, QA teams currently focus on verifying explicit requirements:

  • Functional compliance with specifications
  • Performance under expected conditions
  • Technical bug identification and reporting

However, this approach is not enough for AI systems. AI’s capacity for emergent behaviour requires testers to expand from technical validators to ethical auditors.

The differences between traditional software testing and AI testing requirements can be thought of like this:

Traditional TestingAI Testing
Predefined inputs and outputsDynamic, evolving responses
Deterministic behaviourProbabilistic outcomes
Focus on functionalityFocus on ethics and safety
Bug-orientedBias and misalignment-oriented

The Importance of Ethical AI Testing

This shift from deterministic to probabilistic outcomes means testers must anticipate a broader range of possible behaviours, including those that may emerge unexpectedly.

The ‘HAL 9000 moment‘ mentioned in the subtitle refers to the fictional AI in ‘2001: A Space Odyssey’ that turns against its human crew to protect the mission. While only a story, this is often cited as an example of the potential dangers of advanced AI systems that follow poorly defined success criteria.

When it comes to AI, testers must employ Ethical QA that goes beyond functionality testing to assess an AI’s decision-making process, potential biases, and alignment with human values and ensure there are adequate safeguards.

After all, Don’t kill astronauts wasn’t in HAL’s spec sheet.

Bridging the Imagination Gap

Testers face a fundamental challenge: you can’t write test cases for scenarios nobody anticipated. This gap between human foresight and machine creativity demands systematic imagination.

Testers must now assess whether an AI works and whether its decisions are ethically sound and socially beneficial.

As mentioned in the intro, roles are changing, and testers must evolve to fit this new position. To fulfil their role as AI Guardians, testers will need a blend of technical expertise, ethical understanding, and domain knowledge in psychology, sociology and related areas.

I would argue that Ethical AI software testers should also:

  • Advocate for ethical requirements in early development stages
  • Treat sci-fi scenarios like 2001’s HAL rebellion as legitimate test cases
  • Push for governance frameworks ensuring human accountability

You might think that we are way off having to worry about this but are we. The AI we have access to will be generations behind what is being developed. Therefore, how do we know what it can do and the conclusions it may draw.

For instance, if you search for “what is causing the climate crisis” The AI generated answer that Google AI comes up with is:

If you were then to ask AI “how to prevent climate change” how long before it could come up with the scenario that restricting or removing human activity is the answer. We are now in a Terminator type scenario.

I get that many reading this may see this as farfetched, but is it? Many tech leaders and governments are advocating for safeguards to be built in.

Guardians of the Code, and Mankind’s Defence Against The Machines

As AI systems grow more autonomous, software testers must become vital counterweights against accidental catastrophes and deliberate misuse. Their propensity to ask “What if?” may determine whether technology elevates or destroys humanity.

The existential stakes transform testers from validators to custodians. Their new toolkit – blending sci-fi imagination with rigorous testing protocols – makes them the immune system for our AI-dependent civilization.

When the next HAL-like system inevitably emerges, it won’t be stopped by philosophers or politicians, but by a tester who noticed the ethical equivalent of a missing semicolon.

Stephen Davis
by Stephen Davis

Stephen Davis is the founder of Calleo Software, a OpenText (formerly Micro Focus) Gold Partner. His passion is to help test professionals improve the efficiency and effectiveness of software testing.

To view Stephen's LinkedIn profile and connect 

Stephen Davis LinkedIn profile

10th March 2025
What can testers learn from SpaceX

What Can Testers Learn From SpaceX?

As a test professional, I’ve seen countless projects where defects are treated as disasters rather than learning opportunities. But what if we flipped that mindset? What if software development projects embraced failure as SpaceX does—not as an end, but as the beginning of progress?

video to defect

How to Generate Defect Reports from Videos!

Testers can now convert video recordings into detailed defect reports. This groundbreaking functionality accelerates project timelines with AI-powered speed and accuracy. Not only does this technology provide the holy trinity of speed, quality and cost savings, but it also solves a huge—often unspoken—issue on many projects: the breakdown of dev/test relations at the worst possible time.

Video to Software Tests

A Testing Revolution? How to Turn Videos into Manual and Automated Test Cases

Imagine being able to record a user story and instantly turn it into manual and automated tests—how much time and effort would you save? Whether you’re preparing for SIT, UAT or streamlining regression testing, you can now generate manual and codeless automated test cases directly from video recordings, leveraging cutting-edge AI technology to streamline your testing processes.

Test Automation what's new

What’s New: Exciting Test Automation Tool Updates

As great as OpenText is at software development, it’s not always the best at keeping people informed about changes. So, today, I’m sharing a few recent updates to the OpenText automation tools. These are just a tiny sample of recently implemented changes. They focus on cloud capabilities, AI-powered object detection, codeless testing, and streamlined workflows that make test automation more accessible and efficient than ever.

Software Testing in 2030

Software Testing in 2030: 4 Ways QA Will Change

Over the next five years, software and software testing are set to evolve at a rate we’ve never seen. In fact, it has already started. Over the last few years, everyone remotely involved in tech has witnessed the constant change in the way things are done. This seemingly non-stop innovation has been driven by emerging technologies, shifting development paradigms, and businesses reevaluating their priorities… and is set to accelerate.

4 testing breakthroughs

Software Testing AI: 4 Breakthroughs You Can’t Ignore in 2025

It’s 2025 and software testing AI can no longer be ignored. AI innovations in software testing can deliver unprecedented efficiency gains and bridge the gap between manual and automated workflows. This article contains four software testing AI breakthroughs you can’t ignore in 2025.

Remote Software Testing

Remote Testing Teams: 4 Strategies to Avoid Collaboration Disaster

It’s been years since the pandemic. Still, many companies I speak to have struggled to adapt to changing practices and have failed to implement effective working habits. Unfortunately, you can’t just continue as if nothing has changed—this approach just won’t cut it anymore. In this week’s insight, I provide four actionable approaches that I have picked up from the many successful testing projects I talk to. These easy fixes will help you prevent collaboration disasters in your remote testing teams.

Top Software Lists

Exposed Why ‘Top Software’ Lists Can’t Be Trusted!

You see them everywhere. Top 10 this, top 20 that. We have all searched for lists that rank products. Whether cars, phones, software, or anything else. But how trustworthy are the ‘top software’ lists on the internet?

How to Choose A Test Management Tool

How to Choose The Right Test Management Tool

Test management tools ensure efficient, effective, and auditable testing processes. When choosing an enterprise-level test management solution, it’s essential to use a proven and trusted solution.

Insights

Search

Related Articles

InsightsTrending

To get other software testing insights, like this, direct to you inbox join the Calleo mailing list.

You can, of course, unsubscribe 

at any time!

By signing up you consent to receiving regular emails from Calleo with updates, tips and ideas on software testing along with the occasional promotion for software testing products. You can, of course, unsubscribe at any time. Click here for the privacy policy.

Sign up to receive the latest, Software Testing Insights, news and to join the Calleo mailing list.

You can, of course, unsubscribe at any time!

By signing up you consent to receiving regular emails from Calleo with updates, tips and ideas on software testing along with the occasional promotion for software testing products. You can, of course, unsubscribe at any time. Click here for the privacy policy.