Back

Is it time to start trusting the machine?

Machine learning outcomes are testable and corrective measures remain consistent, unlike in humans.

“People are not your most important asset. The right people are.” – Jim Collins, author and lecturer on company sustainability and growth.

“Are the right people in the right roles?” [This is] the single most important factor for leadership success and for organisational success.” – Gail Kelly, former CEO of Westpac

How many research papers do we need to read or edicts from top-class CEOs before we get the message that in every organisation, it all comes down to the people?

Adam Bryant who pens the terrific weekly column, The Corner Office, for the NYT has interviewed a diverse pool of leaders, and a common theme from 99 % of his interviews with CEOs is that success correlates with hiring the best team.

My former boss Tracey Fellows, CEO of the REA Group, was also fond of saying that it is ‘people’ that keeps her up at night more than any other business challenge.

Most hiring in most organisations relies 100 % on people to make those most important decisions. Yet we do so with little objective data. Instead, we have layers upon layers of bias! And to give you an idea of how many there are, here is a whooping full Wikipedia list of cognitive biases for you to check out. This article lays out in great detail a plethora of cloudy, smeary and hazy biases I didn’t know could exist.

It concludes that they are mostly unalterable and fixed, regardless of how much unconscious bias training you attend in your lifetime.


There is no scalable, efficient and reliable way to train us out of our biases. Our biases are so embedded and invisible; mostly, we just can’t ‘check ourselves’ at the moment to manage them.

So, how is that diversity hiring program going?

Read: Why a Lack of Diversity is Costing Your Business

In some functions/ departments, your “Hiring for Diversity” may be going very well. However, diversity training and hiring isn’t repeatable, where humans are involved. And, if humans could be trained out of their biases, we may get more diversity in our new hires. But then, do we know that we are getting the ‘better’ hire from the applicant pool? How CAN you tell if you have no method of reliably testing for what matters for success?

You might say we rely on CVs to give us that ‘insight’ but did you know CVs are usually crafted, designed, worded and reworded to ‘best-light’ the applicant. Ever appointed an Excel whizz, who on hire doesn’t know a pivot from a concatenate? Or even worse, who cannot apply logic, reasoning and critical thought?

We have all done this – apply crude (biased) filters to screen applications:

  • Blue-chip companies on their CV – tick!
  • Stayed in their role for two years on average – tick!
  • Promoted at least once inside of a (good) organisation – tick!
  • Good school – tick!
  • Impressive referees – tick tick tick!

Because biases appear to be so hardwired and inalterable, it is more straightforward to remove bias from algorithms than from people.

This gives AI the potential to create a future where important insights underpinning decisions such as hiring, are made more fairly.


The machine can be trained to help you make repeatable and stable decisions.

Read: Why Machines make better decisions than humans (oh and why I hate Simon Sinek)

Algorithmic bias is not the elephant in the room. Some argue that algorithms themselves have bias. The reality is that machine learning, by its very definition, is aiming to find patterns in large volumes of data, mostly latent, to support decisive actions. Removing bias is driven by what bits of training data you use to feed the machine.

You can ensure there is no (or limited) bias in the machine learning and it is all about two things:

  1. What data is being used to build the model?
  2. What are you doing to that data to build the model?

If you build models from the profile of your talent and that talent is homogenous and monochromatic, then so will be the data model and you are back to self-reinforcing hiring.

If you are using data which looks at age, gender, ethnicity and all those visible markers of bias, then, sure enough, you will amplify that bias in your machine learning. Relying on internal performance data to make people decisions, that is like layering bias-upon-bias. Similar to building a sentencing algorithm with sentencing data from the US court system, which is already biased against black men.

So instead of lumping all AI and ML into one big bucket of ‘bias’, look beneath the surface to understand what’s going into the machine as that is where amplification risks loom large.


To ensure you are using machine learning wisely, only use objective data which has no biodata (that means a big NO to CV and social media scraping). Test rigorously and adjust to learn continuously. And, be certain to use multiple machine learning models to continuously triangulate the model versus relying on one version of the truth.

Machines are better at learning this stuff.

Unlike trying to solve human bias, machine learning is repeatable, stable, consistent and most importantly, testable. The value to the organisation is of course, immense.

  • Every applicant gets a fair go at the role;
  • Every applicant is assessed;
  • Hire the person who will succeed vs someone your gut tells you will succeed;
  • Use fewer resources to hire;
  • Reduce the cost of hire.

Now that is ticking all the right boxes. It makes the possibility of objective and valid decisions available at scale, a probability.

Machine learning outcomes are testable and corrective measures remain consistent, unlike in humans.

The ability to test both training data and outcome data, continuously, allows you to detect and correct the slightest bias if it ever occurs.

Soon (maybe already) you will be putting yours, and your loved ones live in the hands of algorithms when you ride in that self-driven car. Algorithms are extensions to our cognitive ability helping us make better decisions, faster and consistently based on data, even in hiring.


To keep up to date on all things “Hiring with Ai” and Machine Learning subscribe to our blog!

You can try out Sapia’s Chat Interview right now – HERE. Else leave us your details to get a personalised demo


Blog

Neuroinclusion by design. Not by exception.

Why neuroinclusion can’t be a retrofit and how Sapia.ai is building a better experience for every candidate.

In the past, if you were neurodivergent and applying for a job, you were often asked to disclose your diagnosis to get a basic accommodation – extra time on a test, maybe the option to skip a task. That disclosure often came with risk: of judgment, of stigma, or just being seen as different.

This wasn’t inclusion. It was bureaucracy. And it made neurodiverse candidates carry the burden of fitting in.

We’ve come a long way, but we’re not there yet.

Shifting from retrofits to inclusive-by-design

Over the last two decades, hiring practices have slowly moved away from reactive accommodations toward proactive, human-centric design. Leading employers began experimenting with:

  • Sharing interview questions in advance

  • Replacing group exercises with structured simulations

  • Offering a variety of assessment formats

  • Co-designing assessments with neurodiverse candidates

But even these advances have often been limited in scope, applied to special hiring programs or specific roles. Neurodiverse talent still encounters systems built for neurotypical profiles, with limited flexibility and a heavy dose of social performance pressure.

Hiring needs to look different.

Insight 1: The next frontier of hiring equity is universal design

Truly inclusive hiring doesn’t rely on diagnosis or disclosure. It doesn’t just give a select few special treatment. It’s about removing friction for everyone, especially those who’ve historically been excluded.

That’s why Sapia.ai was built with universal design principles from day one.

Here’s what that looks like in practice:

  • No time limits — Candidates answer at their own pace
  • No pressure to perform — It’s a conversation, not a spotlight
  • No video, no group tasks — Just structured, 1:1 chat-based interviews
  • Built-in coaching — Everyone gets personalised feedback

It’s not a workaround. It’s a rework.

Insight 2: Not all “friendly” methods are inclusive

We tend to assume that social or “casual” interview formats make people comfortable. But for many neurodiverse individuals, icebreakers, group exercises, and informal chats are the problem, not the solution.

When we asked 6,000 neurodiverse candidates about their experience using Sapia.ai’s chat-based interview, they told us:

“It felt very 1:1 and trustworthy… I had time to fully think about my answers.”

“It was less anxiety-inducing than video interviews.”

“I like that all applicants get initial interviews which ensures an unbiased and fair way to weigh-up candidates.”

Insight 3: Prediction ≠ Inclusion

Some AI systems claim to infer skills or fit from resumes or behavioural data. But if the training data is biased or the experience itself is exclusionary, you’re just replicating the same inequity with more speed and scale.

Inclusion means seeing people for who they are, not who they resemble in your data set.

At Sapia.ai, every interaction is transparent, explainable, and scientifically validated. We use structured, fair assessments that work for all brains, not just neurotypical ones.

Where to from here?

Neurodiversity is rising in both awareness and representation. However, inclusion won’t scale unless the systems behind hiring change as well.

That’s why we built a platform that:

  • Doesn’t rely on disclosure

  • Removes ambiguity and pressure

  • Creates space for everyone to shine

  • Measures what matters, fairly

Sapia.ai is already powering inclusive, structured, and scalable hiring for global employers like BT Group, Costa Coffee and Concentrix. Want to see how your hiring process can be more inclusive for neurodivergent individuals? Let’s chat. 

Read Online
Blog

Skills Measurement vs Skills Inference – What’s the Difference and Why Does It Matter?

There’s growing interest in AI-driven tools that infer skills from CVs, LinkedIn profiles, and other passive data sources. These systems claim to map someone’s capability based on the words they use, the jobs they’ve held, and patterns derived from millions of similar profiles. In theory, it’s efficient. But when inference becomes the primary basis for hiring or promotion, we need to scrutinise what’s actually being measured and what’s not.

Let’s be clear: the technology isn’t the problem. Modern inference engines use advanced natural language processing, embeddings, and knowledge graphs. The science behind them is genuinely impressive. And when they’re used alongside richer sources of data, such as internal project contributions, validated assessments, or behavioural evidence, they can offer valuable insight for workforce planning and development.

But we need to separate the two ideas:

  • Skills Measurement: Directly observing or quantifying a skill based on evidence of actual performance. 
  • Skills Inference: Estimating the likelihood that someone has a skill, based on indirect signals or patterns in their data. 

The risk lies in conflating the two.

The Problem Isn’t Inference of Skills. It’s the Data Feeding It

CVs and LinkedIn profiles are riddled with bias, inconsistency, and omission. They’re self-authored, unverified, and often written strategically – for example, to enhance certain experiences or downplay others in response to a job ad. 

And different groups represent themselves in different ways. Ahuja (2024) showed, for example, that male MBA graduates in India tend to self-promote more than their female peers. Something as simple as a longer LinkedIn ‘About’ section becomes a proxy for perceived competence.

Job titles are vague. Skill descriptions vary. Proficiency is rarely signposted. Even where systems draw on internal performance data, the quality is often questionable. Ratings tend to cluster (remember the year everyone got a ‘3’ at your org?) and can often reflect manager bias or company culture more than actual output.

Sophisticated ≠ Objective

The most advanced skill inference platforms use layered data: open web sources like job ads and bios, public databases like O*NET and ESCO, internal frameworks, even anonymised behavioural signals from platform users. This breadth gives a more complete picture, and the models powering it are undeniably sophisticated.

But sophistication doesn’t equal accuracy.

These systems rely heavily on proxies and correlations, rather than observed behaviour. They estimate presence, not proficiency. And when used in high-stakes decisions, that distinction matters.

Transparency (or Lack Thereof)

In many inference systems, it’s hard to trace where a skill came from. Was it picked up from a keyword? Assumed from a job title? Correlated with others in similar roles? The logic is rarely visible, and that’s a problem, especially when decisions based on these inferences affect access to jobs, development, or promotion.

Presence ≠ Proficiency

Inferred skills suggest someone might have a capability. But hiring isn’t about possibility. It’s about evidence of capability. Saying you’ve led a team isn’t the same as doing it well. Collecting or observing actual examples of behaviour allows you to evaluate someone’s true competence at a claimed skill. 

Some platforms try to infer proficiency, too, but this is still inference, not measurement. No matter how smart the model, it’s still drawing conclusions from indirect data.

By contrast, validated assessments like structured interviews, simulations, and psychometric tools are designed to measure. They observe behaviour against defined criteria, use consistent scoring frameworks (like Behaviourally Anchored Rating Scales, or BARS), and provide a transparent, defensible basis for decision-making. In doing this, the level or proficiency of a skill can be placed on a properly calibrated scale. 

But here’s the thing: we don’t have to choose one over the other.

A Smarter Way Forward: The Hybrid Model

The real opportunity lies in combining the rigour of measurement with the scalability of inference.

Start with measurement
Define the skills that matter. Use structured tools to capture behavioural evidence. Set a clear standard for what good looks like. For example, define Behaviourally Anchored Rating Scales (BARS) when assessing interviews for skills. Using a framework like Sapia.ai’s Competency Framework is critical for defining what you want to measure. 

Layer in inference
Apply AI to scale scoring, add contextual nuance, and detect deeper patterns that human assessors might miss, especially when reviewing large volumes of data.

Anchor the whole system in transparency and validation
Ensure people understand how inferences are made by providing clear explanations. Continuously test for fairness. Keep human oversight in the loop, especially where the stakes are high. More information on ensuring AI systems are transparent can be found in this paper.

This hybrid model respects the strengths and limits of both approaches. It recognises that AI can’t replace human judgement, but it can enhance it. That inference can extend reach, but only measurement can give you higher confidence in the results.

The Bottom Line

Inference can support and guide, but only measurement can prove. And when people’s futures are on the line, proof should always win.

References

Ahuja, A. (2024). LinkedIn profile analysis reveals gender-based differences in self-presentation among Indian MBA graduates. Journal of Business and Psychology.

 

Read Online
Blog

Making Healthcare Hiring Human with Ethical AI

Hiring for care is unlike any other sector. Recruiters are looking for people who can bring empathy, resilience, and energy to the most demanding human roles. Whether it’s dental care, mental health, or aged care, new hires are charged with looking after others when they’re most vulnerable. The stakes are high. 

Hiring for care is exactly where leveraging ethical AI can make the biggest impact.

Hiring for the traits that matter

The best carers don’t always have the best CVs.

That’s why our chat-based AI interview doesn’t screen for qualifications. It screens for the the skills that matter when caring for others. The traits that define a brilliant care worker, things like:

Empathy, Self-awareness, Accountability, Teamwork, and Energy. 

The best way to uncover these traits is through structured behavioural science, delivered through an experience that allows candidates to open up. Giving candidates space to give real-life, open-text answers. With no time pressure or video stress. Then, our AI picks up the signals that matter, free from any demographic data or bias-inducing signals.

Candidates’ answers to our structured interview questions aren’t simply ticking boxes. They’re a window into how someone shows up under pressure. And they’re helping leading care organisations hire people who belong in care and those who stay.

Inclusion, built in

Inclusivity should be a core foundation of any talent assessment, and it’s a fundamental requirement for hirers in the care industry. 

When healthcare hirers use chat-based AI interviews, designed to be inclusive for all groups, candidates complete their interviews when and where they choose, without the bias traps of face-to-face or phone screening. There are no accents to judge, no assumptions, just their words and their story.

And it works:

  • 91.8% of carer candidates complete their interviews
  • Carer candidates report 9/10 Candidate Satisfaction with their interview experience 
  • 80% of candidates would recommend others to apply 
  • Every candidate receives personalised feedback, regardless of the outcome

Drop-offs are reduced, and engagement & employer brand advocacy go up. Building a brand that candidates want to work for includes providing a hiring experience that candidates want to complete. 

Real outcomes in care hiring

Our smart chat already works for some of the most respected names in healthcare and community services. Here’s a sample of the outcomes that are possible by leveraging ethical AI, a validated scientific assessment, wrapped in an experience that candidates love: 

Anglicare – a leading provider of aged care services
  • Time-to-offer dropped from 40+ days to just 14
  • Candidate pool grew by 30%
  • Turnover dropped by 63%
Abano Healthcare – Australasia’s largest dental support organisation
  • 1,213 recruiter hours saved  in the first month (67 hours per individual hiring team member) 
  • $25,000 saved in screening and interviewing time
Berry Street – a not for profit family & child services organisation
  • Time-to-hire down from 22 to 7 days
  • 95.4% of candidates completed their chat interviews

A smarter way to hire

The case study tells the full story of how Sapia.ai helped Anglicare, Abano Healthcare, and Berry Street transform their hiring processes by scaling up, reducing burnout, and hiring with heart. 

Download it here:

Read Online