Back

7 critical questions to ask when selecting your ‘Ai for Hiring’ technology

 

Interrupting bias in people decisions

We hope that the debate over the value of diverse teams is now over.  There is plenty of evidence that diverse teams lead to better decisions and therefore, business outcomes for any organisation.

This means that CHROs today are being charged with interrupting the bias in their people decisions and expected to manage bias as closely as the  CFO manages the financials.

But the use of Ai tools in hiring and promotion requires careful consideration to ensure the technology does not inadvertently introduce bias or amplify any existing biases.

To assist HR decision-makers to navigate these decisions confidently, we invite you to consider these 8 critical questions when selecting your Ai technology.

You will find not only the key questions to ask when testing the tools but why these are critical questions to ask and how to differentiate between the answers you are given.

Question 1  

What training data do you use?

Another way to ask this is: what data do you use to assess someone’s fit for a role?

First up- why is this an important question to ask …

Machine-learning algorithms use statistics to find and apply patterns in data.  Data can be anything that can be measured or recorded, e.g. numbers, words,  images, clicks etc. If it can be digitally stored, it can be fed into a machine-

learning algorithm.

The process is quite basic: find the pattern, apply the pattern.

This is why the data you use to build a predictive model, called training data, is so critical to understand.

In HR, the kinds of data that could be used to build predictive models for  hiring and promotion are:

  • CV data and cover letters
  • Games built to measure someone’s memory capacity and processing speed
  • Behavioural data, e.g. how you engage in an assessment,
  • Video Ai can capture how you act in an interview—your gestures, pose, lean, as well as your tone and cadence.
  • Your text or voice responses to structured interview questions
  • Public data sources such as your social media profile, your tweets, and other social media activity

If you consider the range of data that can be used in training data, not all data sources are equal, and on its surface, you can certainly see how some carry the risk of amplifying existing bases and the risk of alienating your candidates.

Consider the training data through these lenses:

> Is the data visible or opaque to the candidate?

Using data that is invisible to the candidate may impact your employer brand. And relying on behavioural data such as how quickly a candidate completes the assessment, social data or any data that is invisible to the candidate might expose you to not only brand risk but also a legal risk. Will your candidates trust an assessment that uses data that is invisible to them, scraped about them or which can’t be readily explained?

Increasingly companies are measuring the business cost from poor hiring processes that contribute to customer churn. 65% of candidates with a positive experience would be a customer again even if they were not hired and 81% will share their positive experience with family, friends and peers (Source: Talent Board).

Visibility of the data used to generate recommendations is also linked to explainability which is a common attribute now demanded by both governments and organisations in the responsible use of Ai.

Video Ai tools have been legally challenged on the basis that they fail to comply with baseline standards for AI decision-making, such as the OECD AI Principles and the Universal Guidelines for AI.

Or that they perpetuate societal biases and could end up penalising nonnative speakers, visibly nervous interviewees or anyone else who doesn’t fit the model for look and speech.

If you are keen to attract and retain applicants through your recruitment pipeline, you may also care about how explainable and trustworthy your assessment is. When the candidate can see the data that is used about them and knows that only the data they consent to give is being used, they may be more likely to apply and complete the process. Think about how your own trust in a recruitment process could be affected by different assessment types.

> Is the data 1st party data or 3rd party data?

1st party data is data such as the interview responses written by a candidate to answer an interview question. It is given openly, consensually and knowingly. There is full awareness about what this data is going to be used for and it’s typically data that is gathered for that reason only.

3rd party data is data that is drawn from or acquired through public sources about a candidate such as their Twitter profile. It could be your social media profile. It is data that is not created for the specific use case of interviewing for a job, but which is scraped and extracted and applied for a different purpose. It is self-evident that an Ai tool that combines visible data and 1st party data is likely to be both more accurate in the application for recruitment and have outcomes more likely to be trusted by the candidate and the recruiter.


Trust matters to your candidates and to your culture …

At PredictiveHire, we are committed to building ethical and engaging assessments. This is why we have taken the path of a text chat with no time pressure. We allow candidates to take their own time, reflect and submit answers in text format.

We strictly do not use any information other than the candidate responses to the interview questions (i.e. fairness through unawareness – algorithm knows nothing about sensitive attributes).

For example, no explicit use of race, age, name, location etc, candidate behavioural data such as how long they take to complete, how fast they type, how many corrections they make, information scraped from the internet etc. While these signals may carry information, we do not use any such data.


2. Can you explain why ‘person y’ was recommended by the Ai and not ‘person z’?

Another way to ask this is – Can you explain how your algorithm works? and does your solution use deep learning models?

This is an interesting question especially given that we humans typically obfuscate our reasons for rejecting a candidate behind the catch-all explanation of “Susie was not a cultural fit”.

For some reason, we humans have a higher-order need and expectation to unpack how an algorithm arrived at a recommendation. Perhaps because there is not much to say to a phone call that tells you were rejected for cultural fit.

This is probably the most important aspect to consider, especially if you are the change leader in this area. It is fair to expect that if an algorithm affects someone’s life, you need to see how that algorithm works.

Transparency and explainability are fundamental ingredients of trust, and there is plenty of research to show that high trust relationships create the most productive relationships and cultures.

This is also one substantial benefit of using AI at the top of the funnel to screen candidates. Subject to what kind of Ai you use, it enables you to explain why a candidate was screened in or out.

This means recruitment decisions become consistent and fairer with AI  screening tools.

But if Ai solutions are not clear why some inputs (called “features” in machine learning jargon) are used and how they contribute to the outcome,  explainability becomes impossible.

For example, when deep learning models are used, you are sacrificing explainability for accuracy. Because no one can explain how a particular data feature contributed to the recommendation. This can further erode candidate trust and impact your brand.

The most important thing is that you know what data is being used and then ultimately, it’s your choice as to whether you feel comfortable to explain the algorithm’s recommendations to both your people and the candidate.

3. What assumptions and scientific methods are behind the product? Are they validated?

Assessment should be underpinned by validated scientific methods and like all science, the proof is in the research that underpins that methodology.

This raises another question for anyone looking to rely on AI tools for human decision making – where is the published and peer-reviewed research that ensures you can have confidence that a) it works and b) it’s fair.

This is an important question given the novelty of AI methods and the pace at which they advance.

At PredictiveHire, we have published our research to ensure that anyone can investigate for themselves the science that underpins our AI solution.


INSERT RESEARCH


We continuously analyse the data used to train models for latent patterns that reveal insights for our customers as well as inform us of improving the outcomes.

4. What are the bias tests that you use and how often do you test for bias?

It’s probably self-evident why this is an important question to ask. You can’t have much confidence in the algorithm being fair for your candidates if no one is testing that regularly.

Many assessments report on studies they have conducted on testing for bias.  While this is useful, it does not guarantee that the assessment may not demonstrate biases in new candidate cohorts it’s applied on.

The notion of “data drift” discussed in machine learning highlights how changing patterns in data can cause models to behave differently than expected, especially when the new data is significantly different from the training data.

Therefore on-going monitoring of models is critical in identifying and mitigating risks of bias.

Potential biases in data can be tested for and measured.

These include all assumed biases such as between gender and race groups that can be added to a suite of tests. These tests can be extended to include other groups of interest where those group attributes are available like  English As Second Language (EASL) users.

On bias testing, look out for at least these 3 tests and ask to see the tech manual and an example bias testing report.

  • Proportional Parity Test. This is the standard EEOC measure for adverse impact on selection and recommendations.
  • Score Distribution Test. This measures whether the assessment score distributions are similar across groups of interest
  • Fairness Test. This measures whether the assessment is making the same rate of errors across groups of interest

INSERT IMAGE


At PredictiveHire, we conduct all the above tests. We conduct statistical tests to check for significant differences between groups of feature values,  model outcomes and recommendations. Tests such as t-tests, effect sizes,  ANOVA, 4/5th, Chi-Squared etc. are used for this. We consider this standard practice.

We go beyond the above standard proportional and distribution tests on fairness and adhere to stricter fairness considerations, especially at the model training stage on the error rates. These include following guidelines set by  IBM’s AI Fairness 360 Open Source Toolkit. Reference: https://aif360.mybluemix.net/) and the Aequitas project at the Centre for  Data Science and Public Policy at the University of Chicago

We continuously analyse the data used to train models for latent patterns that reveal insights for our customers as well as inform us of improving the outcomes.

5. How can you remove bias from an algorithm?

We all know that despite best intentions, we cannot be trained out of our biases. Especially the unconscious biases.

This is another reason why using data-driven methods to screen candidates is fairer than using humans.

Biases can occur in many different forms. Algorithms and Ai learn according to the profile of the data we feed it. If the data it learns from is taken from a  CV, it’s only going to amplify our existing biases. Only clean data, like the answers to specific job-related questions, can give us a true bias-free outcome.

If any biases are discovered, the vendor should be able to investigate and highlight the cause of the bias (e.g. a feature or definition of fitness) and take corrective measure to mitigate it.

  1. On which minority groups have you tested your products?

If you care about inclusivity, then you want every candidate to have an equal and fair opportunity at participating in the recruitment process.

This means taking account of minority groups such as those with autism,  dyslexia and English as a second language (EASL), as well as the obvious need to ensure the approach is inclusive for different ethnic groups, ages and genders.

At PredictiveHire, we test the algorithms for bias on gender and race. Tests can be conducted for almost any group in which the customer is interested.  For example, we run tests on “English As a Second Language” (EASL) vs. native speakers.

  1. What kind of success have you had in terms of creating hiring equity?

If one motivation for you introducing Ai tools to your recruitment process is to deliver more diverse hiring outcomes, it’s natural you should expect the provider to have demonstrated this kind of impact in its customers.

If you don’t measure it, you probably won’t improve it. At PredictiveHire, we provide you with tools to measure equality. Multiple dimensions are measured through the pipeline from those who applied, were recommended and then who was ultimately hired.

8. What is the composition of the team building this technology?

Thankfully, HR decision-makers are much more aware of how human bias  can creep into technology design. Think of how the dominance of one trait in  the human designers and builders have created an inadvertent unfair  outcome.

In 2012, YouTube noticed something odd.

About 10% of the videos being uploaded were upside down.

When designers investigated the problem, they found something unexpected:  Left-handed people picked up their phones differently, rotating them 180  degrees, which lead to upside-down videos being uploaded,

The issue here was a lack of diversity in the design process. The engineers and designers who created the YouTube app were all right-handed, and none had considered that some people might pick up their phones differently.

In our team at PredictiveHire, from the top down, we look for diversity in its broadest definition.

Gender, race, age, education, immigrant vs native-born, personality traits,  work experience. It all adds up to ensure that we minimise our collective blind spots and create a candidate and user experience that works for the greatest number of people and minimises bias.

What other questions have you used to validate the fairness and integrity of the Ai tools you have selected to augment your hiring and promotion processes?

We’d love to know!


Blog

Reinventing the Competency Framework: A Data-Driven Approach for the AI Era

We can’t hide from reality anymore. Talent needs are shifting overnight, and AI is redefining what it means to work. Traditional talent frameworks are no longer fit for purpose. At Sapia.ai, we believe the future of talent strategy lies in a smarter, fairer, and more adaptive way of defining what great looks like. 

Our AI hiring platform is built on the largest proprietary dataset of interview answers globally – we’re a data company at heart, and we’ve seen the power of data-driven people methodology in transforming how organisations hire and retain good talent.  

So, when it came to building a new Competency Framework that could be leveraged globally for hiring for any role at any scale, of course, we used a ground-up, data-led methodology that bridges the gap between organisational psychology and AI.

Why Rethink Competency Frameworks?

Conventional frameworks are typically crafted through expert interviews and focus groups. While valuable, they tend to be subjective, static, and too slow to keep pace with evolving job demands. As roles become more fluid and technology augments or replaces task-based skills, organisations need a new way to understand the human capabilities that genuinely matter for performance.

We wanted to identify enduring, job-agnostic competencies that reflect what drives success in a modern workplace – capabilities like adaptability, resilience, learning agility, and customer orientation.

(Why competencies and not just skills? Read why here.)

Our Approach: Where AI Meets I/O Psychology

Sapia.ai’s methodology is rooted in the science of human behaviour but powered by cutting-edge AI. We asked two core questions:

  1. Can we make competency discovery agile, scalable, and evidence-based?
  2. Can we use AI to automate the process without losing the rigour of traditional psychology?

The answer to both: yes.

We began with a rich dataset of over 37,000 job descriptions across industries and role types. Using large language models (LLMs) and advanced NLP techniques, we extracted over 200,000 behavioural descriptors. These were distilled down through a four-step process:

  1. Behavioural Descriptor Extraction
  2. Clustering and Labeling
  3. Cluster Analysis by I/O Psychologists
  4. Thematic Categorisation and Definition of Competencies

This resulted in a refined list of 25 human-centric competencies, each with clear behavioural indicators and practical relevance across a wide range of roles.

Built to Scale. Built to Adapt.

Our framework is intelligent, but importantly, it’s adaptive. Organisations can apply this methodology to their own job descriptions to discover custom competencies. This bottom-up, role-data-led approach ensures alignment to real work, not just theoretical models.

And because the framework integrates directly with our AI-powered hiring tools, you get a connected system that brings your talent strategy to life. 

Our framework comes to life in the following tools: 

  • Job Analyser – Starting with a job description, it creates a unique competency profile for each role to build tailored structured interviews in seconds.
  • Structured Chat-based Interviews that assess candidates’ responses according to the competency profile for consistent candidate assessment.
  • Talent Insights Reports from every interview with deep reasoning and explainability for fair and objective hiring decisions.
  • Phai Career Coach for internal mobility and employee growth that considers their competency strengths and career aspirations.

The Future of Talent Acquisition & Development is Competency-First

Skills alone cannot predict success. Competencies do. As AI continues transforming how we work, Sapia.ai’s Competency Framework offers a scalable, scientific, and fair foundation for hiring and developing the talent of tomorrow.

Want to see how it works? Download the full framework.


 

Read Online
Blog

It’s Time to Stop Hiring for Skills, and Start Hiring for Competencies

If you’re a CHRO or Head of Recruitment at an enterprise today, chances are you’ve been inundated with messages about the importance of “skills-based hiring.” LinkedIn’s recent Work Change Report (2025) is full of compelling data: a 140% increase in the rate at which professionals are adding new skills to their profiles since 2022, and a projection that by 2030, 70% of the skills used in most jobs today will have changed.

This is essential reading. But there’s a missed opportunity: the singular focus on “skills” fails to acknowledge the real metric that talent leaders need to be using to future-proof their workforce — competencies.

Skills vs Competencies: The Crucial Distinction

  • Skills are task-specific capabilities. Think Python programming, Excel, or even negotiation.

  • Soft skills refer to interpersonal or behavioural qualities like adaptability, communication, and resilience.

But skills on their own — even soft ones — are generic, disjointed, and often disconnected from real-world performance. In contrast:

  • Competencies are clusters of skills, knowledge, behaviours and abilities that are observable, measurable, and context-specific.

Put simply, competencies answer the all-important question: Can this person apply the right skills, in the right way, at the right time, to deliver results in our environment?

Why Competencies Matter More Than Ever

The Work Change Report outlines a future where job titles are fluid, roles evolve quickly, and AI is a constant disruptor. This creates three massive challenges for hiring at scale:

  1. Roles are changing faster than static skill frameworks can keep up

  2. Job candidates may have non-linear, cross-functional backgrounds

  3. The shelf-life of technical skills is shrinking rapidly

Skills alone don’t tell us whether someone can succeed in a role that will look different 12 months from now. But competencies can. Because they measure not just what a person knows, but how they apply it.

Adaptive Talent: The New Competitive Advantage

The LinkedIn report highlights a critical insight: organisations now prioritise agility in entry-level hiring. And there’s a good reason for that. With professionals expected to hold twice as many jobs over their careers compared to 15 years ago, adaptability is not just a nice-to-have. It’s core to success.

But you can’t measure agility with a keyword on a CV. You measure it by looking at competencies like:

  • Learning agility

  • Change resilience

  • Cross-functional collaboration

  • Problem-solving in ambiguous contexts

When you shift the focus away from skills to behavioural competencies that can be defined, observed, and assessed in structured ways, you open yourself up to a much more dynamic and more useful way of managing talent.

Building a Competency-Based Talent Framework

To hire effectively at scale, particularly in a technology-driven world of work, talent leaders must shift their lens:

  1. Define Role-Specific Competencies: Move beyond job descriptions based on qualifications or vague skill sets. Break roles down into measurable competencies that reflect current and emerging performance expectations. This step is crucial for organisations to be able to accurately assess role-fit in the next stages. Sapia.ai does this automatically, taking job descriptions and building role-specific competency models in seconds.

  2. Assess Competencies Fairly and Objectively: Use structured behavioural interviews, ideally at scale. These provide a much more accurate picture of a candidate’s readiness than self-reported skills or credentials. Sapia.ai’s AI powered interviews enable competency assessment, at scale.

  3. Build Pathways for Development and Internal Mobility: A competency framework makes it easier to identify transferable strengths, development gaps, and future-fit potential. It gives employees clarity on how to grow within the business. Using an AI-powered coach can help ensure that talent is being continuously developed against the organisation’s competency framework.

The Future of Work Requires Depth, Not Just Breadth

LinkedIn’s data shows that people are learning more skills more quickly than ever. But the real question for talent leaders like you is: Are those skills being applied in ways that drive value? Are we hiring for task proficiency or performance?

The truth is that the organisations that will thrive in an AI-driven, skills-fluid economy aren’t the ones chasing the next hot skill. They’re the ones designing systems to identify, develop and scale competence.

Keen to Shift to Competencies, but Lacking a Framework? 

Sapia.ai has developed a comprehensive Competency Framework using a data-driven approach. Download the full paper here.


 

Read Online
Blog

The AGC Debate: Are AI-Written Interview Answers a Red Flag or Smart Strategy?

Every day, we read stories of increased fake or AI-assisted applications. Tools like LazyApply are just one of many flooding the market, driving up applicant volumes to never-before-seen levels. 

As an overwhelmed hiring function, how do you find the needle in the haystack without using an army of recruiters to filter through the maze?

At Sapia.ai, we help global enterprises do just that. Many of the world’s most trusted brands, such as Qantas Group, have relied on our hiring platform as a co-pilot for better hiring since 2020. 

Our Chat Interview has given millions of candidates a voice they wouldn’t have had – enabling them to share in their own words why they’re the best fit for the role. To find the people who belong with their brands, our customers must trust that their candidates represent themselves. Thus, they want to trust that our AI is analysing real human answers—not answers from a machine.  

The Rise of GPT 

When ChatGPT went viral in November 2022, we immediately adopted a defensive strategy. We had long been flagging plagiarised candidate responses, but then, we needed to act fast to flag responses using artificially generated content (‘AGC’). 

Many companies were in the same position, but Sapia.ai was the only company with a large proprietary data set of interview answers that pre-dated GPT and similar tools: 2.5 billion words written by real humans. 

That data enabled us to build a world-first:- an LLM-based AGC detector for text-based interviews, recently upgraded to v2.0 with 99% accuracy and a false positive rate of 1%. An NLP classification model built on Sapia.ai proprietary data that operates across all Sapia.ai chat interviews.

Full Transparency with Candidates

Because we value candidate trust as much as customer trust, we wanted to be transparent with candidates about our ability to detect artificially generated content (AGC). As an LLM, we could identify AGC in real time and warn candidates that we had detected it. 

This has had a powerful impact on candidate behaviour. Since our AGC detector went live, we have seen that the real-time flagging acts as a real-time disincentive to use tools like ChatGPT to generate interview responses. 

The detector generates a warning if 3 or more answers are flagged as having artificially generated content. The Sapia.ai Chat Interview uses 5 open-ended interview questions for volume hiring roles, such as retail, contact centre, and customer service, and 6 questions for professional roles, such as engineers, data scientists, graduates, etc.

Let’s Take a Closer Look at the Data… 

We see that using our AGC detector LLM to communicate live with candidates in the interview flow when artificial content has been detected has a positive effect on deterring candidates from using AI tools to generate their answers. 

The rate of AGC use declines from 1 question flagged to 5 questions – raising the flag on one question is generally enough to deter candidates from trying again. 

The graph below shows the number of candidates, from a total of almost 2.7m, that used artificially generated content in their answers.  

Differences in AGC Usage Rate by Groups 

We see no meaningful differences in candidate behaviour based on the job they are applying for or based on geography.

However, we have found differences by gender and ethnicity – for example, men use artificially generated content more than women. The graph below shows the overall completion ratios by gender – for all interviews on the left and for interviews where the number of questions with AGC detected is 5 or more on the right. 

Perception of Artificially Generated Content by Hirers. 

We’re curious to understand how hirers perceive the use of these tools to assist candidates in a written interview. The creation of the detector was based on the majority of Sapia.ai customers wanting transparency & explainability around the use of these tools by candidates, often because they want to ensure that candidates are using their own words to complete their interviews and they want to avoid wasting time progressing candidates who are not as capable as their chat interview suggests.  

However, some of our customers feel that it’s a positive reflection of the candidate, showing that they are using the tools available to them to put their best foot forward. 

It’s a mix of perspectives. 

Our detector labels it as the use of artificially generated content. It’s up to our customers how they use that information in their decision-making processes. 

This concept of having a human in the loop is one of the key dimensions of ethical AI, and we ensure that it is used in every AI-related hiring product we build. 

Interested in the science behind it all? Download our published research on developing the AGC detector 👇

Research Paper Download: AI Generated Content in Online Text-based Structured Interviews

Read Online