Artificial Intelligence-based interview scoring learns from past interview answers, which makes it hard for it to determine if a candidate is legitimately answering the question if their response includes context or an example rarely seen in training data.
Moreover, AI interview platforms may be susceptible to adversarial inputs where an irrelevant answer may receive a high score. Both scenarios raise fairness concerns and can erode trust in AI job interviews (Madaio et al, 2020).
This is why identifying outliers that differ significantly from the majority of answers and flagging them for manual review become crucial steps toward responsible and fair use of AI interview software. While simple rule-based methods (Reiz and Pongor, 2011) could help filter out some irrelevant answers based on answer length and regular expressions, these methods do not take into account the context and content of the answer and question. Someone may describe a very unique, yet relevant, situation in response to an AI for interviews question, which you wouldn’t want to disregard.
In this study, we introduce an unsupervised, question-aware, multi-context outlier detection model that can help detect anomalous answers contextually and semantically. The unsupervised approach is deemed to be more practical compared to a supervised model that requires a large labeled dataset of outlier answers. It helps bootstrap an outlier detector that can then be enhanced through human feedback.
We tested the outlier model to ascertain how well it is able to correctly identify 177,691 actual hired candidate interview answers from outliers, (e.g., movie reviews, news articles, nonsensical text, and sentences generated using BERT (Vaswani et al, 2017) with random starting words).
Our model outperformed the baseline One-class SVM outlier detector (Li et al, 2003), in detecting outliers from actual interview answers. The performance of our model over the baseline unsupervised model can be explained by both question-aware learning and multi-context learning, which help yield better contextual representations for detecting outlier answers from typical interview answers.
We also conducted a human evaluation on 10,689 interview answers of candidates who were not hired and might have provided outlier answers. Our model predicted 0.16% of the answers as outliers with only 5.9% of them being false positives. All of these false predictions describe contexts related to family and personal life in their answers but are relevant to the question. It is reasonable that these answers are labeled as an outlier by our model since they are contextually and semantically different from most interview answers.
While a data-driven AI interviewer can help counter flaws in human interviewers, answers that are significantly different to training data can lead to spurious predictive outcomes. In this study, we show how a
References:
Dai, Y., Qi, J., & Zhang, R. (2020). Joint recognition of names and publications in academic homepages. In Proceedings of the 13th International Conference on Web Search and Data Mining (pp. 133-141).
Li, K. L., Huang, H. K., Tian, S. F., & Xu, W. (2003). Improving one-class SVM for anomaly detection. In Proceedings of the 2003 international conference on machine learning and cybernetics (IEEE Cat. No. 03EX693) (Vol. 5, pp. 3077-3081). IEEE.
Madaio, M. A., Stark, L., Wortman Vaughan, J., & Wallach, H. (2020). Co-designing checklists to understand organizational challenges and opportunities around fairness in AI. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-14).
Reiz, B., & Pongor, S. (2011). Psychologically Inspired, Rule-Based Outlier Detection in Noisy Data. In SYNASC (pp. 131-136)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., … & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Walk into any store this festive season and you’ll see it instantly. The lights, the displays, the products are all crafted to draw people in. Retailers spend millions on campaigns to bring customers through the door.
But the real moment of truth isn’t the emotional TV ad, or the shimmering window display. It’s the human standing behind the counter. That person is the brand.
Most retailers know this, yet their hiring processes tell a different story. Candidates are often screened by rigid CV reviews or psychometric tests that force them into boxes. Neurodiverse candidates, career changers, and people from different cultural or educational backgrounds are often the ones who fall through the cracks.
And yet, these are the very people who may best understand your customers. If your store colleagues don’t reflect the diversity of the communities you serve, you create distance where there should be connection. You lose loyalty. You lose growth.
We call this gap the diversity mirror.
When retailers achieve mirrored diversity, their teams look like their customers:
Customers buy where they feel seen – making this a commercial imperative.
The challenge for HR leaders is that most hiring systems are biased by design. CVs privilege pedigree over potential. Multiple-choice tests reduce people to stereotypes. And rushed festive hiring campaigns only compound the problem.
That’s where Sapia.ai changes the equation: Every candidate is interviewed automatically, fairly, and in their own words.
With the right HR hiring tools, mirrored diversity becomes a data point you can track, prove, and deliver on. It’s no longer just a slogan.
David Jones, Australia’s premium department store, put this into practice:
The result? Store teams that belong with the brand and reflect the customers they serve.
Read the David Jones Case Study here 👇
As you prepare for festive hiring in the UK and Europe, ask yourself:
Because when your colleagues mirror your customers, you achieve growth, and by design, you’ll achieve inclusion.
See how Sapia.ai can help you achieve mirrored diversity this festive season. Book a demo with our team here.
Mirrored diversity means that store teams reflect the diversity of their customer base, helping create stronger connections and loyalty.
Seasonal employees often provide the first impression of a brand. Inclusive teams make customers feel seen, improving both experience and sales.
Adopting tools like AI structured interviews, bias monitoring, and data dashboards helps retailers hire fairly, reduce screening time, and build more diverse teams.
Organisations invest heavily in their employer brand, career sites, and EVP campaigns, especially to attract underrepresented talent. But without the right data, it’s impossible to know if that investment is paying off.
Representation often varies across functions, locations, and stages of the hiring process. Blind spots allow bias to creep in, meaning underrepresented groups may drop out long before offer.
Collecting demographic data is only step one. Turning it into insight you can act on is where real change and better hiring outcomes happen.
The Diversity Dashboard in Discover Insights, Sapia.ai’s analytics tool, gives you real-time visibility into representation, inclusion, and fairness at every stage of your talent funnel. It helps you connect the dots between your attraction strategies and actual hiring outcomes.
Key features include:
With the Diversity Dashboard, you can pinpoint where inclusion is thriving and where it’s falling short.
It’s also a powerful tool to tell your success story. Celebrate wins by showing which underrepresented groups are making the biggest gains, and share that progress with boards, executives, and regulators.
Powered by explainable AI and the world’s largest structured interview dataset, your insights are fair, auditable, and evidence-based.
Measuring diversity is the first step. Using that data to take action is where you close the Diversity Gap. With the Diversity Dashboard, you can prove your strategy is working and make the changes where it isn’t.
Book a demo to see the Diversity Dashboard in action.
Why neuroinclusion can’t be a retrofit and how Sapia.ai is building a better experience for every candidate.
In the past, if you were neurodivergent and applying for a job, you were often asked to disclose your diagnosis to get a basic accommodation – extra time on a test, maybe the option to skip a task. That disclosure often came with risk: of judgment, of stigma, or just being seen as different.
This wasn’t inclusion. It was bureaucracy. And it made neurodiverse candidates carry the burden of fitting in.
We’ve come a long way, but we’re not there yet.
Over the last two decades, hiring practices have slowly moved away from reactive accommodations toward proactive, human-centric design. Leading employers began experimenting with:
But even these advances have often been limited in scope, applied to special hiring programs or specific roles. Neurodiverse talent still encounters systems built for neurotypical profiles, with limited flexibility and a heavy dose of social performance pressure.
Hiring needs to look different.
Truly inclusive hiring doesn’t rely on diagnosis or disclosure. It doesn’t just give a select few special treatment. It’s about removing friction for everyone, especially those who’ve historically been excluded.
That’s why Sapia.ai was built with universal design principles from day one.
Here’s what that looks like in practice:
It’s not a workaround. It’s a rework.
We tend to assume that social or “casual” interview formats make people comfortable. But for many neurodiverse individuals, icebreakers, group exercises, and informal chats are the problem, not the solution.
When we asked 6,000 neurodiverse candidates about their experience using Sapia.ai’s chat-based interview, they told us:
“It felt very 1:1 and trustworthy… I had time to fully think about my answers.”
“It was less anxiety-inducing than video interviews.”
“I like that all applicants get initial interviews which ensures an unbiased and fair way to weigh-up candidates.”
Some AI systems claim to infer skills or fit from resumes or behavioural data. But if the training data is biased or the experience itself is exclusionary, you’re just replicating the same inequity with more speed and scale.
Inclusion means seeing people for who they are, not who they resemble in your data set.
At Sapia.ai, every interaction is transparent, explainable, and scientifically validated. We use structured, fair assessments that work for all brains, not just neurotypical ones.
Neurodiversity is rising in both awareness and representation. However, inclusion won’t scale unless the systems behind hiring change as well.
That’s why we built a platform that:
Sapia.ai is already powering inclusive, structured, and scalable hiring for global employers like BT Group, Costa Coffee and Concentrix. Want to see how your hiring process can be more inclusive for neurodivergent individuals? Let’s chat.