Discrimination based on race and ethnicity in personnel selection is a well known and pervasive issue highlighted in numerous studies (Bertrand & Mullainathan, 2004; Kline et al., 2021; Pager et al., 2009).
Most of these studies report name-based inference of race and ethnicity by human reviewers leading to differential outcomes in the recruitment process. Linguistic racism is a form of discrimination that occurs based on one’s use of language, especially English (De Costa, 2020), and is highly associated with race and ethnicity.
As machine learning models are adopted to automate tasks like interview scoring, race or ethnicity encoded signals in language can lead to biased outcomes, if not mitigated. Hence understanding the level of ethnicity encoded signals in language is important when building natural language-based machine learning models in order to avoid biased outcomes, for example by using feature scores rather than raw text to score responses (Jayaratne, Jayatilleke, Dai, 2022).
In this work, we sought to quantify and compare the amount of ethnicity encoded information in over 300,000 candidates’ raw text interview responses to language-derived feature scores, including personality, behavioral competencies, and communication skills.
First, we trained machine learning models to predict candidate ethnicity from raw-text chat interview responses. Specifically, we trained an Attention-Based Bidirectional Long Short-Term Memory (Attn-BiLSTM) (Zhou et al., 2016) model for predicting ethnicity from textual responses.
Secondly, we tested the same for the language-derived features used in the automated scoring of the interview responses. We trained multiple models using a variety of machine learning algorithms (a linear model, tree models with bagging and boosting, and a neural network model with a single hidden layer) suitable for tabular data for predicting ethnicity from the 21 derived features.
Each model was then used to predict ethnicity for the 10% of the sample left out of the training dataset. The results from the classification tasks show a clear distinction between the ability to infer ethnicity based on natural language and inferred features. As hypothesized, we found that features derived according to a clearly defined rubric contain significantly less ethnicity information compared to raw candidate responses. That is, the models based on derived features recorded consistently weaker accuracy, precision, recall, and F1 values across all models compared to the model for the raw text candidate responses.
This research demonstrates the benefit of using algorithmically derived feature values in mitigating ethnicity related biases when scoring structured interview responses. Specifically, our results show that natural language responses to interview questions carry higher amounts of ethnicity information compared to features derived according to a clearly defined rubric for assessing interview responses. This further strengthens the case for using structured interviews that have been shown to reduce bias over unstructured interviews (Levashina et al., 2014) with much stronger criterion validity (Sackett et al., 2021).
References:
Bertrand, M., & Mullainathan, S. (2004). Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination. American Economic Review, 94(4), 991–1013.
De Costa, P. I. (2020). Linguistic racism: Its negative effects and why we need to contest it. International Journal of Bilingual Education and Bilingualism, 23(7), 833–837.
Jayaratne, M., Jayatilleke, B., & Yimeng Dai (2022). Identifying and mitigating gender bias in structured interview responses [Paper presentation]. 2022 Society for Industrial Organizational Psychology Conference. Seattle, Washington, United States.
Kline, P. M., Rose, E. K., & Walters, C. R. (2021). Systemic Discrimination Among Large U.S. Employers (NBER Working Papers No. 29053). National Bureau of Economic Research, Inc.
Levashina, J., Hartwell, C. J., Morgeson, F. P., & Campion, M. A. (2014). The Structured Employment Interview: Narrative and Quantitative Review of the Research Literature. Personnel Psychology, 67(1), 241–293.
Pager, D., Western, B., & Bonikowski, B. (2009). Discrimination in a Low-Wage Labor Market: A Field Experiment. American Sociological Review, 74(5), 777–799.
Sackett, P. R., Zhang, C., Berry, C. M., & Lievens, F. (2021). Revisiting meta-analytic estimates of validity in personnel selection: Addressing systematic overcorrection for restriction of range. Journal of Applied Psychology.
Zhou, P., Shi, W., Tian, J., Qi, Z., Li, B., Hao, H., & Xu, B. (2016). Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 207–212.
Retail leaders have embraced AI to improve supply chains, automate checkout, and enhance customer experience. But what about finding the people who deliver that customer experience?
AI brings incredible possibilities to supercharge how retailers hire, develop, and retain talent.
At Sapia.ai, we helped iconic retailers like Woolworths, Starbucks, Holland & Barrett, and David Jones reimagine hiring from the ground up – replacing resumes, ghosting, and gut feel with structured, ethical AI that delivers performance and fairness at scale.
The Retail Problem: Volume, Turnover, and Ghosting
Retail is high volume. It’s high churn. And it’s high stakes for candidate experience:
And yet, most hiring still relies on broken tools: resumes, forms, manual processes, and outdated systems.
Sapia.ai: The AI-Native Hiring Engine Built for Retail
Our platform automates the entire “apply to decide” journey, leveraging AI & automation to streamline the hiring process & bring intelligence into retail hiring.
Smart Interviewer™: Mobile-first, chat-based, structured interviews for a holistic candidate assessment.
Live Interview™: AI-driven bulk interview scheduling without calendar chaos.
InterviewAssist™: Instant interview guide generation.
Discover Insights: Embedded analytics to track hiring health in real-time.
Phai: GenAI coach for career and leadership potential.
Unlike resume parsing or generic chatbots, Sapia.ai assesses soft skills, communication, and culture fit using natural language processing and validated psychometrics. It’s ethical AI built in, not bolted on.
From Application to Interview in Under 24 Hours
Candidates don’t want to wait. They don’t want to be ghosted. And they don’t want resumes to define them.
> 80% of Sapia.ai chat interviews are completed in under 24 hours.
We see consistently high completion across categories: grocery, merchandising, home improvement, and luxury retail.
“It was fast, fair, and I actually got feedback. That never happens.” – Retail Candidate Feedback
Real Impact, Across Every Retail Category
Sapia.ai powers hiring for millions of candidates across diverse retail environments:
Impact of Sapia.ai on Retail Hiring in 2024 | |||
Category | Hours Saved | FTEs Saved | Cost Saved |
Grocery | 272k | 131 | $6.5m |
General Merchandise | 193k | 93 | $4.6m |
Specialty Retail | 133k | 64 | $3.2m |
Home Improvements | 103k | 50 | $2.5m |
Merchandising | 22k | 11 | $0.5m |
Luxury | 9k | 4 | $0.2m |
The savings created by intelligent, AI-native automation have unlocked team capacity, impacted retailers’ P&L, and improved store readiness.
Speed That Delivers Real ROI
Every candidate gets interviewed instantly. No waiting. No bias. Just fast, fair, data-backed decisions. This generates real impact for retailers who previously relied on slow, outdated processes to handle thousands of applicants.
DEI by Design, Not by Mandate
With Sapia.ai:
DEI Fairness Scores (based on actual hiring data):
Gender: 1.03 (vs customer baseline of 1.01)
Ethnicity: 1.15 (vs customer baseline of 0.74)
Why? Because ethical AI removes what humans can’t unlearn: bias. With a candidate experience that is inclusive by design, retailers can ensure fairness in screening, and measure it in hiring.
Candidate Experience = Brand Experience
Retail candidates are your customers. And the experience you give them matters. We have built a brand advocacy engine that delights candidates and gives you the data to prove it.
Responsible, Explainable AI Built for Retail
Not all AI is created equally. Since 2018, Sapia.ai has been built on a foundation of responsible AI:
“We can’t go back to life before Sapia.ai. We used to spend half the day reading resumes.”
— Talent Lead, Starbucks AU
What’s at Stake: Time, Brand, and Revenue
Every day spent using outdated hiring methods costs retailers:
With Sapia.ai, you get the productivity unlock retail hiring demands, and the intelligence your talent deserves.
Want to see how fast, fair, and human retail hiring can be?
We can’t hide from reality anymore. Talent needs are shifting overnight, and AI is redefining what it means to work. Traditional talent frameworks are no longer fit for purpose. At Sapia.ai, we believe the future of talent strategy lies in a smarter, fairer, and more adaptive way of defining what great looks like.
Our AI hiring platform is built on the largest proprietary dataset of interview answers globally – we’re a data company at heart, and we’ve seen the power of data-driven people methodology in transforming how organisations hire and retain good talent.
So, when it came to building a new Competency Framework that could be leveraged globally for hiring for any role at any scale, of course, we used a ground-up, data-led methodology that bridges the gap between organisational psychology and AI.
Conventional frameworks are typically crafted through expert interviews and focus groups. While valuable, they tend to be subjective, static, and too slow to keep pace with evolving job demands. As roles become more fluid and technology augments or replaces task-based skills, organisations need a new way to understand the human capabilities that genuinely matter for performance.
We wanted to identify enduring, job-agnostic competencies that reflect what drives success in a modern workplace – capabilities like adaptability, resilience, learning agility, and customer orientation.
(Why competencies and not just skills? Read why here.)
Sapia.ai’s methodology is rooted in the science of human behaviour but powered by cutting-edge AI. We asked two core questions:
The answer to both: yes.
We began with a rich dataset of over 37,000 job descriptions across industries and role types. Using large language models (LLMs) and advanced NLP techniques, we extracted over 200,000 behavioural descriptors. These were distilled down through a four-step process:
This resulted in a refined list of 25 human-centric competencies, each with clear behavioural indicators and practical relevance across a wide range of roles.
Our framework is intelligent, but importantly, it’s adaptive. Organisations can apply this methodology to their own job descriptions to discover custom competencies. This bottom-up, role-data-led approach ensures alignment to real work, not just theoretical models.
And because the framework integrates directly with our AI-powered hiring tools, you get a connected system that brings your talent strategy to life.
Our framework comes to life in the following tools:
Skills alone cannot predict success. Competencies do. As AI continues transforming how we work, Sapia.ai’s Competency Framework offers a scalable, scientific, and fair foundation for hiring and developing the talent of tomorrow.
If you’re a CHRO or Head of Recruitment at an enterprise today, chances are you’ve been inundated with messages about the importance of “skills-based hiring.” LinkedIn’s recent Work Change Report (2025) is full of compelling data: a 140% increase in the rate at which professionals are adding new skills to their profiles since 2022, and a projection that by 2030, 70% of the skills used in most jobs today will have changed.
This is essential reading. But there’s a missed opportunity: the singular focus on “skills” fails to acknowledge the real metric that talent leaders need to be using to future-proof their workforce — competencies.
But skills on their own — even soft ones — are generic, disjointed, and often disconnected from real-world performance. In contrast:
Put simply, competencies answer the all-important question: Can this person apply the right skills, in the right way, at the right time, to deliver results in our environment?
The Work Change Report outlines a future where job titles are fluid, roles evolve quickly, and AI is a constant disruptor. This creates three massive challenges for hiring at scale:
Skills alone don’t tell us whether someone can succeed in a role that will look different 12 months from now. But competencies can. Because they measure not just what a person knows, but how they apply it.
The LinkedIn report highlights a critical insight: organisations now prioritise agility in entry-level hiring. And there’s a good reason for that. With professionals expected to hold twice as many jobs over their careers compared to 15 years ago, adaptability is not just a nice-to-have. It’s core to success.
But you can’t measure agility with a keyword on a CV. You measure it by looking at competencies like:
When you shift the focus away from skills to behavioural competencies that can be defined, observed, and assessed in structured ways, you open yourself up to a much more dynamic and more useful way of managing talent.
To hire effectively at scale, particularly in a technology-driven world of work, talent leaders must shift their lens:
LinkedIn’s data shows that people are learning more skills more quickly than ever. But the real question for talent leaders like you is: Are those skills being applied in ways that drive value? Are we hiring for task proficiency or performance?
The truth is that the organisations that will thrive in an AI-driven, skills-fluid economy aren’t the ones chasing the next hot skill. They’re the ones designing systems to identify, develop and scale competence.
Sapia.ai has developed a comprehensive Competency Framework using a data-driven approach. Download the full paper here.