Back

The Responsible Use of LLMs in Structured Interviews

 

Large Language Models, or LLMs, are the engines that have driven some of the biggest technological advancements in recent years. OpenAI’s GPT-4, Google’s Bard, and Metas Llama are examples of LLMs that power interfaces like ChatGPT, which has changed how many of us work and manage aspects of our daily lives. 

At Sapia.ai, we use responsible AI to help organizations find the people who belong with their brands. Chat Interview enables our customers to interview every candidate, widening their talent pools and creating a more efficient, effective, and fair recruitment process. 

With the advent of LLMs with enhanced language understanding and reasoning, there is a promising opportunity to extend these models in the recruitment space. However, using LLMs for activities like interview scoring demands not only innovation but also a strong commitment to responsible use and ethics. Any use of technology to make hiring more efficient should also make it fairer and transparent. 

This blog post summarises the paper that we presented at SIOP 2024, titled “Responsible Use of Large Language Models for Response Grading and Explanations in Structured Interviews”, written by Yimeng Dai, Leo Pham, Ashlie Plants, and Buddhi Jayatilleke. 

The Beginning of InterviewLLM

Our goal was to create an LLM that could grasp the unique nuances of interview conversations, leading to the creation of InterviewLLM. This domain-specific model was trained on a rich dataset from our Chat Interview™ platform, encompassing over 1.3 billion words from diverse industries, job roles, and geographies. The result is a model that can understand and generate interview dialogues with fewer possible stereotypes. 

The Gold Standard in Interview Grading

Building on the capabilities of InterviewLLM, we developed Saige™ (Sapia AI for Interview Grading and Explanations), a tool that takes interview grading to new heights. Saige™ is fine-tuned with a grading rubric crafted by our subject matter experts, employing Behaviorally Anchored Rating Scales (BARS) to ensure that assessments of candidate responses are anchored in the description of observable behaviors. By adopting BARS, the gold standard for creating rubrics for scoring interviews by humans, Saige™ allows human SMEs to add new competencies to be rated by Saige™, which most of the current mainstream automated interview grading approaches lack.

We didn’t stop at creating a more intelligent grading system with extensibility and explainability. Using LLMs in interviews demands a strong ethical commitment. We recognized the importance of addressing potential biases in AI, which led us to implement a de-biasing process during Saige™’s training. By combining the expertise of our human SMEs with AI feedback, we created a model that not only delivers accurate grades but does so in a manner that is fair and objective.

Saige™ also leverages LLM to explain why a candidate received a certain score, referencing actual examples from the candidate’s response and a description of how it exemplifies or fails to exemplify, the characteristics of the attribute that were defined by the SMEs. The characteristics of each attribute are defined by not only the many facets of them but also the frequency and depth in which they are demonstrated.

The Impact and Implications of Saige™

The introduction of Saige™ marks a significant milestone in the recruitment industry. In testing, Saige™ has demonstrated exceptional performance compared to LLaMA-2, with a higher agreement and correlation with SME grading. 

The implications of Saige™ extend beyond just improved interview grading accuracy. By reducing biases in the grading process, we’re contributing to a more equitable hiring process, where candidates are evaluated based on their merits rather than unconscious prejudices.

Saige™’s ability to provide clear, natural language explanations for grading decisions creates a level of transparency not seen before in automated interviews. With the explanations, candidates can gain insights into areas for improvement and identify the attributes or abilities necessary for success in their careers; while hiring teams can learn more about each candidate and have confidence in their hiring decisions.

Possible Applications of Saige™

We’re at the start of our journey with InterviewLLM and Saige™. The opportunity to use this capability beyond assessment, to provide effective coaching to candidates and employees at scale is immense and exciting.  It could enable them to understand how they can improve, fostering continuous growth and professional development within the organization. As we continue to refine these Generative AI tools and explore new applications of LLMs in recruitment, we remain committed to our core principles of fairness and transparency.


Blog

Joe & the Juice Partners with Sapia.ai, Scaling an Exceptional Candidate Experience and Cutting Time to Hire

Read the full press release about the partnership here.

Joe & the Juice, the trailblazing global juice bar and coffee concept, is renowned for its vibrant culture and commitment to cultivating talent. With humble roots from one store in Copenhagen, now with a presence in 17 markets, Joe & The Juice has built a culture that fosters growth and celebrates individuality.

But, as their footprint expands, so does the challenge of finding and hiring the right talent to embody their unique culture. With over 300,000 applications annually, the traditional hiring process using CVs was falling short – leaving candidates waiting and creating inefficiencies for the recruitment team. To address this, Joe & The Juice turned to Sapia.ai, a pioneer in ethical AI hiring solutions.

A Fresh Approach to Hiring

Through this partnership, Joe & The Juice has transformed its hiring process into an inclusive, efficient, and brand-aligned experience. Instead of faceless CVs, candidates now engage in an innovative chat-based interview that reflects the brand’s energy and ethos. Available in multiple languages, the AI-driven interview screens for alignment with the “Juicer DNA” and the brand’s core values, ensuring that every candidate feels seen and valued.

Candidates receive an engaging and fair interview experience as well as personality insights and coaching tips as part of their journey. In fact, 93% of candidates have found these insights useful, helping to deliver a world-class experience to candidates who are also potential guests of the brand.

“Every candidate interaction reflects our brand,” Sebastian Jeppesen, Global Head of Recruitment, shared. “Sapia.ai makes our recruitment process fair, enriching, and culture-driven.”

Results That Matter

For Joe & The Juice, the collaboration has yielded impressive results:

  • 33% Reduction in Screening Time: Pre-vetted shortlists from Sapia.ai’s platform ensure that recruiters can focus on top candidates, getting them behind the bar faster.

  • Improved Candidate Satisfaction: With a 9/10 satisfaction score from over 55,000 interviews, candidates appreciate the fairness and transparency of the process.

  • Bias-Free Hiring: By eliminating CVs and integrating blind AI that prioritizes fairness, Joe & The Juice ensures their hiring reflects the diverse communities they serve.

Frederik Rosenstand, Group Director of People & Development at Joe & The Juice, highlighted the transformative impact: “Our juicers are our future leaders, so using ethical AI to find the people who belong at Joe is critical to our long-term success. And now we do that with a fair, unbiased experience that aligns directly with our brand.”

Trailblazing for the hospitality industry

In an industry so wholly centred on people, Joe & the Juice is paving the way for similar brands to adopt technology that enables inclusive, human-first experiences that can reflect a brand’s core values. 

If you’re curious about how Sapia.ai can transform your hiring process, check out our full case study on Joe & The Juice here.

 

Read Online
Blog

Sapia.ai Wrapped 2024

It’s been a year of Big Moves at Sapia.ai. From welcoming groundbreaking brands to achieving incredible milestones in our product innovation and scale, we’re pushing the boundaries of what’s possible in hiring.

And we’re just getting started 🚀

Take a look at the highlights of 2024 

All-in-one hiring platform
This year, with the addition of Live Interview, we’re proud to say our platform now covers screening, assessing and scheduling.
It’s an all-in-one volume hiring platform that enables our customers to deliver a world-leading experience from application through to offer.

Supercharging hiring efficiency
Every 15 seconds, a candidate is interviewed with Sapia.ai.
This year, we’ve saved hiring managers and recruiters hours of precious time that can now be used for higher-value tasks. 

See why our users love us 

Giving candidates the best experience
Our platform allows candidates to be their best selves, so our customers can find the people that truly belong with them. They’re proud to use a technology that’s changing hiring, for good.

Share the candidate love

Leading the way in AI for hiring 

We’ve continued to push the boundaries in leveraging ethical AI for hiring, with new products on the way for Coaching, Internal Mobility & Interview Builders. 

Join us in celebrating an incredible 2024

Read Online
Blog

Situational Judgement Tests vs. AI Chat Interviews: A Modern Perspective on Candidate Assessment

Choosing the right tool for assessing candidates can be challenging. For years, situational judgement tests (SJTs) have been a common choice for evaluating behaviour and decision-making skills. However, they come with limitations that can make the hiring process less effective and less inclusive.

AI-enabled chat-based interviews, such as Sapia.ai, provide organisations with a modern alternative. They focus on understanding candidates as individuals and creating a hiring experience that is both fair and insightful while enabling efficient screening and selection. 

This shift raises important questions: Are SJTs still a tool that should be considered for volume hiring? And what do AI assessments offer in comparison?

1. The Static Nature of SJTs

Traditional SJTs use predefined multiple-choice questions to assess behavioural tendencies and situational knowledge. While useful for screening, these static frameworks lack the flexibility to adapt based on real-world performance data or evolving role requirements. 

Once created, SJTs don’t adapt to new data or evolving organisational needs. They rely on fixed scenarios and responses that may not fully reflect the dynamic realities of modern workplaces, and as a result, their relevance may diminish over time.

AI-enabled chat interviews, on the other hand, are inherently adaptive. Using machine learning, these tools can continuously refine their models based on feedback from real-world outcomes such as hiring or turnover data. This ability to evolve ensures the assessments align with organisations’ needs.

2. Richer Data Through Open-Ended Responses

One of the main critiques of SJTs is their reliance on multiple-choice responses. While structured and straightforward, these options may not capture the full scope of a candidate’s thinking, communication skills, or problem-solving ability. The approach is often limiting, reducing complex human behaviour to a few predefined choices.

AI-enabled chat interviews work more holistically and dynamically. These tools provide a more complete picture of a person by allowing candidates to answer questions in their own words. Natural language processing (NLP) analyses their responses, offering insights into personality traits, communication skills, and behavioural tendencies. This open-ended format lets candidates express themselves authentically, giving employers a deeper understanding of their potential.

3. The Candidate Experience: Stressful or Supportive?

SJTs often include time constraints and rigid formats, which can create pressure for candidates. This is especially true when candidates feel forced to choose options that don’t fully reflect how they would actually behave. The process can feel impersonal, even transactional.

In contrast, chat-based interviews are designed to be conversational and low-pressure for candidates. By removing time limits and adopting a familiar chat interface, these tools help candidates feel more at ease. They also frequently include personalised feedback, turning the assessment into a valuable experience for the candidate, not just the employer.

4. Addressing Bias and Fairness

Traditional SJTs are prone to transparency issues, as candidates can often identify and select the “best practice” answers without revealing their true tendencies. Additionally, static test designs can unintentionally embed bias; due to the nature of the timed test, SJTs have been found to disadvantage some groups. 

AI chat interviews, when developed ethically within a framework like Sapia.ai’s FAIR Hiring Framework, eliminate explicit bias by relying solely on the content of a candidate’s responses. Their machine learning models are continuously validated for fairness, ensuring that hiring decisions are free from subjective judgments or irrelevant demographic factors.

5. An Assessment That Improves Over Time

Workplaces are constantly changing, and hiring tools need to keep up. SJTs’ fixed nature can make them less effective as roles evolve or organizational priorities shift. They provide a snapshot but not a dynamic view of what’s needed.

AI-enabled chat interviews are built to adapt. With feedback loops and continuous learning, they incorporate real-world hiring outcomes—like retention and performance data—into their models. This ensures that assessments stay relevant and effective over time.

Rethinking Candidate Assessment

As hiring demands grow more complex, so does the need for tools that can capture the whole person, not just their response to hypothetical scenarios. While SJTs have played an important role in hiring practices, they are increasingly being replaced by tools like AI-enabled chat interviews.

These modern approaches provide richer data, adapt to changing needs, and create a richer and more engaging experience for candidates. Perhaps most importantly, they emphasise fairness and inclusivity, aligning with the growing demand for unbiased hiring practices.

For organisations evaluating their assessment tools, the question isn’t just which method is “better.” Understanding the specific needs of your roles, teams, and candidates will help you  choose tools that help you make decisions that are both informed and equitable.

Read Online