AI QA Engineer

  • Компания: Київстар
  • Город , Kyiv,
  • Зарплата:
  • Размещено: 2025-08-14 00:00:00

Описание

We are seeking an AI QA Engineer with specialization in LLM/NLP model quality assurance to ensure our language models and NLP applications meet the highest standards of accuracy, reliability, and safety. In this role, you will develop rigorous testing strategies for our AI models – including large language models – and lead efforts to detect issues such as factual errors, biases, and instability in model outputs. You will work closely with data scientists and engineers to integrate testing into the model development lifecycle, from early prototyping to post-deployment monitoring. This position is ideal for someone with a strong quality assurance background and a passion for AI, who can bridge the gap between traditional software QA and the unique challenges of evaluating AI systems (chatbots, NLP APIs, etc.) in the context of our Ukrainian LLM project and other products.About us is a Ukrainian hybrid IT company and a resident of We are a subsidiary of Kyivstar, one of Ukraine's largest telecom operators.Our mission is to change lives in Ukraine and around the world by creating technological solutions and products that unleash the potential of businesses and meet users' needs.Over 500+ specialists work daily in various areas: mobile and web solutions, as well as design, development, support, and technical maintenance of high-performance systems and services.We believe in innovations that truly bring quality changes and constantly challenge conventional approaches and solutions. Each of us is an adherent of entrepreneurial culture, which allows us never to stop, to evolve, and to create something new.What you will do• Develop and execute comprehensive AI model evaluation strategies to assess the performance of our NLP and LLM systems. Define testing methodologies that cover correctness (, accuracy of responses, compliance with requirements), consistency, and fairness of model outputs.• Analyze benchmarking datasets, define gaps, and develop the first SOTA benchmarking framework for the Ukrainian language.• Analyze training datasets and collaborate with data engineers on processing pipelines improvement. Implement a training data testing framework.• Implement both automated and manual testing for applications powered by large language models. This includes creating automation scripts or test harnesses that can systematically query models with test cases (prompts/questions) and verify responses, as well as performing hands-on review of outputs for subjective evaluation.• Build and curate high-quality test datasets for model evaluation. Manage a repository of test inputs (, sample user queries, edge-case scenarios, conversational dialogues) along with expected or reference outputs when applicable. Ensure these datasets are diverse, balanced, and representative of real-world use cases, including Ukrainian language content and culturally relevant scenarios.• Develop pipelines for synthetic data generation and adversarial example creation to challenge the model’s robustness. Use techniques such as paraphrasing, noise injection, or adversarial prompting to produce test cases that can reveal model weaknesses.• Design and maintain testing frameworks to detect hallucinations, biases, and other failure modes in LLM outputs. • Define and track key AI performance metrics. Monitor metrics like factual accuracy, coherence/fluency, relevancy to prompt, response diversity, latency of response, and user satisfaction ratings if available. Establish baseline metrics for each new model version and ensure subsequent iterations meet or exceed these benchmarks.• Work closely with the AI development team to integrate QA in the development process. Collaborate with data scientists to test models at early stages (, evaluating prototypes before full deployment), and with ML engineers to include automated tests in CI/CD pipelines for model updates.• Debug and analyze AI model failures. When tests uncover issues (, a model consistently gives incorrect information in a certain domain or shows a bias), investigate and identify root causes by analyzing model outputs and underlying data. Provide clear, detailed reports on issues with steps to reproduce and potential causes.• Provide feedback and recommendations for model improvement. Work with prompt engineers or NLP scientists to refine prompts and instructions that guide the model towards better performance.• Implement continuous monitoring in production to catch regressions or new issues. Set up mechanisms to regularly evaluate live model outputs (via sampling or user feedback analysis) and alert the team if any quality metrics degrade over time (indicative of model drift or unforeseen use cases).• Maintain comprehensive test documentation and reports. Document test plans, test case suites, and summarize the results of evaluations for each model version (including graphs/metrics and qualitative findings). Communicate findings to both technical teams and stakeholders in a clear, actionable manner.Qualifications and experience neededQA Experience: • 3+ years in a Quality Assurance or Testing role, with at least part of that focused on testing AI, ML, or complex data-driven systems, and 2+ years in data analysis.• Strong foundation in QA methodologies, test planning, and test case design.• Experience writing test plans and handling bug tracking for software projects.AI/ML Knowledge:• Familiarity with machine learning concepts and specific challenges of testing AI models.• Experience with AI/ML testing frameworks and LLM evaluation methodologies – for example, knowledge of how to measure model accuracy on benchmarks, how to perform AB testing on model versions, or using frameworks like Hugging Face’s evaluation tools or custom Python-based testing.NLP Domain Skills: • Solid understanding of Natural Language Processing tasks and common failure modes of language models. • Awareness of issues like model hallucination (making up facts), bias in AI (and methods to test for bias), and the importance of context in language understanding.• Ideally, hands-on experience testing chatbots, virtual assistants, or language generation systems.Programming & Tools:•Proficiency in Python for developing test automation and evaluation scripts.•Familiarity with testing frameworks (PyTest, unittest) and libraries commonly used in ML/NLP (pandas, numpy for data handling; possibly Hugging Face transformers for model interfacing).•Experience with tools for dataset handling and annotation; ability to write simple scripts to manipulate and evaluate text data.Data Management:• Experience creating and managing test datasets, including annotation and labeling processes.• Comfortable with basic data engineering to gather logs or outputs from models and analyze them.• Knowledge of using version control for test scripts and maintaining a repository of test cases.Analytical Skills:• Strong problem-solving and debugging skills, specifically applied to AI outputs• Ability to notice patterns in model errors and analytically determine what they have in common.• Capacity to interpret model evaluation metrics and translate them into actionable improvements.Communication: • Excellent written and verbal communication skills.• Able to document bugs, write detailed QA reports, and discuss issues with developers and researchers.• Fluent Ukrainian is a must, as our LLM is oriented towards Ukrainian – you should be able to evaluate outputs in Ukrainian for correctness and nuance.Attention to Detail:• A keen eye for spotting subtle errors or oddities in AI behavior.• Patience and thoroughness in performing manual testing when needed, and creativity in thinking of edge cases or tricky scenarios to test the model's limits.A plus would beAI Testing Tools: •Experience with specialized tools or frameworks for AI testing, such as model evaluation harnesses, adversarial testing platforms, or crowdsourced evaluation methods.•Familiarity with techniques like prompt engineering and how prompt changes affect model output quality.Statistical Analysis: •Ability to perform statistical analyses on model performance results (significance testing for A/B comparisons, etc.) to determine if changes are improvements.•Understanding of experiment design in AI (, proper control groups for new model versions).Continuous Integration: •Experience integrating tests into CI/CD pipelines for ML – for example, automatically evaluating a model on a validation set every time it’s updated, and blocking deployment if it fails certain criteria.•Familiarity with ML model versioning and deployment workflows.Security & Compliance Testing: •Knowledge of testing AI models for security and compliance issues – , prompt injection attacks on LLMs, data privacy in outputs, or ensuring no disallowed content is generated according to usage policies.UX Perspective:•Some experience or understanding of user experience as it relates to AI products.•Being able to anticipate how end-users might interact with the AI (for instance, phrasing questions in unexpected ways) and ensuring the model handles such interactions gracefully.Testing Certifications:•Any certifications or formal training in Quality Assurance, software testing (such as ISTQB) or in AI/ML could be a plus, demonstrating a commitment to the discipline.What we offerOffice or remote — it’s up to you. You can work from anywhere, and we will arrange your workplace.Remote onboarding.Performance bonuses.We train employees with the opportunity to learn through the company’s library, internal resources, and programs from partners.Health and life insurance.Wellbeing program and corporate psychologist.Reimbursement of expenses for Kyivstar mobile communication.

Похожие вакансии

Middle AI Software Engineer

... seeking a forward-thinking Middle AI Software Engineer to join our R&D team and help shape the future of AI-assisted software development. Youll work ... _Україною. Ми шукаємо прогресивного Middle AI Software Engineer, який приєднається до нашої команди ...
Компания: Raiffeisen Bank Ukraine Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

Senior QA Engineer

Senior QA Engineer Full-timeCompany DescriptionFounded in 1999, ... for a diligent and meticulous QA Engineer who is excited to work ...
Компания: Playtech Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

AI Data Engineer

... GoalWe’re looking for a Data Engineer AI Platform Specialist to build and optimize the infrastructure powering our AI ambitions. In this hands-on ... working with us? Forward-thinking AI-driven team revolutionizing the sports ...
Компания: OpenBet Город:, Lviv,
Зарплата: Размещено:
ua.talent.com

AI Integration Engineer

... company Andersen invites an experienced AI Integration Engineer to work on a large- ... .The project is to integrate AI into software development, accelerating delivery, ... related field.Experience as an AI Integration Engineer for 5+ years. Proven experience ...
Компания: Andersen Город:Ukraine
Зарплата: Размещено:
people.andersenlab.com

Senior QA Engineer

... coordinate testing efforts with the QA Team Leader and development teamDesign, ... escalate critical issues to the QA Team LeaderProvide stakeholders with clear ... correspondence as neededManage and improve QA sub-processes assigned by the ...
Компания: Playtech Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

Lead Data Scientist (Generative AI)

... .IF YOU AREExperienced in Generative AI and natural language processing (NLP) ... and image generationCapable of leading AI initiatives by applying advanced deep ... , including fine-tuningProficient in the AI project lifecycle, from development to ...
Компания: SoftServe Город:, ,
Зарплата: Размещено:
ua.talent.com

Lead QA Engineer

Job DescriptionThe Lead QA Engineer will head our Quality Assurance ... + years of experience in QA with at least 2 years ... adoption and improvement of QA processes and integrate best practices ... , test plans, and other QA artifacts. Perform hands-on testing ...
Компания: Customertimes Город:Georgia, Montenegro, Bulgaria, Serbia, Remote
Зарплата: Размещено:
customertimes.com

Middle Data Scientist (Generative AI)

... a related fieldExperienced in generative AI and NLP techniques, including large- ... image generationCompetent in applying generative AI and language models to lead innovative NLP and AI-driven initiativesProficient with state-of- ...
Компания: SoftServe Город:, ,
Зарплата: Размещено:
ua.talent.com

Middle AI Engineer

... team as a Middle AI Engineer and help shape the future ... passionate about using AI to drive real impact and ... years of experience in AI or ML development and deployment; ... research and trends in AI ML, including advancements in GenAI ...
Компания: Avenga Город:, ,
Зарплата: Размещено:
ua.talent.com

Automation QA Engineer

... is looking for an Automation QA Engineer to join our team full- ... the role: As an Automation QA Engineer, become a part of a ...
Компания: Ciklum Город:, ,
Зарплата: Размещено:
ua.talent.com

Head of AI Consulting (#3887)

... to become) a world-class AI consultant for both clients and internal delivery teams Engage in short- and mid-term AI projects as a consultant engineer to ensure successful execution and ...
Компания: N-iX Город:, ,
Зарплата: Размещено:
ua.talent.com

Gen AI Adoption Lead – Java Engineering Focus (#3839)

... for a skilled Gen AI Adoption Lead to join our team. The Gen AI Adoption Lead will join a ... and scalable integration of Gen AI within the SDLC across N- ... People who talk about AI but dont use it daily ...
Компания: N-iX Город:, ,
Зарплата: Размещено:
ua.talent.com

AI Engineer

... is in a high-performing Engineer team. That’s why we’re seeking an AI Engineer.We would love to hear ... mathematical concepts relevant to AI, such as linear algebra, calculus, ... Side note: We understand that AI can make things easier, but ...
Компания: Vable Город:, Lviv,
Зарплата: Размещено:
ua.talent.com

Senior JavaScript Engineer (React/TypeScript) Remote

... . If you’re an experienced JavaScript engineer who thrives on innovation and is excited about using AI-powered development tools to boost ... us?If you’re a proactive engineer who’s excited to use AI to improve the online shopping ...
Компания: Vitals Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

Senior JavaScript Engineer (React/TypeScript) - Remote

... . If you’re an experienced JavaScript engineer who thrives on innovation and is excited about using AI-powered development tools to boost ... us?If you’re a proactive engineer who’s excited to use AI to improve the online shopping ...
Компания: Vitals Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

Junior QA Engineer

... a responsible and organized Junior QA Engineer to join our team.Andersen ... . Performing integration testing.RequirementsExperience as QA Engineer 6+ months. API testing knowledge ...
Компания: Andersen Город:Ukraine
Зарплата: Размещено:
people.andersenlab.com

AI/Data Engineer

... company Andersen invites an experienced AI Data Engineer to join its team for ... streaming capabilities.RequirementsExperience as a AI Data Engineer for 3+ years. Proficiency in ...
Компания: Andersen Город:Ukraine
Зарплата: Размещено:
people.andersenlab.com

Senior AQA Engineer (Python)

... .They seek an experienced Automation QA Engineer to test high-volume data processing systems, ensure AI-driven solution reliability, and enhance ...
Компания: Avenga Город:, ,
Зарплата: Размещено:
ua.talent.com

Middle QA Engineer

... specifically looking for a Middle QA Engineer who join our Product Team ... )Experience in API testingExperience in QA process setup from scratch, estimation, ...
Компания: Ascendix Tech Город:Europe, Remote
Зарплата: Размещено:
ascendixtech.com

Middle QA Engineer

... are looking for a Middle QA Engineer with experience in software testing, ... • 2+ years of experience in QA with a focus on gambling ...
Компания: Playnetic Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

Snr AI Security Engineer (Detection)

... Security.About the TeamThe AI security engineer will play a crucial role in ensuring the security of Zoom’s AI products. This team identify, assess, ... years experience as a Security Engineer with a focus on AI Security, especially LLM security.Solid ...
Компания: Zoom Город:, Dnipro,
Зарплата: Размещено:
ua.talent.com

Senior Lead Software Engineer, AI Engineering

... YoursAs a Senior Lead Software Engineer, AI, you’ll architect and deliver core ... as a Senior Lead Software Engineer, AIDesign and build AI-powered tools that enhance how ... best practices for embedding Gen-AI into engineering workflows, from code ...
Компания: DraftKings Город:, ,
Зарплата: Размещено:
ua.talent.com

Lead Software Engineer, AI Engineering

... Is YoursAs a Lead Software Engineer, AI, you’ll architect and deliver core ... do as a Lead Software Engineer, AIDesign and build AI-powered tools that enhance how ... best practices for embedding Gen-AI into engineering workflows, from code ...
Компания: DraftKings Город:, ,
Зарплата: Размещено:
ua.talent.com

Junior QA Engineer

Ringostat шукає в команду Junior QA Engineer, котрий допоможе забезпечити якість продукту, ...
Компания: Netpeak Город:Full Remote
Зарплата: Размещено:
career.netpeak.group

AI Senior Product Manager

... , empower your team, and ensure AI becomes a core enabler of ... vision into impact—translating complex AI capabilities into business value, securing ... projects that translate cutting - edge AI innovation into tangible business value. ...
Компания: OpenBet Город:, Lviv,
Зарплата: Размещено:
ua.talent.com

Senior Automation/Manual QA Engineer (C#) (#3827)

... to our business. The Senior QA Tester Automation Engineer is a key member of ... standards set out.  The Senior QA Tester Automation Engineer will become an expert in ...
Компания: N-iX Город:, ,
Зарплата: Размещено:
ua.talent.com

Senior Data Scientist

... collaboration in Generative AI.In 2025, a key area of innovation is Agentic AI — where we design and deploy ... fieldExperienced in Generative AI and natural language processing (NLP), ... -the-art Generative AI techniques and methodologiesGrow your skill ...
Компания: SoftServe Город:, ,
Зарплата: Размещено:
ua.talent.com

Senior/Lead Data Scientist

... collaboration in Generative AI.In 2025, a key area of innovation is Agentic AI — where we design and deploy ... fieldExperienced in Generative AI and natural language processing (NLP), ... -the-art Generative AI techniques and methodologiesGrow your skill ...
Компания: SoftServe Город:, ,
Зарплата: Размещено:
ua.talent.com

ML Engineer (Focus on NLP Technologies)

... it here.Join NineTwoThree AI Studio, a remote-first product ... in Boston. We build AI-powered web and mobile apps ... development team to integrate AI models into existing systems. Experiment ... technical interview with ML Engineer and CTO (optional), 3rd technical ...
Компания: NineTwoThree AI Studio Город:, ,
Зарплата: Размещено:
ua.talent.com

Senior Full Stack Engineer (.NET + Basic knowledge React) (#3777)

... building our next-generation MCP AI Platform, leveraging the full power ... like Salesforce AgentForce and ServiceNow AI Agent Fabric.  Responsibilities: End-to- ... experience as a Full-Stack Engineer with expertise in C#, .NET, ...
Компания: N-iX Город:, ,
Зарплата: Размещено:
ua.talent.com

QA Engineer

... IT company Andersen invites a QA Engineer to work on a large- ... meetings.RequirementsProven experience as a QA Engineer on complex projects, especially in ...
Компания: Andersen Город:Ukraine
Зарплата: Размещено:
people.andersenlab.com

Senior Data Science/ AI Engineer (with Databricks) (#3576)

... seeking a Senior Data Scientist AI Engineer with expertise in Databricks to enhance our capabilities in developing advanced AI solutions for automotive claims management. ... experience in Data Science or AI Engineering. Strong expertise in developing ...
Компания: N-iX Город:, ,
Зарплата: Размещено:
ua.talent.com

QA Engineer (Florida)

QA Engineer (Florida)Full-timeCompany DescriptionFounded in ... for a diligent and meticulous QA Engineer who is excited to work ... the development team and QA Team LeaderDesign, develop, execute, and ... of experience as a QA EngineerAre experienced in Web and ...
Компания: Playtech Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

QA Engineer

... IT company Andersen invites a QA Engineer to work on a project ... project team.RequirementsExperience as a QA Engineer for 1,5+ years. Solid ...
Компания: Andersen Город:Ukraine
Зарплата: Размещено:
people.andersenlab.com

QA Engineer (EST hours) ID35731

... areas like application development and AI ML, and our people-first ... of the testing pyramid; - Validate AI model performance, including accuracy, confidence ... learn and apply the latest AI and QA practices to make the impossible ...
Компания: AgileEngine Город:, Kharkiv,
Зарплата: Размещено:
ua.talent.com

QA Engineer (EST hours) ID35731

... areas like application development and AI ML, and our people-first ... of the testing pyramid; - Validate AI model performance, including accuracy, confidence ... learn and apply the latest AI and QA practices to make the impossible ...
Компания: AgileEngine Город:, Sokil'nyky,
Зарплата: Размещено:
ua.talent.com

QA Engineer (EST hours) ID35731

... areas like application development and AI ML, and our people-first ... of the testing pyramid; - Validate AI model performance, including accuracy, confidence ... learn and apply the latest AI and QA practices to make the impossible ...
Компания: AgileEngine Город:, Odesa,
Зарплата: Размещено:
ua.talent.com

QA Engineer (EST hours) ID35731

... areas like application development and AI ML, and our people-first ... of the testing pyramid; - Validate AI model performance, including accuracy, confidence ... learn and apply the latest AI and QA practices to make the impossible ...
Компания: AgileEngine Город:, Kyiv,
Зарплата: Размещено:
ua.talent.com

QA Engineer (EST hours) ID35731

... areas like application development and AI ML, and our people-first ... of the testing pyramid; - Validate AI model performance, including accuracy, confidence ... learn and apply the latest AI and QA practices to make the impossible ...
Компания: AgileEngine Город:, Dnipro,
Зарплата: Размещено:
ua.talent.com