Kevin Slater PyData NYC 2024

Kevin Slater
.ical

Kevin Slater is a Senior Data Scientist at Boston Consulting Group. He has over five years experience delivering ML / AI solutions for clients across multiple industries, including aviation, manufacturing, and supply chain management. He is passionate about Responsible AI and has enjoyed helping teams design and implement their GenAI Testing & Evaluation strategies.

Prior to BCG, Kevin was a Quantitative Strategist at Goldman Sachs with a focus on inventory management. He has a degree in Physics and Mathematics from the University of Chicago.

Sessions

11-07

14:35

40min

Use ARTKIT to Automate and Scale Up Your LLM Evaluation Process

Andrea Gao, Kevin Slater

Since late 2022, Large Language Models (LLMs) have become an integral part of our daily lives, propelled by the rise of ChatGPT. Amid continuous media coverage, companies like Nvidia have experienced stock surges, and individuals have adjusted their work and study habits in response to these developments. However, many organizations remain hesitant to adopt LLMs and Generative AI (GenAI) at scale, primarily due to insufficient comprehensive experimentation and testing. There is a concern that deploying LLMs/GenAI without proper alignment to business needs could incur reputational or legal risks.

At Boston Consulting Group (BCG), we've been assisting numerous clients in navigating the LLM/GenAI landscape since late 2022. Unlike other AI applications, we've encountered challenges in finding suitable tools for testing LLM/GenAI systems. In response, we've developed an open-source Python package, ARTKIT, designed to facilitate automatic and scalable testing of LLMs/GenAI. In this talk, we'll share BCG's broader Responsible AI(https://www.bcg.com/capabilities/artificial-intelligence/responsible-ai) efforts and introduce ARTKIT(https://github.com/BCG-X-Official/artkit) to the PyData community.

Central Park East

Kevin Slater .ical

Sessions

Kevin Slater
.ical