meta_pixel
Tapesearch Logo
Log in
Software Engineering Daily

Optimizing Agent Behavior in Production with Gideon Mendels

Software Engineering Daily

Software Engineering Daily

News, Technology, Tech News

4.4662 Ratings

🗓️ 17 February 2026

⏱️ 52 minutes

🧾️ Download transcript

Summary

LLM -powered systems continue to move steadily into production, but this process is presenting teams with challenges that traditional software practices don’t commonly encounter. Models and agents are non-deterministic systems, which makes it difficult to test changes, reason about failures, and confidently ship updates. This has created the need for new evaluation tooling designed specifically

Transcript

Click on a timestamp to play from that location

0:00.0

LLM-powered systems continue to move steadily into production, but this process is presenting

0:05.6

teams with challenges that traditional software practices don't commonly encounter.

0:11.1

Models and agents are non-deterministic systems, which makes it difficult to test changes,

0:16.3

reason about failures, and confidently ship updates.

0:19.9

This has created the need for new evaluation tooling

0:22.8

designed specifically around the properties of LLMs. Comet is a platform with Roots and MLOps

0:29.8

that has evolved to support teams building modern LLM-powered applications. The company recently

0:36.0

launched OPEC, which is an open source platform focused on

0:39.8

evaluation, optimization, and observability for LLM agents. Together, the tools aim to bring the rigor

0:47.4

of traditional engineering and ML workflows to the rapidly evolving world of agent-based systems

0:53.3

by treating prompts, tools, and workflows

0:56.0

as optimizable components that can be evaluated and improved over time.

1:01.6

Gideon Mendels is the co-founder and CEO of Comet.

1:05.5

He previously worked at Google on hate speech and deception detection, and he founded

1:10.2

GroupWise, which trained and

1:12.1

deployed NLP models processing billions of chats. In this episode, Gideon joins Kevin Ball

1:18.6

to discuss how agent development sits between software engineering and ML, why e-vals are the

1:24.7

missing foundation for most AI teams, prompt optimization as a search problem,

1:30.2

and the future for continuously improving agents in production.

1:34.9

Kevin Ball, or K. Ball, is the vice president of engineering at Mento and an independent

1:39.7

coach for engineers and engineering leaders. He co-founded and served as CTO for two companies,

1:45.3

founded the San Diego JavaScript meetup,

...

Please login to see the full transcript.

Disclaimer: The podcast and artwork embedded on this page are from Software Engineering Daily, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of Software Engineering Daily and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.

Copyright © Tapesearch 2026.