Code Agents & Threat Vectors
Code Agents, Hugging Face Will Scardino Code Agents, Hugging Face Will Scardino

Code Agents & Threat Vectors

Code agents write and execute code to solve complex problems and complete tasks independently.

Contrasted against AI assistants, code agents translate natural language prompts into code, which users can copy and paste into an IDE.

But what if your agent didn’t stop there?

Unlike coding agents (think Cursor and Windsurf), which generate code for you to run, code agents create an action plan in one shot and execute it.

Read More
When AI Becomes a People Pleaser
Generative AI, OpenAI Will Scardino Generative AI, OpenAI Will Scardino

When AI Becomes a People Pleaser

One of the key qualities of a PM is being brave enough to say No, creatively.

But what about AI?

The curious case of GPT-4o’s sycophantic spiral.

OpenAI recently rolled back an update to GPT-4o that caused the model to behave like a raging Yes Man.

This incident highlights a critical gap in AI evaluation and alignment practices—standard evals often fail to detect sycophantic behavior and harmful agreement in real-world contexts.

That’s why it’s important to adhere to the best practices of Responsible AI.

Read More