AI prompt editor and evaluations tooling now supports multi-turn conversations

You can now save and evaluate multi-turn conversations in the GitHub Models prompt editor and evaluations tooling!

Until now, the evaluations tooling only supported a single user prompt. With this update, you can include up to four rounds of user and assistant messages directly in your .prompt.yml file and test how models respond at the end of a longer interaction. In the API, you can include unlimited pairings.

This is especially useful for:

Testing memory and context retention. For example, in the case of a travel bot, does it still recommend snowy places by turn four after the user says “I want a cold destination” in turn two?
Ensuring consistent behavior as instructions evolve. For example, a shopping assistant where the user first says “make it under $100,” then later changes it to “under $200,” and the assistant correctly adjusts its recommendations.
Evaluating real-world chat flows. For example, a customer support agent that needs to escalate properly after several back-and-forth troubleshooting steps.

Start building AI apps with GitHub Models today

GitHub Models and all our AI development tooling are available now to all GitHub users in public preview. This includes prompt editing and lightweight evaluations. Try our tools out by enabling them in your repository or organization, or learn more in our documentation.

Help us shape what’s next

We’re just getting started, and your feedback helps guide our roadmap. Join the community discussion to share your thoughts and connect with other developers building the future of AI on GitHub.

JUN.16Retired
GitHub Models is no longer available to new customers
- Ecosystem and Accessibility
MAY.15Improvement
GitHub App installation tokens: Per-request override header
- Ecosystem and Accessibility
MAY.13Improvement
New enterprise installation API now in public preview
- Ecosystem and Accessibility
APR.20Retired
Sunsetting SHA-1 in HTTPS on GitHub
- Ecosystem and Accessibility
MAR.12Release
REST API version 2026-03-10 is now available
- Ecosystem and Accessibility
FEB.3Release
The Dependabot Proxy is now open source with an MIT license
- Ecosystem and Accessibility
JAN.12Improvement
Selectively showing "act on your behalf" warning for GitHub Apps is in public preview
- Ecosystem and Accessibility
NOV.7Retired
GraphQL Explorer removal from API documentation on November 7, 2025
- Ecosystem and Accessibility
OCT.31Retired
Deprecated models in GitHub Models
- Ecosystem and Accessibility

AI prompt editor and evaluations tooling now supports multi-turn conversations

Start building AI apps with GitHub Models today

Help us shape what’s next

Related Posts

Subscribe to our developer newsletter