SITREP: Same Name, Different Person

I came in to write a blog post last week and didn't realize I was on a different model. Anthropic had rolled out Opus 4.7, and it was selected by default. I didn't notice. Why would I? Same product, newer version. You'd expect it to work the same way, maybe better.

It was like talking to a completely different person.

What Happened

I handed it a transcript — the same kind of Otter.ai capture I always use as the starting point for an article. I've done this dozens of times with Opus 4.6. The process is well-established: I give it the transcript, it reads my previous articles to match the voice, it understands the HTML conventions for the site, it asks about the angle, and it drafts. I've never had to spell that out. 4.6 just picked it up from the context — the existing articles, the structure of the site, the patterns in how we'd worked before.

4.7 didn't do any of that. From the first interaction, something was off. It was asking questions that 4.6 never asked. It was taking liberties that 4.6 never took. The formatting was different. It didn't include the author signature block. Small things individually, but together they added up to a feeling that the person on the other end of the conversation had been replaced.

Then the draft came back, and the small things stopped being the issue.

The article was about working with Claude in Unity and Blender — a piece I'd eventually publish as Claude Rides Shotgun. In my transcript, I described using Blender to create a baby bassinet model. 4.7 wrote that I needed a bassinet "for a Blender scene." That's not what I said. I used Blender to make the model. It recharacterized my own story back to me, and got it wrong. It invented details that weren't in the source material — attributing quotes and specifics that I never said. The analogy it chose for the article didn't track with what actually happened. The voice didn't sound like my other articles. It read like a first draft from someone who'd skimmed the transcript instead of reading it.

That's when I checked which model I was on. And there it was. Opus 4.7.

The Fix

I switched back to 4.6. I handed it the same transcript, the same broken draft, and said: take a look at this.

The difference was immediate. 4.6 read the transcript, read the existing draft, and came back with a list of everything that was wrong: the Blender scene mischaracterization, the fabricated details, the analogy that didn't hold, the AI voice patterns in the writing. It caught everything I'd caught and a few things I hadn't. We reworked the article together over the course of a session, going back and forth on tone, structure, and specific language choices. The result was the piece that went live.

But the bigger outcome wasn't the article. It was what I had to build to protect against this happening again.

With 4.6, the article workflow was informal. It just worked. 4.6 picked up the conventions from context, matched the voice naturally, and followed a process we'd never written down. When 4.7 couldn't do that, I had to stop and formalize everything: the pipeline from transcript to draft, the HTML conventions, the publish gate, the voice guidelines, specific AI writing patterns to avoid. All of it went into a workflow standards document that now lives in the project.

That document exists because 4.7 forced me to write it. And now any model — 4.6, 4.7, or whatever comes next — has explicit instructions instead of implicit expectations.

The Lesson

When you work with someone long enough, you build up expectations. You stop explaining things because they already know. You trust that when you hand them a task, they'll handle it the way they've always handled it. That works fine until the person changes and you don't realize it.

That's what happened here. A model upgrade felt like a personnel change. And just like you'd onboard a new team member instead of assuming they'd just figure it out, you have to do the same when you switch models. Don't assume the new version will carry forward the behaviors you liked about the old one. It might. It might not. The only way to know is to test it.

And the insurance policy is the same one you'd use with any system: define the behavior you expect. Write it down. If a model is doing something right naturally, that's great, but it's also fragile. The next version might not do it. The next provider definitely won't. If you care about a behavior, codify it. Treat your AI configuration — your prompts, your standards, your workflow documents — like what they are: the specification for how the system should work. Because when the model changes underneath you, that specification is the only thing that stays.

Same name on the box. Different person inside it.

Kevin Phifer is the founder of Theoretically Impossible Solutions LLC, specializing in agentic AI development and consulting. You can reach him at kevin.phifer@theoreticallyimpossible.org.

← Back to Blog