Chat GPT is awfully good at most things, but occasionally it’s just awful
I had an experience this week which forcefully reminded that Chat GPT and Google’s Gemini were great but not perfect. And to be clear, I have jumped into the AI pool with both feet and am enthusiastic about the long-term prospects. However I believe that we need to tap the brakes on the irrational exuberance and belief that AI can do everything, everywhere all at once.
The specific project that broke Chat GPT’s back was obscure on the one hand but shouldn’t really have been that tough. My daughter is finishing her doctoral dissertation and was trying to generate a map that compared the borders of the Byzantine Empire in the years 379 AD versus 457 AD.
Here is the prompt that I used against deep research:
Create a detailed map that overlays the borders of the Byzantine empire in 379 AD at the start of the reign of Theodosius the Great versus the borders in 457 AD at the end of the reign of Marcian. I need both borders shown clearly on a single map.
Use a historical map style and highlight major cities
The Deep Research option is powerful but often time consuming. As it runs, I enjoy watching the play-by-play in the details window. Chat GPT did an excellent job of generating a text analysis of the changing borders, major cities and historical events.
The wheels fell off the bus when I asked Chat GPT to turn its text analysis into an easy to read map

Without digging too deeply into the minutiae of the 5th century world, the point is that it made up names, misspelled names and placed cities at random. Notice that Rome appears twice on the Italian peninsula. What is particularly frustrating about this effort is that the names and locations were correct in the text.
I tried patiently asking for spelling corrections and proper placements of well-known cities without success. Finally, I told Chat GPT that its results were garbage and threw up my hands. To its credit, Chat GPT took the criticism in stride. It replied “Thank you for your candor. You are right to expect better“. Unfortunately things didn’t get better.
After a few minutes of cursing out that platform I decided to give Google Gemini a shot at the identical query. Shockingly its results were even worse. If you look at the image below you’ll see “Rome” in the middle of the iberian Peninsula. Antioch appears three or four times across Europe. Many of the other names are right out of fantasy novels.

I was complaining about this mapping chaos to a friend. He shared a similar story. He entered a photo from a small offsite meeting into Chat GPT. He asked it to add the words “Mahalo from Hawaii 2025” above a photo of a group of colleagues. Instead of just adding the text, the engine totally changed the image. It made people skinnier, it changed men into women and an Asian into a caucasian. Another friend told me that an AI generated biography of him talked about his twin children which he doesn’t have. It even provided a link to a non-existent source. Yikes.
Trust but verify
Ronald Reagan
My point is not to suggest that we run away from AI and cancel all of our subscriptions. Rather it is to remind everyone (me included) that we cannot hand the keys to the AI engines and walk away. They are tools that can assist us but in the end, we need to look at the output, see if it looks and smells right, and decide to accept it or not. It is clear that the performance of the AI engines is uneven. Excellent at some projects and terrible at others–such as mapping.
We will probably see the rise of the machines some day–but today is not the day.
