Hands on: I tried ChatGPT Agent and Perplexity Comet to actually get things done—and I’m not sure if I’m witnessing the future of productivity or just an elaborate way to make simple tasks more complicated.
The premise sounds great: tell an AI what you want, and it’ll just… do it. Biggest difference between the two: ChatGPT Agent is a sandboxed browser inside your browser. Perplexity Comet is the actual browser, carrying over all your logins from Chrome.
1. The Great Train Booking Experiment: I asked both agents to book me the cheapest train from Hamburg to Copenhagen. Simple enough, right? I mean, I could knock this out in under a minute on the Deutsche Bahn website.
ChatGPT Agent found the better deal at €63.26, but watching it navigate the Bahn website was like watching someone perform surgery with oven mitts. Ten excruciating minutes of clicking cookie banners and scrolling through options. Perplexity Comet found the same connection in 3 minutes and 40 seconds but missed out on the savings, landing on a €70.94 fare.
2. Lost in Translation: I asked for the biggest news in Germany in Axios-style bullet points, and both agents… used only English sources? In 2025? When AI can translate faster than I can say “Guten Tag”?
Perplexity Comet took 20 seconds and served up some articles, among them week-old news about a train derailment. ChatGPT Agent took nine minutes (nine!) but delivered actually witty, recent summaries. It appeared to read different sources, but used only two in the end.
3. The Guest List Gender Test: I asked both agents to help me find potential newsletter guests by checking past issues and my LinkedIn connections. I alternate between male and female guests, a rather obvious pattern.
Perplexity Comet got close: In less than two minutes, it crawled through my newsletter archive, scanned my LinkedIn connections, checked out 14 profile pages, and delivered a table with 13 solid suggestions. Five women out of 13.
ChatGPT Agent spent minutes opening browsers, methodically reading through all newsletter issues, getting confused by non-existent archive pages, making me log into LinkedIn, and painstakingly clicking through connection profiles and their experience sections like it was conducting a digital census. The result after eight minutes? Nine solid entries, but only one woman.
Only one.
Now, I’m not saying ChatGPT Agent is secretly plotting against gender diversity in newsletter interviews. But it took about four times as long as Perplexity Comet and still managed to recreate exactly the kind of unconscious bias that many of us are actively trying to avoid. Makes you wonder: When we hand over these kinds of decisions to AI, whose patterns are we really perpetuating?
Help me help you: There are two more examples on my blog, one about buying stuff and one about making playlists. Maybe I need better use cases. Maybe I’m thinking too small. While I don’t have any invites to share (yet), I’m happy to work on your behalf: What tasks would you test these agents with? Send me your most ambitious prompt ideas!
Speaking of newsletter guests: This week, it’s one of the most sought-after keynote speakers on AI and journalism, host of the Newsroom Robot podcast, journalist turned media advisor Nikita Roy. She has an important message for everyone who thinks that all of this is just another pivot to video.