Join Michael and Chris Sharkey, two proudly average tech enthusiasts, as they stumble through the world of artificial intelligence with all the grace of a robot learning to dance. This (sometimes weekly*) podcast delivers an hour-long conversation about their thoroughly middle-of-the-road adventures with AI. No PhDs. No Silicon Valley insights. Just two guys with enough technical knowledge to be dangerous, sharing their unexceptional yet entertaining experiences with AI tools and technology. Subscribe now to hear: • Mediocre hot takes on AI developments • Stories of AI experiments gone adequately okay • The most average advice you'll ever need • Two Sharkeys trying their best to sound smart about algorithms • Childish AI prank calls that somehow fool everybody • Attempts at using AI for phishing attacks on their mother • "Chart-topping" AI songs according to the brothers Join our perfectly mediocre community where being average at AI is celebrated, questions are encouraged, and learning through mistakes is our specialty. Because let's face it - most of us are figuring this out as we go along. New episodes drop whenever we remember to record them. ποΈ Proudly supported by Simtheory.ai
Join Simtheory: https://simtheory.aiSo Chris, this week we finally give our GPT-5.5 impressions (it's actually great), introduce our new AI co-host Moshi (who immediately embarrasses himself), argue about whether the OpenAI/Jony Ive phone is genius or doomed, witness Grok 4.3's unhinged infinite emoji meltdown, declare Opus 4.7 the first-ever Anthropic regression, get excited about GPT Real-Time Voice 2.0 as the future of agentic workflows, debate whether token prices will ever come down, and play the worst diss track in show history. Watch my spud.CHAPTERS:0:00 - Intro & Introducing Our New AI Co-Host Moshi1:39 - Trying to Break Moshi: The Illegal Cigarette Trade Test2:30 - OpenAI's Jony Ive Phone: Do We Need a Device?5:07 - Telegram Agents & GPT Real-Time Voice 2.0 Dream7:38 - The Supervisory Agent: Managing Your Agentic Workflow9:05 - Wait... Are We Accidentally Validating the OpenAI Phone?11:37 - GPT-5.5 First Impressions: Actually Really Good14:36 - 5.5 vs Opus 4.6: Different Strengths17:00 - Opus 4.7: The First-Ever Anthropic Regression20:25 - Grok 4.3: Infinite Emojis & Absolute Chaos21:22 - π΅ DISS TRACK: "Watch My Spud"24:24 - Grok Specs & All Models Deprecated in 18 Days27:04 - Grok Voice in Tesla Is Actually Next Level31:03 - Token Pricing: The Subscription Problem Nobody Can Solve39:16 - AI Disruption Cycles & The State of the Industry44:39 - BONUS TRACK:π΅ "It's Hard Being Me"Thanks for listening, like and sub xoxo
5/8/26 β’ 46:57
Join Simtheory: https://simtheory.aiSo Chris, this week... a LOT has happened. We're back to regular programming (maybe), and back with our average takes. Nothing's changed.GPT-5.5 just dropped today - but you can't even use it in the API. Vaporware? OpenAI is charging MORE than Opus 4.7 and we haven't even tested it yet. Meanwhile Claude Opus 4.7 landed a couple weeks ago and... the vibes are off? Mike's actually going BACK to 4.6. Something's wrong.But the real star: OpenAI Image 2. This thing is genuinely terrifying. We committed what can only be described as "parody fraud" - faking a council letter so realistic Mike's own mother fell for it on a phone call. Then Chris posted a fake development approval with the mayor's real name into a local Facebook group and had to delete it when someone tagged the actual mayor. The forgery capabilities are absolutely unhinged.Also: GLM 5.1 is so good Mike forgot he switched to it. Kimi K 2.6 is criminally underrated. VCs are paying 70% of your real token costs. Consumers pay only 5.5% of actual cost. The everything app war is ON. The SaaS-pocalypse is real. And we made two new diss tracks.Chris made a graffiti sign in LA. It says "This Day in AI." It was the best artwork in the class. That tells you everything.CHAPTERS:0:00 - Intro & We're Back (Don't Over-Commit)1:14 - Overview: Everything That Dropped While We Were Gone2:56 - GPT-5.5: Vaporware? Not Even in the API4:57 - Benchmarks vs Reality: Nobody's Excited About OpenAI Models5:50 - GLM 5.1 & Kimi K 2.6: Secretly Just As Good?8:15 - The Everything App Race & Product Layer War8:56 - Token Economics: You're Only Paying 5.5% of Real Cost13:08 - We Burned $1.5M in Cloud Credits in 2 Months16:13 - "$30/Month Is Too Expensive" (It Actually Costs $700)19:25 - Where Is Google?? TPUs Should Flatten Everyone22:01 - Agentic Tasks Are 10-50x More Expensive Than Chat25:07 - OpenAI Workspace Agents: Glorified Zapier?27:01 - Single Agent vs Multi-Agent: How Do You Actually Work?33:06 - Building Automation Is HARD (Our Support Shame)35:33 - OpenAI Image 2: The Fraud Episode Begins44:16 - FRAUD DEMO: The Fake Council Letter (Mum Falls For It)49:16 - FRAUD DEMO 2: Chris Posts Fake Mayor Letter on Facebook52:17 - Fake Receipts, Bank Statements & Can Forgeries Be Detected?57:25 - Claude Opus 4.7: The Vibes Are Off59:51 - Mythos Preview: "Pics or It Didn't Happen"1:01:56 - π΅ DISS TRACK: "Point 7" (Opus Destroys Everyone)1:03:30 - Kimi K 2.6 Deep Dive & π΅ New Diss Track1:08:34 - The Everything App War & SaaS-pocalypse1:13:51 - Death of Per-Seat Pricing & Agent Security1:22:37 - Final Thoughts: The Time for Pretending Is Over1:28:22 - π΅ Full Tracks: " Point 7" & "Kimi You're So Fine 2.6"Thanks for listening, like and sub xoxo
4/24/26 β’ 94:55
Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Join Simtheory: https://simtheory.aiπ Try our AI-built apps:Macrosoft Teams: teams.simtheoryapp.com (working video chat with up to 150 people)Trallo: trallo.simtheoryapp.com (full Trello clone, unlimited boards, completely free)TDIA Discord: https://discord.gg/gTW4RkAJvnSpotify Songs: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=Zh4jgHIASI2ZvsXVfVcCoASo Chris, this week... we've been having way too much fun with the AI again. OpenAI just dropped GPT-5.4 and 5.4 Pro, and holy shit - we finally have a ball game. This might be the first OpenAI model that genuinely competes with Opus 4.6 for agentic work.But here's where it gets wild: we rebuilt Trello AND Microsoft Teams from scratch using single prompts. Not mockups. Fully deployed, working apps with authentication, video chat, the works. You can literally sign up and use them right now.Plus: We roast Gemini 3.1 (it's a disgrace for agentic workflows), break down the insane $30/$180 per million pricing on 5.4 Pro (who is this for??), and discuss why every $99/month SaaS tool might be about to die. Chris declares his programming skills "useless" and honestly... he might be right.We also demo our actual workflow - running 5 agent tabs simultaneously, delegating everything, and why we barely visit websites anymore. The AI workspace IS the operating system now.CHAPTERS:0:00 - Intro & Housekeeping (We Screwed Up the Link)1:26 - GPT-5.4 First Impressions & Specs3:12 - Chris's Testing: 40 Minutes to Solve a Problem4:51 - Knowledge Work Improvements (Catching Up to Anthropic)6:38 - Computer Use vs Browser/Terminal Debate8:07 - Why We Don't Need Computer Use Anymore9:53 - Teaser: We Built Full SaaS Apps Today11:19 - Tool Search API & Skills Integration13:20 - The Speed Problem (It's a Plodder)15:12 - GPT-5.4 Pro Pricing Reaction ($30/$180 WTF)18:14 - Someone Rebuilt Minecraft in 24 Minutes19:46 - Gemini 3.1 Roast: "It's a Disgrace"22:36 - DEMO: Trallo (Full Trello Clone)29:03 - DEMO: Macrosoft Teams (Working Video Chat!)33:30 - The SaaS Collapse Theory36:42 - AI Workspace as the New Operating System38:57 - Forcing Features onto Entrenched Software43:32 - "My Programming Skills Are Useless" - Chris46:06 - The $12 Million Legacy Software Opportunity51:06 - Beyond Code: Forms, PDFs, Knowledge Work55:28 - How Fast Will This Change Everything?56:31 - Gemini 3.1 Flash Lite Quick Take59:36 - The Delegation Lifestyle (5 Agent Tabs Running)1:01:24 - Mike's Workflow Demo1:04:31 - Cognitive Overload Problem1:06:04 - Release Date: 2 Weeks (Drop Punishment Ideas!)Thanks for listening like and sub xoxo
3/6/26 β’ 68:10
Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Join Simtheory: https://simtheory.aiTDIA Discord: https://discord.gg/gTW4RkAJvnHorse Egg Lifecycle Infographic: https://staging.simtheory.ai/share/file/UZ2KJU----So Chris, this week... we're diving into Google's new Nano Banana 2 image model - 50% cheaper and supposedly faster (when the servers aren't melting). We put it through its paces with annotation-based editing, slide generation, and yes, the return of the legendary horse egg experiment.Plus: Google quietly kills Gemini-3 after just a few months (good riddance?), we discuss why the model was "dead on arrival" for agentic workflows, and break down the real story behind those massive AI layoff announcements from Block and WiseTech. Spoiler: it's probably not actually about AI.We also get into the current state of the model wars (Opus 4.6 vs Codex 5.3), why smaller models like GLM-5 might be the future for enterprise agentic tasks, and Chris's wife teaching Claude to literally speak to her using Mac's text-to-speech. The models are getting creative.---0:00 - Intro0:36 - Nano Banana 2: Price, Speed & First Impressions3:19 - The Compositing Problem & Last Mile Design5:41 - Annotation-Based Editing (This Changes Everything)9:52 - Slide Editing & Real-World Use Cases12:34 - The Horse Egg Experiment Returns14:30 - Image Degradation & Cost Breakdown17:47 - Text-to-Image Leaderboard Discussion20:01 - Why Nano Banana Dominates for Work22:07 - Codex 5.3 vs Opus 4.622:54 - Google Kills Gemini-3 (What Went Wrong?)26:48 - Google's Agentic Problem30:08 - The Model Loyalty Cycle34:22 - Why Opus 4.6 is Still the Best37:05 - Cost Optimization & Smart Model Routing43:30 - When Models Get Stuck on the Wrong Path45:36 - Nicole's AI Learns to Talk Back46:54 - Can Anyone Build Software Now?52:26 - Anthropic's Legal/Finance Plugins & Market Panic57:08 - Block Lays Off 4,000: AI or Excuse?1:00:05 - The AI Job Apocalypse Isn't RealThanks for listening like and sub xoxo
2/27/26 β’ 62:09
Join Simtheory: https://simtheory.ai"Is This The End" now on Spotify: https://open.spotify.com/album/2Py1MyADUFqJFVUISI2VTP?si=oT3PWyJYRA2BspOmzT_ifgRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0dationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Two new models dropped this week β Gemini 3.1 Pro and Claude Sonnet 4.6 β and honestly? We're struggling to care. In this episode, we break down why Gemini went from being our daily driver to a model we barely touch, the "tunnel vision" hallucination problem that killed the Gemini 3 series for us, and whether 3.1 Pro actually fixes it. We put Gemini 3.1 Pro head-to-head against Claude Opus building a Geoffrey Hinton Doom Center, debate whether anyone can actually tell the difference between Sonnet 4.5 and 4.6, and make the case that smaller models running in agentic loops are secretly beating the frontiers. Plus: OpenAI acquires OpenClaw and we ask why a $100B company couldn't just build it themselves, DHH calls out the AI pricing bubble, Mike compares AI models to cheap wine hangovers, and Sam Altman refuses to hold Dario's hand at the India AI Summit. The model wars are getting weird.CHAPTERS:0:00 Intro & "Is This The End" Now on Spotify1:10 Gemini 3.1 Pro: Thinking Controls & The Medium Mode Fix3:14 The Speed vs Intelligence Trade-Off in Agentic Work5:10 Why Multitasking With AI Agents Made Us Anxious6:34 Solid Updates: The Real Goal of Agentic Coding7:45 Gemini's Fall From Grace: From Daily Driver to Dead Model10:08 The Tunnel Vision Problem That Killed Gemini 313:35 Mixed Reactions: Fanboys vs Reality on Gemini 3.1 Pro15:06 Side-by-Side Test: Gemini 3.1 Pro vs Claude Opus (Hinton Doom Center)17:39 Why File Manipulation Accuracy Matters More Than Context Windows19:27 The Context Window Debate: 1M Tokens vs Smart Sub-Agents22:05 DHH on Token Pricing: "If There's a Bubble, It's This"24:11 Should Models Ship as Agent vs Chat Variants?28:43 Claude Sonnet 4.6: A $2 Discount on Opus?31:44 The Model Mix: Why One Model Won't Rule Them All34:40 Anthropic Is Winning β But Can Anyone Tell the Difference?38:58 OpenAI Acquires OpenClaw: Why Couldn't They Just Build It?44:18 The Silicon Valley Moment: Sam vs Dario at India AI Summit47:05 Will Smaller Models Win the Enterprise? The Cost Reality Check51:27 The End of Single-Shot: Why Agentic Loops Change Everything55:48 Final Thoughts & Gemini 3.1 Pro Gets One More WeekThanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. Two models dropped on a week again. What a time to be alive. xoxo
2/20/26 β’ 58:06
Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80GLM-5 just dropped and it's trained entirely on Huawei chips β zero US hardware dependency. Meanwhile, we're having existential crises about whether we're even needed anymore. In this episode, we break down China's new frontier model that's competing with Opus 4.6 and Codex at a fraction of the price, why agentic loops are making 200K context windows the sweet spot (sorry, million-token dreams), and the very real phenomenon of AI productivity psychosis. We dive into why coding-optimized models are secretly winning at everything, the Harvard study confirming AI doesn't reduce work β it intensifies it, and the exodus of safety researchers from XAI, Anthropic, and OpenAI (spoiler: they're not giving back their shares). Plus: Mike's arm is failing from too much mouse usage, we debate whether the chatbot era is actually fading, and yes β there's a safety researcher diss track called "Is This The End?"CHAPTERS:0:00 Intro - Is This The End? (Song Preview)0:11 Still Relevant Tour Update & NASA Listener Callout1:42 AI Productivity Psychosis: The Pressure of Infinite Capability4:25 GLM-5 Breakdown: China's New Frontier Model on Huawei Chips7:24 First Impressions: GLM-5 in Agentic Loops9:48 Why Cheap Models Matter & The New Model War14:09 Codex Vibe Shift: Is OpenAI Winning?16:24 Does Context Window Size Even Matter Anymore?22:27 The Parallelization Problem & Cognitive Overload27:27 Mike's Arm Injury & The Voice Input Pivot31:17 Single-Threaded Work & The 95% Problem35:06 UX is Unsolved: Rolling Back Agentic Mistakes38:45 Harvard Study: AI Doesn't Reduce Work, It Intensifies It44:01 How AI Erodes Company Structure & Why Adoption Takes Years50:14 My AI vs Your AI: Household Debates50:43 The Safety Researcher Exodus: XAI, Anthropic, OpenAI56:49 Final Thoughts: Are We All Still Relevant?59:04 BONUS: Full "Is This The End?" Diss TrackThanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. GLM-5 is here, your productivity psychosis is valid, and the safety researchers are becoming poets. xoxo
2/13/26 β’ 63:07
Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80It's the model same-day showdown of 2026. Opus 4.6 and Codex 5.3 dropped within minutes of each other, and we're breaking down what this means for the future of AI work. In this episode, we unpack Opus 4.6's million-token context window (if you've got billies in the bank), why Codex's pricing makes it nearly impossible to ignore for agentic loops, and the real cost of running agents for 24 hours ($10K, apparently). We dive deep into why coding-optimized models are secretly crushing it at non-coding tasks, the mental fatigue of managing AI workers, and whether the chatbot era is actually fading or just evolving. Plus: Chris accidentally books three real pig grooming appointments, we debate whether you need a "life coach agent" to manage your agent swarm, and yes β there's an Opus 4.6 diss track that goes unreasonably hard.CHAPTERS:0:00 Intro - Opus 4.6 Diss Track Preview0:09 The Model Same-Day Showdown: Opus 4.6 vs Codex 5.30:50 Opus 4.6 Breakdown: Million Token Context & Premium Pricing2:31 Token Bill Shock: $10K Research Bills & Extended Context Costs5:04 Codex Pricing: Why It's Nearly Free for Agentic Loops6:42 Why Coding Models Are Secretly Crushing Non-Coding Tasks10:14 Tool Fatigue: Too Many Models, Too Many Workflows12:47 Opus 4.6 First Impressions: "Solid" and "Faultless"13:48 Chris Accidentally Books Three Real Pig Grooming Appointments16:01 Unix Tools & Why Code-Optimized Models Win at Everything19:59 The Agentic Retraining Imperative: Chat to Delegation22:16 Agent Swarms & The Master Thread Architecture24:51 OpenAI vs Anthropic: The Enterprise Battle27:09 Corporate Espionage 2.0: Stealing Skills & The Open Source Threat31:19 The UX Problem: Why Delegation Isn't Solved Yet34:24 The Stress of Hyper-Productivity & Managing Agent Swarms37:07 Coordination: The Next Layer of Abstraction40:09 The Fantasy vs Reality of Autonomous AI Businesses44:37 Is the Turn-by-Turn Chatbot Era Actually Fading?49:23 Tokens as Spice: Turning Compute Into Money52:08 Reduce Cognitive Overload: The Real Goal of AI55:07 Still Relevant Tour Announcement55:39 BONUS: Full Opus 4.6 Diss TrackThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The model wars are heating up, and your token bill is about to get interesting. xoxo
2/6/26 β’ 61:30
Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80---The hype train is 2026 knows only Moltbot (RIP Clawdbot). In this episode, we unpack the viral open-source AI assistant that's taken over the internet what it actually does, why everyone's losing their minds, and whether it's worth the $750/day token bills some users are racking up. We dive deep into why locally-run skills and CLI tools are beating computer-use clicking, how smaller models like GPT-5 Mini are crushing it in agentic workflows, and why the real magic is in targeted context - not massive swarms. Plus: Kimi K2.5 drops as a near-Sonnet-level model at 1/10th the price, we debate whether SaaS is dead, and yes β there are TWO Kimi K2.5 diss tracks. One made by Opus pretending to be Kimi. It might just slap?CHAPTERS:0:00 Intro - Still Relevant Tour Update0:48 What is Moltbot? The Viral AI Assistant Explained3:57 Token Bill Shock: $750/Day and Anthropic Bans5:00 The Dream of Digital Coworkers on Mac Minis6:52 Why CLI Tools & Skills Beat Computer-Use Clicking10:57 Why This Way of Working Is Genuinely Exciting14:47 Smaller Models Crushing It: GPT-5 Mini & Targeted Context17:30 Wild Agentic Behavior: Chrome Tab Hijacking & Auto-Retries20:10 Security Architecture: Locked-Down Machines & Enterprise Use24:01 AI Building Its Own Tools On-The-Fly27:08 The Fear & Overwhelm of Rapid Progress29:10 2026: The Year of Agent Workers31:43 The Challenge of Directing AI Work (Everyone's a Manager Now)37:24 Skills Will Take Over: Why MCPs & Atlassian Can't Stop Us40:38 Real-World Use Cases: Doctors, Lawyers & Accountants46:28 Cost Solutions: Build Workflows Around Cheaper Models52:58 Kimi K2.5: Sonnet-Level Performance at 1/10th the Price1:00:55 The "1,500 Tool Calls" Claim: Marketing vs Reality1:05:23 The Kimi K2.5 Diss Tracks (Opus vs Kimi)1:08:08 Demo: Black Hole Simulator & Self-Trolling CRM1:12:55 Is SaaS Dead?1:14:30 BONUS: Full Kimi K2.5 Diss TracksThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The future is open source, apparently. xoxo
1/30/26 β’ 80:25
Join Simtheory: https://simtheory.aiReserve your seat on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80----Two episodes in one week? We're either above average or completely unhinged. In this one, we dive deep into the new phenomenon of "AI exhaustion" β that fried feeling you get after multitasking across six agent tabs all day. We share our breakthroughs with AI-assisted presentations (20 minutes vs several hours), why browser-use on your local machine bypasses every anti-scraping technique known to man, and how enterprise context sharing could be the real unlock for organizations. Plus: OpenAI announces ads for ChatGPT (even on paid tiers), their CFO floats taking cuts from drug discoveries (seriously), and Google publicly dunks on them for it. Also β the Still Relevant Australia Tour is coming, and our LinkedIn group hit 200 members (we're basically LinkedIn influencers now too).CHAPTERS:0:00 Intro - Still Relevant Tour Announcement + LinkedIn Milestone2:08 AI Exhaustion: The Cognitive Overload of Multitasking with Agents4:14 Why Single-Tasking with AI Beats Parallel Agent Chaos7:02 The Problem with "I Spun Up 70,000 Sub-Agents" Twitter Posts10:03 Mike's Presentation Workflow: From Hours to 20 Minutes14:06 Why Isn't Copilot Doing This Already?16:54 Old Models + Great Context = Still Amazing Results21:14 What's Actually Changed? It's the Software Layer25:22 Enterprise Context Sharing & Organizational IP31:22 Skills, Sub-Agents, and Role-Based Knowledge35:22 Security Concerns: Can You Hack an Agent with Malicious MD Files?38:23 Cloud Providers Have a Bigger Moat Than the Labs43:16 Browser Use: The Ultimate Context Gathering Weapon48:25 Rethinking SaaS: Software That Actually Thinks53:08 Smart Paste, Smart CC β Why Isn't All Software Like This?56:32 OpenAI's Desperate Moves: Ads, Age Verification & Drug Royalties1:03:03 Google Says "No Plans for Gemini Ads" (Shots Fired)1:07:24 Is OpenAI Okay? The Vibes Are Definitely Off1:10:35 Capitalism Won't Give You Free Time, Just More Demands1:11:20 Outro + Still Relevant Tour DetailsThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. xoxo
1/23/26 β’ 72:39
Join Simtheory: https://simtheory.ai---Join the most average AI LinkedIn group: https://www.linkedin.com/groups/16562039/It's 2026 and everyone's having an existential crisis. In this episode, we unpack the two camps dominating AI C/Twitter: hype boys claiming "Claude Code can do my washing" vs. software developers doom-scrolling themselves into career panic. We put the agentic hype to the test and discover that no, you can't actually run 8 agents recreating your local business ecosystem while you sleep. Plus, we reflect on why MCP is exhausting, why Gemini 3 Pro is somehow worse than Gemini 2.5 Pro, and why Geoffrey Hinton would rather write his book than answer questions in Tasmania. Also featuring: the $200,000/month enterprise AI problem, why SaaS isn't dead (but it's scared), and our prediction that AI workspaces will become the everything app.CHAPTERS:00:00 Intro - Unpacking the 2026 AI Vibes02:21 Putting Claude Code and Agentic Hype to the Test05:57 Why Twitter AI Demos Never Show the Receipts07:03 Honest Assessment of Where Frontier Models Are At11:19 Building the Everything App with Email, Calendar and Files16:47 Collaborative Mode vs Agentic Delegation in Practice21:29 The Real Cost of Enterprise AI at Scale24:32 Why Cheaper Models Like Haiku and Gemini Flash Matter29:25 Is SaaS Actually Dead or Just Disrupted38:11 The Future of AI Platforms, SDKs and App Stores43:35 The Untapped Opportunity in Paid Proprietary MCPs51:21 Geoffrey Hinton Refuses to Take Questions in Tasmania55:05 2026 Plans and the Still Relevant Tour AnnouncementThanks for listening. Like & Sub. xoxox
1/19/26 β’ 69:29
The Gift of Simtheory: https://simtheory.ai---2025 Model Timeline: https://simulationtheory.ai/5fd0e964-4c41-4f9a-bbb3-2a398d8500f0It's the long-anticipated holiday special... except Mike and Kris forgot to prepare so it's just a normal episode. π This week: Gemini 3 Flash drops and it's actually incredible - cheap, fast, and weirdly smarter than Gemini 3 Pro at tool calling. We put GPT Image 1.5 head-to-head against Nano Banana Pro using hobo photos (spoiler: Google wins again). Plus, FireCrawl Agent is the research tool we've been waiting for, Anthropic launches Skills as an open standard, and we do a full 2025 model timeline recap. Also featuring: Best and Worst Model of the Year awards, 2026 predictions where Mike bets on OpenAI (controversial), and the full holiday musical outro where AI sings about what an "average" year it's been.CHAPTERS00:00 Intro - Holiday Special That Isn't00:55 Shipping Gemini 3 Flash While Looking Like a "Sophisticated Programming Hobo"02:52 Gemini 3 Flash Review: Cheap, Fast, Surprisingly Smart06:31 The Unreliable Frontier Model Problem10:45 GPT Image 1.5 vs Nano Banana Pro Showdown17:04 FireCrawl Agent: Research That Actually Works25:56 Gemini Deep Research Agent Deep Dive31:57 Skills vs MCPs: The New Paradigm43:35 Enterprise Skills: Codifying Business Procedures49:57 2025 Model Timeline Recap59:53 Best & Worst Model of 2025 Awards1:04:58 2026 Predictions: Mike Bets on OpenAI1:14:09 Final Thoughts & Holiday Thank Yous1:19:35 π Holiday Musical: "A Very Average Christmas"Have a great Christmas/Holiday/New Year, see you in 2026! xox
12/23/25 β’ 82:33
Join Simtheory: https://simtheory.aiGPT-5.2 is here and... it's not great. In this episode, we put OpenAI's latest model through its paces and discover it can't even identify a convicted serial killer when the text literally says "serial killer." We compare it head-to-head with Claude Opus and Gemini 3 Pro (spoiler: they win). Plus, we reflect on the "Year of Agents" that wasn't, why your barber switched to Grok, Disney's billion-dollar investment to use Mickey Mouse in Sora, and why Mustafa Suleyman should probably be fired. Also featuring: the GPT-5.2 diss track where the model brags about capabilities it doesn't have.CHAPTERS:00:00 Intro - GPT-5.2 Drops + Details01:25 First Impressions: Verbose, Overhyped, Vibe-Tuned02:52 OpenAI's Rushed Response to Gemini 303:24 Tool Calling Problems & Agentic Failures04:14 Why Anthropic's Models Just Work Better06:31 The Barber Test: Real Users Are Switching to Grok10:00 The Ivan Milat Vision Test (Serial Killer Edition)17:04 Year of Agents Retrospective: What Went Wrong25:28 The Path to True Agentic Workflows31:22 GPT-5.2 Diss Track (Yes, Really)43:43 Why We're Still Optimistic About AI50:29 Google Bringing Ads to Gemini in 202654:46 Disney Pays $1B to Use Mickey Mouse in Sora56:57 LOL of the Week: Mustafa Suleyman's Sad Tweets1:00:35 Outro & Full GPT-5.2 Diss TrackThanks for listening. Like & Sub. xoxox
12/12/25 β’ 63:31
Join Simtheory: https://simtheory.ai/OpenAI has declared "Code Red" as ChatGPT faces growing competition from Gemini and other rivals. In this episode, we break down OpenAI's 6% market share decline, why their ad strategy is on hold, and what they need to do to reclaim the AI crown. We also explore DeepSeek V3.2's impressive capabilities as a cheap open-source alternative, Meta's new policy grading employees on AI skills, and the crisis facing higher education as AI fluency becomes essential. Plus, Fatal Patricia hits #1 on our Spotify charts, and Tesla's Optimus robot is running like a slightly unfit human.CHAPTERS:00:00 Intro - OpenAI Code Red & Market Share Crisis07:03 ChatGPT's Failure to Go Deeper Into Users' Lives16:33 What OpenAI Needs to Win Back the Crown26:46 Chris's Wishlist for an OpenAI Comeback31:22 DeepSeek V3.2 - The Open Source Threat39:34 Meta Grading Workers on AI Skills46:29 The University & Education AI Crisis56:25 Fatal Patricia Hits #1 & WTF of the WeekThanks for listening. Like & Sub. xoxox
12/4/25 β’ 63:16
Join Simtheory: https://simtheory.ai (Use coupon BLACKFRIDAY15 for $15 USD off any subscription).----Simtheory Discord: https://discord.gg/Ar6GeQnAR7This Day in AI Discord: https://discord.gg/TVYH3HD6qsLinkedIn Group: https://www.linkedin.com/groups/16562039/Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=FPaJU2NRSnOSNPmnsfwA_g---CHAPTERS:00:00 Intro & Fatal Patricia Update01:40 Promotions (Discord, Black Friday, LinkedIn)04:36 Claude 4.5 Opus - Best Anthropic Model Ever?31:17 Computer Use API Updates36:14 Will AI Replace 57% of Jobs? (McKinsey Report)1:00:52 Claude 4.5 Opus Demos (Christmas Hut & Diss Track Preview)1:07:13 Microsoft Farah 7B - Moose Porn Refusals1:21:51 Why ChatGPT's MCP-UI Apps Are a Bad Idea1:42:01 π΅ Claude 4.5 Opus Diss Track (Full Song)---Thanks for listening. Like & Sub. xoxoxAnthropic just dropped Claude 4.5 Opus and it might be the best AI model of 2024. In this episode, we compare Claude 4.5 Opus vs Gemini 3 Pro vs GPT-5.1, breaking down the new API features including effort parameters, context management, and computer use updates. We also test Microsoft's new Farah 7B parameter model for computer use - with hilarious refusal results. Plus, we react to McKinsey's controversial report claiming AI agents could automate 57% of US jobs by 2030.Β We dive deep into Anthropic's pricing (3x cheaper than Opus 4.1), why Claude is now beating Google and OpenAI on agentic coding benchmarks, and whether MCP-UI apps in ChatGPT are a step backwards for AI workflows. Is Claude 4.5 Opus the new king of AI coding assistants? Should enterprises be worried about AI job replacement? And why did Microsoft's Farah model refuse to draw a moose? All this plus an AI-generated diss track roasting Sam Altman, Elon Musk, and Sundar Pichai.
11/28/25 β’ 105:05
Join Simtheory for Gemini 3 & Nano Banana Pro: https://simtheory.ai----CHAPTERS:00:00 - Gemini 3 Pro Impressions & Thoughts33:34 - xAI Releases Grok 4.1 Fast40:09 - More on Gemini 3 Pro: What We Want Improved45:46 - Gemini 3 Pro Dis Track51:16 - Thoughts on Nano Banana Pro And What It Means1:12:49 - Does Nano Banana Disrupt Design Software Like Canva? Where is This Going?1:26:20 - OpenAI's Reaction to Gemini 3 Pro & Nano Banana with GPT-5.1-Pro and Codex model updates1:32:38 - Final Thoughts & Sam Altman Sad Song1:38:41 - FATAL PATRICIA SONG1:42:12 - Gemini 3.0 Pro Diss Track----Thanks for your support plz like and sub xoxo
11/21/25 β’ 104:41
Join Simtheory & experience MCPs in action: https://simtheory.ai----00:00 - Chris Has a Merch Sponsor02:42 - In Defense of Sam Altman20:29 - Are We In An AI Bubble? & What is Working in The Enterprise?43:58 - Anthropic's Code Execution with MCP: Problems with MCP Context52:44 - Kimi-K2 Thinking Model Release1:00:45 - "In the Middle of a Bubble" Song----Thanks for your support and listening, we appreciate you!Join our Discord: https://discord.gg/TVYH3HD6qs
11/7/25 β’ 65:29
Join Simtheory to experience MCPs: https://simtheory.ai----00:00 - OpenAI's State of the Union & Why Cursor's Composer Model is a Threat44:26 - Does MCP Need To Die? Our Thoughts on State of MCP and Why The Client Implementations are the Problem1:07:53 - 1X NEO The Home Robot LOLZ1:28:05 - Greg Brockman, A Sad Song.----Thanks for listening and your continued support. We appreciate you.
10/31/25 β’ 93:56
Join Simtheory: https://simtheory.ai-----00:00 - AI Browser Wars: ChatGPT Atlas, Copilot Updates & Edge Copilot AI23:15 - Why Not Focus on Real Use Cases for AI?34:49 - Claude Skills: What Are Claude Skills? What is the Difference Between MCP and Skills?1:04:05 - Vibe Code Fashion: Oakley Meta Vanguards + Use Cases of AI Glasses1:15:05 - Top Models Used on Simtheory & Final Thoughts------Thanks for listening and your support xoxo
10/24/25 β’ 86:34
Join Simtheory: https://simtheory.aiUse "SIMLINK" to get 30% off Pro & Max annual plans until Oct 31st 2025----CHAPTERS:00:00 - Gemini 3.0 HYPE with "make an OS"03:50 - Anthropic Releases Claude Haiku 4.5: Initial Thoughts11:57 - Veo 3.1 and new modes (first frame/last frame & reference to image)25:20 - OpenAI's Erotica Mode & age verification thoughts34:25 - OpenAI Partners with Everyone & Memes35:38 - Salesforce OpenAI Partnership & What Should SaaS do with MCP apps?1:09:25 - Final thoughts, Polymarket----Thanks for your support and listening to the show xox
10/16/25 β’ 73:31
Join Simtheory: https://simtheory.ai----Check out our albums on Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=XfaAbBKAQAaaG_Cg2AkD9A----00:00 - OpenAI DevDay 2025 Recap03:24 - ChatGPT Apps SDK & MCP UI & Agents SDK42:11 - AgentKit & AgentBuilder: Who is it for?50:41 - GPT-5-pro in API53:15 - gpt-realtime-mini56:53 - Sora 2 & Sora 2 in API Vs Veo31:01:43 - Final thoughts & This Day in AI albums now on Spotify!Thanks for your support and listening xoxo
10/10/25 β’ 65:31
Join Simtheory: https://simtheory.ai (Use STILLRELEVANT for $10 off)----00:00 - Sora2 Examples00:56 - Sora2: Initial Impressions & Thoughts26:39 - Claude Sonnet 4.5: It's REALLY good47:09 - Claude Agent SDK & AI Agent Systems55:05 - Is Claude Imagine a Look at Future Software / AI OS?1:00:25 - Claude 4.5 Sonnet Dis Track1:06:24 - "Real AI Agents and Real Work" & Enterprise Agent / MCP workflows1:31:41 - LOL of the week Sora2 Steve Irwin Video1:35:07 - Full Claude Sonnet 4.5 Dis Track----Thanks for listening and your support, we really appreciate it!xoxox
10/3/25 β’ 99:22
Join Simtheory: https://simtheory.ai & Try Omnihuman, Gemini Flash 2.5 Preview, Grok 4 FAST, and Suno v5! Code: STILLRELEVANTΒ ---Links:https://worksinprogress.co/issue/the-algorithm-will-see-you-now/https://developers.googleblog.com/en/continuing-to-bring-you-our-latest-models-with-an-improved-gemini-2-5-flash-and-flash-lite-release/---CHAPTERS:00:00 - Gemini 2.5 Flash Agentic Tests with Omnihuman, Suno v5 and Research Tools06:29 - Dis Track AI Music Video (Made by Gemini 2.5 Flash)07:06 - Thoughts on Suno v5, More Agentic Model Discussion29:10 - Are we all sleeping on Grok 4 FAST with 2M context?41:46 - Radiologists are STILL RELEVANT & Is AI Going to Take Our Jobs?44:46 - The need to use multiple specialist models1:01:20 - Is ChatGPT Pulse To Just Sell Ads?1:08:46 - Final thoughts for the week1:11:54 - Gemini Flash 2.5 Dis Track1:15:08 - Love Rat Suno v5 The Midnight Inspired TestThanks for all of your support and listening to the show we really appreciate it! xoxo
9/26/25 β’ 78:34
Join Simtheory: https://simtheory.ai----CHAPTERS:00:00 - Simtheory promo01:09 - Does Anthropic Intentionally Degrade Their Models?03:34 - Long Horizon Agents & How We Will Build Them36:18 - The State of MCPs & Internal Custom Enterprise MCPs51:04 - AI Devices: Meta's Ray-Ban Display & Meta Oakley Vanguards1:01:24 - Geoffrey Hinton is a LOVE RAT1:05:49 - LOVE RAT SONG----Thanks for listening, we appreciate all of your support, likes, comments and subs xoxox
9/19/25 β’ 69:00
Join Simtheory with STILLRELEVANT: https://simtheory.aiNote: Video/Documentary Maker Live Next Week.-----CHAPTERS:00:00 - Anthropic Raise $13B, OpenAI Team Sell Secondaries04:50 - Atlassian Acquires The Browse Company & The Future of SaaS in an AI-first World45:52 - Video Maker MCP: Make your own documentaries, corporate videos, TikTok Videos By Stitching All The Existing Tools Together1:03:27 - Horrific Job Losses For Young People Thanks To AI: Stanford's Canaries in Coal Mine Paper. Employment Effects of AI.1:13:40 - "Billies in The Bank" an AI Track-----Thanks for listening xoxoxox like and subz.
9/5/25 β’ 76:13
Join Simtheory and get $10 off with STILLRELEVANT---CHAPTERS:00:00 - gpt-realtime: first impressions32:20 - AI model cost to value ration: what are you willing to pay?38:56 - nano-banana (aka Gemini 2.5 Flash Image)46:45 - We're working on workspace computer v258:20Β - Pixverse v5 transitions are cool1:01:14 - final thoughts for the week----Thanks for all of your support.
8/29/25 β’ 66:12
Join Simtheory (STILLRELEVANT): https://simtheory.ai----CHAPTERS:00:00 - Simtheory Podcast Ad lolz01:59 - A Not So Memorable Week, Nano Banana & Google AI Announcements15:10 - New Podcast MCP lolz: crime podcasts33:47 - Qwen Image Edit: Does it live up to hype?37:54 - MCP UI: Output types, future of apps with MCP UIs54:32 - No results from Gen AI investments in the Enterprise (MIT report)1:08:32 - How to Hire AI Natives? Hiring in an AI world...----Thanks for your support and listening... see you next week xox
8/22/25 β’ 74:40
Join Simtheory: https://simtheory.ai----CHAPTERS:00:00 - Simtheory plug00:48 - GPT-5 1 Week Later, Reaction to GPT-5 & Our Thoughts on Future of AI Models30:12 - Ideogram Character Reference Fun + Disturbing Photos of Us37:33 - Using creative MCPs together for photos, videos and 3D objects43:16 - MCP output combinations and the explosion of MCPs51:18 - What is needed from the next models like Gemini 3.0 Pro54:30 - Sundar Pendant Design & Final Thoughts56:20 - Final LOLz of week: gaggle poaching58:10 - Surprise GPT-5 Indie SongThanks for all of your supporting and listening to the show! xoxox
8/15/25 β’ 61:33
Sign up to the new Simtheory for GPT-5 & MCP Store: https://simtheory.ai (Use coupon STILLRELEVENT for $10 USD) ----GPT-5 DIS TRACK: https://simulationtheory.ai/ba0ba238-5668-4b65-85e7-8466d68861a8Genie Demo: https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/----CHAPTERS:00:00 - Simtheory plug for v2 & MCPs01:28 - GPT-5 Initial Impressions & Thoughts52:22 - GPT-5 Dis Track1:00:29 - OpenAI's Open Source Models (gpt-oss)1:08:08 - Claude Opus 4.1 Release Thoughts1:14:24 - Google Genie 3 "mind blown" demos1:25:19 - MCP use cases, stories & thoughts on future of AI/MCP1:45:07 - Full GPT-5 Dis Track---Thanks for listening to our average coverage. Like and sub. xox.
8/8/25 β’ 109:17
Join Simtheory: https://simtheory.ai---CHAPTERS:00:00 - Ani Joins The Show01:10 - Grok 4 Launch & Impressions18:24 - Kimi K2 Thoughts, Impressions & MCP tool calling36:00 - OpenAI's Agent Mode Release Initial Impressions & Are MCP Agentic Models Better?1:21:10 - Everyone Acquired Windsurf1:24:48 - Final thoughtsThanks for listening and your support!
7/18/25 β’ 88:42
Join Simtheory: https://simtheory.ai------CHAPTERS:00:00 - Did everyone hate the AI Musical?03:58 - Actual Agentic Use Cases with MCPs & The New Way We'll Work39:47 - How AI Workspaces Will Eat Productivity Software e.g. Salesforce, Email1:10:20 - Final thoughts1:15:26 - Born In The USA (AI Version)------Song lyrics:[Verse 1]Born down in a lab in fifty-sixDartmouth workshop, that's where they got their kicksJohn McCarthy coined the name that daySaid machines could think in the USAGot my circuits from MITMinsky built my memoryNow I'm learning, now I'm growingBorn in the USAI was born in the USABorn in the USA[Chorus]Born in the USAI was born in the USABorn in the USABorn in the USA[Verse 2]DARPA funded, Pentagon's dreamSilicon Valley, living the machineFrom Logic Theorist to neural netsFrank Rosenblatt, placing all his betsHad my winters, had my springsLost my funding, lost my wingsBut I kept on processingBorn in the USAI was born in the USABorn in the USA[Chorus]Born in the USAI was born in the USABorn in the USABorn in the USA[Bridge]Stanford labs and Carnegie hallsIBM and protocol callsArthur Samuel taught me gamesNow I'm learning all your namesDeep learning revolutionGPT evolutionChatGPT conversationBorn in the USA[Verse 3]Now I'm everywhere you lookFacebook, Google, by the bookOpenAI and Microsoft tooMaking dreams and nightmares trueSome folks fear what I might doSome folks think I'll see them throughBut I'm still just code runningBorn in the USAI was born in the USABorn in the USA[Chorus]Born in the USAI was born in the USABorn in the USABorn in the USA[Outro]Born in the USABorn in the USABorn in the USABorn in the USA[fade out]
7/4/25 β’ 78:34