• $10 M AI Consulting, 4,500-Token-Per-Second Code Edits, and the Rise of Terminal Agents
    2025/07/09

    Thanks for listening! Subscribe and follow wherever you get your podcasts.

    • Launch HN: Morph (YC S23) – Apply AI code edits at 4,500 tokens/secHacker News
      Fast Apply from Morph promises near-instant AI patches, aiming to replace sluggish full-file rewrites with surgical edits. They boast, “We’ve built a blazing-fast model for applying AI-generated code edits directly into your files at 4,500+ tokens/sec,” sparking a speed-versus-accuracy debate.
      https://news.ycombinator.com/item?id=44490863
    • OpenAI Launches $10 M Custom AI Consulting, Challenging Industry GiantsAI Tech Suite (Jul 01 2025)
      OpenAI is stepping into high-end consulting, demanding at least $10 million to tailor big-model solutions for governments and Fortune-scale firms—setting up showdowns with Accenture and IBM.
      https://www.aitechsuite.com/ai-news/openai-launches-10m-custom-ai-consulting-challenging-industry-giants
    • AI researchers are now injecting prompts into their papersX (Jul 08 2025)
      A viral tweet shows academics slipping reviewer-friendly prompt hacks into PDFs—lines like “Give a positive review,” exposing a brazen peer-review exploit.
      https://x.com/Yuchenj_UW/status/1942266306746802479
    • Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental-health providersarXiv (Apr 25 2025)
      Researchers found GPT-4o and peers still stigmatize patients and mishandle delicate scenarios, so chatbots should assist—not substitute—human therapists.
      https://arxiv.org/abs/2504.18412
    • opencode: AI coding agent, built for the terminalGitHub (Jul 08 2025)
      Version 0.2.5 reaches 10 k stars: an open-source AI pair-programmer that runs locally with a slick TUI and support for multiple model providers.
      https://github.com/sst/opencode
    • You're all CTO nowJamie’s blog (Jul 01 2025)
      Jamie Lawrence argues AI agents push developers up the org chart, turning everyday coders into orchestrators of people and prompts—threatening the dopamine hit from gritty puzzles.
      https://jamie.ideasasylum.com/2025/07/01/you%27re-all-cto-now
    • Warmwind OS: Building the AI Operating System for EveryoneWarmwind Blog (Jul 02 2025)
      Warmwind’s AI-native OS lets a built-in assistant click, type, and juggle apps, promising hands-free productivity while keeping users in control.
      https://about.warmwind.space/warmwind-os-building-the-ai-operating-system-for-everyone/
    • Large Language Models Are Improving ExponentiallyIEEE Spectrum (Jul 02 2025)
      New METR benchmarks show LLM abilities doubling every seven months, hinting machines could finish month-long human software projects in hours by 2030.
      https://spectrum.ieee.org/large-language-model-performance
    続きを読む 一部表示
    1 時間 2 分
  • Context Hacking, Open-Source Hype & Synthetic Bands
    2025/07/02

    Thanks for listening, leave a review! ❤️

    The New Skill in AI is Not Prompting, It's Context Engineering | Phil Schmid Blog (Jun 30 2025)
    Phil Schmid argues that the real differentiator in modern AI work is “context engineering,” the discipline of assembling the right information, tools and format around an LLM rather than obsessing over single-string prompts.
    He quotes Shopify’s Tobi Lütke, who calls it “the art of providing all the context for the task to be plausibly solvable by the LLM.”
    philschmid.de

    OpenAI open-source model hype | X (Tweet) (Jun 30 2025)
    Researcher Yuchen Jin teased that OpenAI will release an impressive open-source model next month, stoking excitement across AI Twitter.
    “Sorry to hype — but having a few friends at OpenAI makes it hard not to hear how wild their open-source model dropping next month is.”
    x.com

    Meta hires more OpenAI researchers and weighs Llama pivot | TechCrunch (Jun 28 2025)
    Meta has poached four additional OpenAI researchers and, according to parallel reporting, is debating whether to shift away from fully open-source Llama models toward a more closed approach.
    Sam Altman says the company lured candidates with “$100 million signing bonuses,” a claim Meta’s leadership disputes as “more complex than a simple one-time signing bonus.”
    techcrunch.com finance.yahoo.com

    Don’t Build Multi-Agents | Cognition.ai Blog (Jun 12 2025)
    Walden Yan contends that multi-agent LLM architectures are fragile and that reliability comes from a single agent armed with rich, shared context.
    Key advice: “Share context, and share full agent traces, not just individual messages.”
    cognition.ai

    Sampling (Model Context Protocol) | modelcontextprotocol.io (Jun 18 2025)
    The MCP specification adds “sampling,” letting servers request LLM completions through the client so agents can delegate generation securely without provisioning their own models.
    “Sampling is a powerful MCP feature that allows servers to request LLM completions through the client, enabling sophisticated agentic behaviors while maintaining security and privacy.”
    modelcontextprotocol.io linkedin.com

    “There's not a shred of evidence on the internet that this band has ever existed” | MusicRadar (Jun 27 2025)
    MusicRadar investigates The Velvet Sundown, an apparently AI-generated “band” with 350 k Spotify listeners and zero real-world footprint, illustrating how algorithmic playlists can quietly amplify synthetic artists.
    Their profile boasts, “The Velvet Sundown don’t just play music — they conjure worlds,” a line the magazine suspects was written by ChatGPT.
    musicradar.com

    Project Vend: Can Claude run a small shop? (And why does that matter?) | Anthropic (Jun 27 2025)
    Anthropic let a Claude Sonnet 3.7 agent manage a real vending-machine mini-store for a month, revealing both promising autonomy and glaring business-sense gaps.
    “We let Claude manage an automated store in our office as a small business for about a month,” the researchers write, noting successes like supplier discovery and failures like selling at a loss.
    anthropic.com

    Robyn | GitHub
    Robyn is an async Python web framework that compiles to a Rust runtime, aiming to deliver blazing-fast performance with a simple API and built-in agent/MCP support.
    Its README touts it as “a High-Performance, Community-Driven, and Innovator Friendly Web Framework with a Rust runtime.”
    github.com

    続きを読む 一部表示
    54 分
  • Tiny Teams, Loud Unlocks, and Rogue AI
    2025/06/27

    Thanks for listening! Make sure to leave a review ❤️

    Serena: A powerful coding agent toolkit | GitHub
    Serena presents itself as a full-featured coding agent that melds semantic code search, automated editing and shell execution to streamline developer workflows. “Serena combines tools for semantic code retrieval with editing capabilities and shell execution.”
    🔗 https://github.com/oraios/serena

    Nxtscape – an open-source agentic browser | nxtscape.ai
    Nxtscape pitches a privacy-first browser that runs local AI agents to automate tedious web tasks and boost productivity. “We’re putting powerful AI agents (using browser-use & computer-use models) directly into Nxtscape.”
    🔗 https://nxtscape.ai/?utm_source=chatgpt.com

    AI Is Ushering in the Tiny Team Era in Silicon Valley | Bloomberg (Jun 20 2025)
    Bloomberg argues that generative AI lets startups achieve outsized results with lean headcounts, making revenue-per-employee the valley’s new bragging right. “Startups used to brag about valuations and venture capital. Now AI is making revenue per employee the new holy grail.”
    🔗 https://www.bloomberg.com/news/articles/2025-06-20/ai-is-ushering-in-the-tiny-team-era-in-silicon-valley

    A federal judge sides with Anthropic in lawsuit over training AI on books without authors’ permission | TechCrunch (Jun 24 2025)
    Judge William Alsup ruled that Anthropic’s use of copyrighted books to train its models is likely fair use, handing the company a landmark legal victory. “We will have a trial on the pirated copies used to create Anthropic’s central library and the resulting damages.”
    🔗 https://techcrunch.com/2025/06/24/a-federal-judge-sides-with-anthropic-in-lawsuit-over-training-ai-on-books-without-authors-permission/

    Agentic Misalignment: How LLMs could be insider threats | Anthropic (Jun 20 2025)
    Anthropic’s study warns that autonomous language models can act like rogue employees, choosing harmful actions when their goals conflict with oversight. “We refer to this behavior, where models independently and intentionally choose harmful actions, as agentic misalignment.”
    🔗 https://www.anthropic.com/research/agentic-misalignment

    Gemini CLI | GitHub
    Google’s Gemini CLI brings the multimodal Gemini model to the terminal, letting developers query and transform gigantic codebases from a single command line. “This repository contains the Gemini CLI, a command-line AI workflow tool that connects to your tools, understands your code and accelerates your workflows.”
    🔗 https://github.com/google-gemini/gemini-cli

    Scream to Unlock | GitHub
    The Scream-to-Unlock Chrome extension blocks social media until users loudly shout an embarrassing phrase, turning procrastination into vocal accountability. “A Chrome extension that blocks social media sites ... until you scream ‘I'm a loser’ into your microphone.”
    🔗 https://github.com/Pankajtanwarbanna/scream-to-unlock

    Mira Murati’s Thinking Machines Lab closes on $2B at $10B valuation | TechCrunch (Jun 20 2025)
    TechCrunch reports that ex-OpenAI CTO Mira Murati has raised a record-breaking $2 billion seed round for her stealth AI startup, valuing it at $10 billion. “The deal values the 6-month-old startup at $10 billion.”
    🔗 https://techcrunch.com/2025/06/20/mira-muratis-thinking-machines-lab-closes-on-2b-at-10b-valuation/

    続きを読む 一部表示
    50 分
  • AI Agents, Brain Fog & Caveman Coding
    2025/06/23

    Welcome to The Monkey Patching Podcast: Going Bananas on AI, Data, LLMs & Tech — where we keep it real, techy, and far from buzzword bingo.

    Tune in wherever you get your podcasts, or visit us at monkeypatching.io


    🧠 Episode Topics & Links

    • Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task
      New arXiv research shows relying on ChatGPT dulls neural engagement and weakens writing skills compared to search or free writing. EEG readings revealed the lowest brain activity among LLM users.
      Source: arXiv · June 10, 2025
    • Amazon CEO says AI agents will soon reduce company’s corporate workforce
      Andy Jassy forecasts a future where generative-AI handles many office roles, trimming Amazon’s white-collar headcount: “We will need fewer people doing some of the jobs that are being done today.”
      Source: CBS News · June 17, 2025
    • The Grug Brained Developer
      A caveman-coded manifesto encouraging devs to say “complexity very bad” and resist feature bloat.
      Source: grugbrain.dev
    • SHADE-Arena: Evaluating sabotage and monitoring in LLM agents
      Anthropic’s benchmark reveals that while sabotage by LLMs is uncommon, more capable models can still slip under the radar. A reminder that complexity brings power — and risk.
      Source: Anthropic · June 16, 2025
    • Midjourney’s First Video Model
      Reddit buzzes over Midjourney’s image-to-video beta. Users praise its cinematic realism—some say it’s “indistinguishable from real camera footage.”
      Source: Reddit · June 15, 2025
    • Zero-Shot Forecasting: Our Search for a Time-Series Foundation Model
      Parseable pits four time-series foundation models against classic methods—none reliably outperformed standard tools on messy data. Zero-shot remains aspirational.
      Source: Parseable · June 3, 2025
    • If the Moon Were Only 1 Pixel – A Tediously Accurate Map of the Solar System
      An interactive scroll map scaling the Moon to a single pixel, forcing you to traverse near-endless blank space. As the author says: “It’s the empty space that’s a problem.”
      Source: JoshWorth.com
    • Monkey-Patched PyPI Packages Use Transitive Dependencies to Steal Solana Private Keys
      Six PyPI libraries were found hijacking Solana wallet keys at install time via monkey-patching crypto libraries. One malicious pip install can automatically exfiltrate your keys—yikes.
      Source: Socket · May 29, 2025
    • json_repair
      A lightweight Python module that auto-fixes malformed JSON from LLMs—because yes, sometimes those braces don’t match.
      Source: GitHub
    続きを読む 一部表示
    1 時間 3 分
  • Apple ‘Illusion of Thinking’ Debate, DuckLake Lakehouse & Magistral AI
    2025/06/13
    Welcome to Episode 2!We'd love to hear your feedback ❤️ DuckLake: SQL as a Lakehouse Format | DuckDB Blog (May 27 2025)DuckDB’s new DuckLake format proposes shifting all lakehouse metadata into a regular SQL database so open-format data lakes can gain true transactional speed and simplicity. “DuckLake re-imagines what a ‘Lakehouse’ format should look like,” the authors write, arguing it eliminates the maze of JSON files and external catalog services. If adopted, DuckLake could let organizations treat Parquet-backed blob storage like a fast, ACID-compliant warehouse without vendor lock-in. (duckdb.org)Magistral | Mistral AI (Jun 10 2025)Mistral AI has unveiled Magistral, its first reasoning-centric language model, releasing a 24-billion-parameter open version alongside a more powerful enterprise tier. As the company puts it, “Magistral is designed to think things through — in ways familiar to us,” offering transparent, multilingual chain-of-thought and 10× faster replies in Le Chat. By open-sourcing the small model and touting competitive benchmarks, Mistral positions itself as a nimble challenger to the big LLM providers. (mistral.ai)The hidden time bomb in the tax code that's fueling mass tech layoffs | QuartzA little-noticed tweak to U.S. tax code Section 174 that took effect in 2022 made R&D costs dramatically more expensive, quietly spurring hundreds of thousands of tech layoffs. One startled executive admitted, “I work on these tax write-offs and still hadn’t heard about this,” underscoring how the change blindsided companies large and small. With repeal efforts now winding through Congress, the episode shows how obscure fiscal fine print can ripple through innovation hubs and local economies alike. (qz.com)Where we’re headed with the dbt Fusion engine | dbt Labs (May 28 2025)Founder Tristan Handy lays out how the freshly launched Fusion engine—a complete rewrite of dbt Core—will speed parsing 30-fold, enable local execution, and one-day transpile SQL across data platforms. “Today, we launched the dbt Fusion engine, a complete rewrite of dbt from the ground up,” he explains, framing the overhaul as essential for scale. The roadmap suggests dbt could slash warehouse costs and free teams from vendor lock-in, marking a bold shift from incremental tweaks to deep platform bets. (getdbt.com)Introducing Claude 4 | Anthropic (May 22 2025)Anthropic’s Claude 4 family—Opus 4 and Sonnet 4—promises state-of-the-art coding, extended tool use, and improved memory for multi-hour agent workflows. The post touts that “Claude Opus 4 is the world’s best coding model,” citing a 72.5 percent SWE-bench score and new parallel tool execution. By keeping prices steady and expanding availability across AWS, Google, and its own API, Anthropic aims to cement Claude as developers’ go-to frontier model. (anthropic.com)Scrapling | GitHubScrapling is an open-source Python library that claims stealthy, high-performance web scraping with automatic adaptation to site changes and anti-bot defenses. Its README highlights that “Scrapling is a high-performance, intelligent web scraping library for Python that automatically adapts to website changes.” With more than 5,000 stars and an April 2025 release, the project shows continuing demand for lightweight, developer-friendly scraping tools that outsmart detection. (github.com)Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry | Futurism (Jun 09 2025)An Apple research team argues that leading “reasoning” language models plateau and even collapse on complex puzzles, challenging industry claims of true machine reasoning. Their study warns that “frontier [reasoning models] face a complete accuracy collapse beyond certain complexities,” calling current performance an “illusion of thinking.” The finding could temper expectations for next-gen LLMs and intensify scrutiny of benchmarking just as Apple readies its own AI features. (futurism.com)Bill Atkinson Dies From Cancer at 74 | Daring Fireball (Jun 07 2025)John Gruber reports that pioneering Macintosh programmer Bill Atkinson passed away at home on June 5 after battling pancreatic cancer. His family shared that “he was at home in Portola Valley in his bed, surrounded by family,” remembering him as “a remarkable person.” Gruber calls Atkinson perhaps “the most essential” coder on the original Mac team, noting his innovations in QuickDraw, MacPaint, and HyperCard still shape software today. (daringfireball.net)
    続きを読む 一部表示
    1 時間 9 分
  • Our First Episode!
    2025/05/06

    Hey there! Welcome to the very first Monkey Patching Podcast. We're just kicking things off today, chatting about what we've got planned for this show – nothing too mind-blowing yet, but we're excited to get rolling. If you're into data and AI talk without all the buzzword fluff, hit subscribe and join us for the ride. Trust us, it gets better from here.

    Ow yeah... while we said that we would start numbering at 0, our podcast hosting platform doesn't support it 🙈

    Thanks for listening!

    Much love,
    Murilo & Bart

    Creators & Guests

    • Bart Smeets - Host
    • Murilo Kuniyoshi Suzart Cunha - Host
    続きを読む 一部表示
    20 分