<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>m3-max on Fabian G. Williams</title>
    <link>https://www.fabswill.com/tags/m3-max/</link>
    <description>Recent content in m3-max on Fabian G. Williams</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <lastBuildDate>Sat, 09 May 2026 00:00:00 +0000</lastBuildDate>
    
	<atom:link href="https://www.fabswill.com/tags/m3-max/index.xml" rel="self" type="application/rss+xml" />
    
    
    <item>
      <title>Qwen 3.6 vs gpt-oss:120b on M3 Max: I Ran a Harder Test, the 8× Speed Gap Surprised Me</title>
      <link>https://www.fabswill.com/blog/qwen-3-6-vs-gpt-oss-m3-max-8x-speed-gap-receipts/</link>
      <pubDate>Sat, 09 May 2026 00:00:00 +0000</pubDate>
      
      <guid>https://www.fabswill.com/blog/qwen-3-6-vs-gpt-oss-m3-max-8x-speed-gap-receipts/</guid>
      <description>TL;DR I published a post last week about replacing gpt-oss:120b with Qwen 3.6 on my MacBook Pro M3 Max. The numbers in that post were real, but one set of tests was structurally gameable — 38 of 40 baseline images were the same class, so an &amp;ldquo;always-say-A&amp;rdquo; stub also scored 95 percent. I went back, designed three un-gameable reasoning tasks, and ran them against both local models on identical hardware.</description>
    </item>
    
  </channel>
</rss>