<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>gpt-oss on Fabian G. Williams</title>
    <link>https://www.fabswill.com/tags/gpt-oss/</link>
    <description>Recent content in gpt-oss on Fabian G. Williams</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <lastBuildDate>Sat, 02 May 2026 00:00:00 +0000</lastBuildDate>
    
	<atom:link href="https://www.fabswill.com/tags/gpt-oss/index.xml" rel="self" type="application/rss+xml" />
    
    
    <item>
      <title>Replacing gpt-oss:120b With Qwen3.6 on a MacBook Pro: A Two-Day Local Model Benchmark</title>
      <link>https://www.fabswill.com/blog/replacing-gpt-oss-with-qwen3-6-on-macbook-pro/</link>
      <pubDate>Sat, 02 May 2026 00:00:00 +0000</pubDate>
      
      <guid>https://www.fabswill.com/blog/replacing-gpt-oss-with-qwen3-6-on-macbook-pro/</guid>
      <description>TL;DR I spent two days benchmarking three Qwen3.6 variants against gpt-oss:120b on my MacBook Pro M3 Max. The shocking result: a 21 GB coding-tuned model ran an OpenClaw-shaped research-brief workload that I use for the nonprofit MACONA.org in 6 seconds — 10x faster than gpt-oss:120b on the same prompt. That is fast enough that I now have reasonable confidence I could move this kind of work off the SaaS-hosted frontier models I have been paying for and onto local hardware on my dev machine.</description>
    </item>
    
  </channel>
</rss>