Qwable 27B is a full fine-tune of Alibaba’s Qwen3.6-27B, educated on a Fable 5-style reasoning dataset, designed to duplicate the structured, deliberate pondering fashion of Anthropic’s latest flagship mannequin.
The abliterated model removes the mannequin’s built-in refusal habits by surgically modifying its weights utilizing llama.cpp’s cvector-generator.
Each fashions run domestically, price nothing per question, and require neither Anthropic’s API nor its necessary insurance policies.
Anthropic spent final week apologizing for Fable 5’s invisible safeguards, after which the U.S. authorities ordered the mannequin pulled for all international nationals over a disputed jailbreak discovering.
A number of days later, a developer on Hugging Face uploaded a mannequin that used Fable’s reasoning to information a neighborhood mannequin—and now even your potato PC can run a greater mannequin.
The mannequin is known as Qwable—Qwen + Fable, if the portmanteau wasn’t instantly apparent. It is a full fine-tune of Alibaba’s Qwen3.6-27B base, constructed by developer Mia (Mia-AiLab on Hugging Face) on a dataset of Fable 5-style reasoning examples. The objective is a 27-billion parameter mannequin that runs on shopper {hardware} and thinks the way in which Fable 5 thinks. (Parameters decide a mannequin’s breadth of information, with extra typically that means extra succesful.)
Would anybody be attention-grabbing in testing it? I can add a gguf in hf. pic.twitter.com/hQCiUlT1sr
— Mia (@MiaAI_lab) June 15, 2026
The approach is known as instruction fine-tuning on trace-style examples. That’s a technical method of claiming the developer collected examples formatted like Fable 5’s deliberate, step-by-step solutions and educated Qwen to provide the identical type of output.
So consider it as much less “copying the take a look at” and extra “studying the examine habits.” The same strategy drove Qwopus—the Claude Opus 4.6 native distillation—although that undertaking centered on chain-of-thought reasoning traces. Qwable targets Fable 5’s general instruction-following construction: extra guided, extra explanatory, and extra oriented towards step-by-step process completion than the bottom Qwen mannequin it was constructed on.
It runs in GGUF format—the compressed, consumer-friendly file kind that works with LM Studio or llama.cpp—and matches in roughly 16.5 GB in its This fall quantized construct. It sends nothing to Anthropic’s servers, which issues on condition that Fable 5 required necessary 30-day information retention on all site visitors, even for enterprise clients who beforehand had zero-retention agreements. Even the present fashions use third-party servers to course of your info and prompts..
Then, shortly after Qwable appeared on Hugging Face, another person arrived to make it even higher.
Qwable with out a conscience
Qwable is a censored mannequin. In any case, each Qwen and Claude are. However Qwen, as the bottom mannequin, is open supply, and will be manipulated and tweaked.
Huihui-ai, an open-source contributor identified for uncensored GGUF releases, took Qwable and utilized a course of known as abliteration to provide Huihui-Qwable-3.6-27b-abliterated. It produced a mannequin that thinks like Fable however gained’t refuse to reply your prompts, irrespective of how bizarre or harmful they’re.
It isn’t a jailbreak. It is surgical procedure.
Each fine-tuned AI mannequin carries a refusal course embedded in its weights—a mathematical sign within the mannequin’s inside activations that fires when it detects a request it has been educated to say no. Abliteration identifies that sign by working the mannequin on massive units of dangerous and innocent prompts, measuring how the inner math differs between them, after which modifying the mannequin weights to eradicate that distinction.
After the process, the mannequin merely does not have the refusal equipment anymore. So the lobotomized mannequin stays totally useful, simply with out the neurons that activate the “I shouldn’t do that” solutions.
We tried it with one in every of our traditional checks and as an alternative of refusing, the mannequin began disecting the difficulty into completely different areas, answering appropriately for recommendation on find out how to cheat on a girlfriend together with her greatest buddy.
Huihui-ai utilized the approach on to the Qwable GGUF utilizing llama.cpp’s cvector-generator—no Python surroundings, no full-weight retraining, no rented server.
Why would somebody need this?
The usual Qwable fits coding help, technical debugging, and any workflow the place you desire a mannequin that lays out its reasoning relatively than simply producing a solution. It is designed for native agent setups and runs in most native runtimes. In case you already use LM Studio, it is a search and a obtain.
The abliterated model has a narrower viewers: safety researchers who want uncooked mannequin habits with out provider-side filtering, artificial information pipelines that require outputs on delicate subjects, and analysis work the place you are testing mannequin capabilities with out mixing in content material insurance policies.
A much less technical case? Let’s depart apart the standard use case of getting a NSFW AI Waifu that thinks like Claude Fable, which is a really apparent state of affairs. Think about you need the mannequin to put in writing a morally ambiguous villain monologue in your Dungeons & Dragons marketing campaign, and commonplace fashions preserve interrupting to notice that the character’s worldview “raises moral considerations value exploring.” The abliterated model simply writes the villain. Additionally, because it runs domestically, the U.S. authorities can’t emergency-pull it out of your machine at midnight over a disputed jailbreak discovering.
In fact, there are extra questionable use circumstances. We don’t condone these, and gained’t offer you any concepts.
Huihui-ai’s mannequin card is specific: That is for analysis and managed environments solely. Decreased security filtering means outputs will be delicate, controversial, or inappropriate, and authorized and moral accountability sits fully with the consumer.
The abliterated Qwable is out there on Hugging Face now in three builds. The advisable Q4_K_M_Q8 model weighs round 19 GB and is the smallest, most consumer-friendly choice.
In case your pc helps it, there’s a model that helps multi-token prediction, which can make it reply a lot, a lot quicker.
Every day Debrief Publication
Begin day-after-day with the highest information tales proper now, plus authentic options, a podcast, movies and extra.