r/LocalLLaMA · May 28, 2026 · 1 min read

Qwen3.6 35B - TXT vs Markdown vs HTML vs HTML+CSS


There's been talk of late about using HTML rather than markdown in Claude Code. I was curious how this would work with a local model, so I loaded up Qwen3.6 35B A3B at Q8 with an F16 KV cache.

Then I gave it the same prompt, to write a detailed explanation of the Blazor render cycle, five times: first asking for raw text, then markdown, then unstyled HTML, then HTML+CSS, and finally with no format constraint (where it chose markdown). For each run I measured the token counts for the reasoning, the total response (including the markdown or HTML formatting), and the raw response content stripped of formatting.

I also recorded the tokens per second (running MTP with 3 draft tokens) and the total time taken.
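The post doesn't say how the formatting was stripped to get the raw content figure. As a minimal sketch, assuming the HTML responses were reduced to their visible text before counting, something like the following would do it with only the Python standard library. The names `TextExtractor` and `strip_html` are hypothetical, and the token count itself would still need the model's own tokenizer, which is not shown here.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <style> and <script>."""

    SKIP = {"style", "script"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Only keep text that is outside <style>/<script> elements.
        if self._skip_depth == 0:
            self.parts.append(data)


def strip_html(html: str) -> str:
    """Return the visible text of an HTML string with whitespace collapsed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())
```

Dropping the `<style>` body matters for the HTML+CSS run, since the stylesheet counts toward output tokens but carries none of the explanation.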

Output format	Reasoning tokens	Output tokens	Raw content tokens	Tokens per second	Time taken
Raw text	1,873	1,080	1,080	146	20s
Markdown	1,264	1,496	1,269	123.5	23s
Unstyled HTML	166	7,346	4,857	139	56s
Styled HTML	108	10,290	3,418	139	82s
No constraint (chose markdown)	1,465	2,256	2,002	122	31s
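To make the formatting cost easier to compare, here is a small sketch that works out what share of each response is formatting rather than content, using the numbers from the table. The time estimate assumes the reported tokens per second applies to reasoning and output tokens alike and ignores prompt processing, which is my assumption rather than something the post states.

```python
# (reasoning, output, raw_content, tokens_per_second, seconds) from the table above
RUNS = {
    "Raw text": (1873, 1080, 1080, 146.0, 20),
    "Markdown": (1264, 1496, 1269, 123.5, 23),
    "Unstyled HTML": (166, 7346, 4857, 139.0, 56),
    "Styled HTML": (108, 10290, 3418, 139.0, 82),
    "No constraint": (1465, 2256, 2002, 122.0, 31),
}


def formatting_share(output_tokens: int, raw_tokens: int) -> float:
    """Fraction of the response tokens spent on formatting."""
    return (output_tokens - raw_tokens) / output_tokens


def estimated_seconds(reasoning: int, output_tokens: int, tps: float) -> float:
    """Generation time implied by total generated tokens and throughput."""
    return (reasoning + output_tokens) / tps


if __name__ == "__main__":
    for name, (reasoning, output, raw, tps, seconds) in RUNS.items():
        share = formatting_share(output, raw)
        est = estimated_seconds(reasoning, output, tps)
        print(f"{name:15s} formatting {share:6.1%}  est. {est:5.1f}s  measured {seconds}s")
```

On these figures roughly a third of the unstyled HTML response and about two thirds of the styled HTML response is markup, against roughly 11 to 15 percent for the two markdown runs.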

Finally, I got ChatGPT 5.5 Extended Reasoning to score the quality of each output based on:

How much correct useful information is present
How well it is explained
How many errors it contains
How efficiently it uses its length

Rank	Output format	Coverage	Explanation	Errors	Density	Total
1	Markdown	31/40	21/25	18/25	8/10	78/100
2	No constraint (chose markdown)	32/40	18/25	13/25	8/10	71/100
3	Raw text	30/40	19/25	11/25	6/10	66/100
4	Unstyled HTML	34/40	17/25	6/25	4/10	61/100
5	Styled HTML	33/40	19/25	3/25	3/10	58/100
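The totals are the plain sum of the four sub-scores, weighted 40/25/25/10 by their maximums. A quick sketch to recompute them and the ranking from the table (the dictionary names are mine):

```python
# Maximum points per criterion, as shown in the table above.
MAX_POINTS = {"coverage": 40, "explanation": 25, "errors": 25, "density": 10}

# (coverage, explanation, errors, density) for each output format.
SCORES = {
    "Markdown": (31, 21, 18, 8),
    "No constraint": (32, 18, 13, 8),
    "Raw text": (30, 19, 11, 6),
    "Unstyled HTML": (34, 17, 6, 4),
    "Styled HTML": (33, 19, 3, 3),
}


def total(scores: tuple) -> int:
    """Sum the four sub-scores into a score out of 100."""
    return sum(scores)


# Highest total first.
ranking = sorted(SCORES, key=lambda name: total(SCORES[name]), reverse=True)

if __name__ == "__main__":
    for rank, name in enumerate(ranking, start=1):
        print(rank, name, f"{total(SCORES[name])}/{sum(MAX_POINTS.values())}")
```

The HTML runs score highest on coverage, so their lower totals come almost entirely from the errors and density columns.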

submitted by /u/BigYoSpeck