Qwen/QwQ-32B · Hugging Face (huggingface.co)
Lantier@jlai.lu to LocalLLaMA@sh.itjust.works · English · 3 days ago · 5 comments
insane, absolutely insane

suoko@feddit.it · 3 days ago
Why insane? For quality, speed, or size? I find the Coder 1.5B and 3B models light and good.

morrowind@lemm.ee · 3 days ago
It matches R1 on the given benchmarks. R1 has 671B params (37B activated), while this has only 32B.