DeepSeek just released updated R1 models with 'deeper and more complex reasoning patterns'. Includes an R1-distilled Qwen3 8B model boasting "10% improved performance" over the original (huggingface.co)
SmokeyDope@lemmy.world to LocalLLaMA@sh.itjust.works · English · 6 months ago