DeepSeek dropped the V3.1 Weight (huggingface.co)
pepperfree@sh.itjust.works to LocalLLaMA@sh.itjust.works · English · 2 days ago · 17 comments
pepperfree@sh.itjust.works (OP) · 1 day ago
I wonder if we can extend the context length. It was already fine-tuned with YaRN, so we can't get a free extension with that method.
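For anyone wondering why a second round of YaRN would not be free, here is a rough sketch of the frequency rescaling as I understand it from the YaRN paper. The function names are made up for illustration, and the numbers (`dim=64`, `base=10000`, `orig_len=4096`, `factor=40`) are placeholder assumptions, not values read from the V3.1 config.

```python
import math

def yarn_inv_freq(dim, base, orig_len, factor, alpha=1.0, beta=32.0):
    """Return (plain RoPE, YaRN-scaled) inverse frequencies, one per dimension pair.

    Hypothetical helper: alpha/beta are the ramp bounds suggested in the
    YaRN paper for Llama-style models, assumed here for illustration.
    """
    plain, scaled = [], []
    for i in range(0, dim, 2):
        theta = base ** (-i / dim)
        # How many full rotations this dimension completes within orig_len.
        rotations = orig_len * theta / (2 * math.pi)
        # Ramp: 0 means fully interpolate (divide by factor), 1 means leave as is.
        gamma = min(1.0, max(0.0, (rotations - alpha) / (beta - alpha)))
        plain.append(theta)
        scaled.append((1 - gamma) * theta / factor + gamma * theta)
    return plain, scaled

def yarn_attention_scale(factor):
    """Attention temperature correction from the YaRN paper: 0.1 * ln(s) + 1."""
    return 0.1 * math.log(factor) + 1.0

# Placeholder numbers, not the real model config.
plain, scaled = yarn_inv_freq(dim=64, base=10000.0, orig_len=4096, factor=40.0)
print("fastest dim unchanged:", scaled[0] == plain[0])
print("slowest dim ratio:", plain[-1] / scaled[-1])
print("attention scale:", yarn_attention_scale(40.0))
```

The slow dimensions are already stretched by the factor the model was fine-tuned with, so applying a further factor on top pushes them to positions the fine-tune never saw. That would probably need more training rather than a config change.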