Destide@feddit.uk to Programming@programming.devEnglish · 6 months agoOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coexternal-linkmessage-square9linkfedilinkarrow-up1109arrow-down15
arrow-up1104arrow-down1external-linkOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coDestide@feddit.uk to Programming@programming.devEnglish · 6 months agomessage-square9linkfedilink
minus-squareTomasEkeli@programming.devlinkfedilinkarrow-up5·6 months agohonestly both 7b and 8b are pretty dumb as well.
minus-squareMadhuGururajan@programming.devlinkfedilinkEnglisharrow-up1·6 months agowe could add so much deterministic code at 1.5GB that would start religions…
honestly both 7b and 8b are pretty dumb as well.
we could add so much deterministic code at 1.5GB that would start religions…
True