Destide@feddit.uk to Programming@programming.devEnglish · 1 年前Open-R1: a fully open reproduction of DeepSeek-R1huggingface.coexternal-linkmessage-square9linkfedilinkarrow-up1106arrow-down15cross-posted to: [email protected][email protected]
arrow-up1101arrow-down1external-linkOpen-R1: a fully open reproduction of DeepSeek-R1huggingface.coDestide@feddit.uk to Programming@programming.devEnglish · 1 年前message-square9linkfedilinkcross-posted to: [email protected][email protected]
minus-squareTomasEkeli@programming.devlinkfedilinkarrow-up5·1 年前honestly both 7b and 8b are pretty dumb as well.
minus-squareMadhuGururajan@programming.devlinkfedilinkEnglisharrow-up1·1 年前we could add so much deterministic code at 1.5GB that would start religions…
3B is probably also pretty dumb
honestly both 7b and 8b are pretty dumb as well.
True
we could add so much deterministic code at 1.5GB that would start religions…