lemmydividebyzero@reddthat.com to Technology@lemmy.worldEnglish · 18 hours agoOpensource AI Must Winopensourceaimustwin.comexternal-linkmessage-square35fedilinkarrow-up1135arrow-down114cross-posted to: [email protected]
arrow-up1121arrow-down1external-linkOpensource AI Must Winopensourceaimustwin.comlemmydividebyzero@reddthat.com to Technology@lemmy.worldEnglish · 18 hours agomessage-square35fedilinkcross-posted to: [email protected]
minus-squaremabeledo@lemmy.worldlinkfedilinkEnglisharrow-up1·9 hours agoI mean if that’s all that would be loaded in memory, sure.
minus-squareZephyrXero@lemmy.worldlinkfedilinkEnglisharrow-up1·7 hours agoI got Qwen 3.5:9b running on my 8GB GPU the other day, and it still has some room left over
minus-squaremabeledo@lemmy.worldlinkfedilinkEnglisharrow-up1·6 hours agoI was talking about combined system RAM. People often overestimate what the average system specs are.
minus-squareZephyrXero@lemmy.worldlinkfedilinkEnglisharrow-up1·6 hours agoIf the model dumps over to system ram it gets super slow, you ideally want it to fit completely in your VRAM
I mean if that’s all that would be loaded in memory, sure.
I got Qwen 3.5:9b running on my 8GB GPU the other day, and it still has some room left over
I was talking about combined system RAM. People often overestimate what the average system specs are.
If the model dumps over to system ram it gets super slow, you ideally want it to fit completely in your VRAM