• boonhet@sopuli.xyz
    6 hours ago

    They acquire and somewhat understand it, but it doesn’t get saved. It only lives in the context window.

    The things the big ones can do now are amazing when you have tasks that can be iterated on until completion. And then there’s Deepseek V4 flash, a much smaller model with a still-huge context window that costs at least an order of magnitude (nearly two) less than the American frontier models. It managed to do some spectacular things for me with essentially no input beyond the original prompt. It took hours, but learning the tools and everything myself would’ve taken days or weeks.

    Still not intelligent the way humans are. When the next session starts, it’s all wiped clean until it reads the last session’s notes. But in terms of understanding information and reasoning about it, it’s going to be better than a human who isn’t a domain expert.