• dev_null@lemmy.ml
    link
    fedilink
    English
    arrow-up
    3
    ·
    3 hours ago

    Maybe they did, that’s how they got to 99%. The remaining issues are so intricate/complex the LLM just can’t solve them no matter how many test cases you give it.