• [deleted]@piefed.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 days ago

    They could use reliable sources to approach 100% instead of jamming literally everything in. For example, limiting the training data to peer reviewed papers would not be exactly 100% but it would be a lot closer than including all of reddit.