The gloomy sentiment around Reddit Inc. has failed to dissipate after its shares fell 50% from a February high, with volatile technology stocks under pressure.

  • RedFrank24@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    4 days ago

    I disagree that Reddit would gain in value over time if they kept banning automation, because it is increasingly difficult to avoid AI-generated material polluting your dataset, no matter how much you avoid automation and try banning it. Inevitably, some AI-generated material is going to get in.

    It’s a problem in two ways:

    1. The vast vast majority of data on Reddit has already been sold, so you can’t rely on that data for future revenue
    2. The remaining data that’s current is polluted by AI and is therefore worth less than the historical data because the more AI pollutes your dataset, the more likely it is to lead to Model Collapse, where an LLM is poisoned due to unverified data generated by other LLMs

    I am firmly of the belief that sites like Internet Archive will be some of the most valuable companies in the AI space, because they hold an immense amount of untainted data created prior to 2019.