0 upvotes, 1 direct replies (showing 1)
View submission: [deleted by user]
LLMs are being trained using Reddit data, which is free in bulk via the API.
The data is a valuable resource that other tech firms are leveraging to launch products with a huge hype cycle. Charging for this is fair enough.
Unfortunately, there are other, more legitimate uses of the API, such as 3rd party apps, that will be impacted. Effectively collateral damage.
Comment by ojsan_ at 06/06/2023 at 20:49 UTC
2 upvotes, 0 direct replies
They could also just ban the data from being used in LLM datasets.