Comment by snozburger on 06/06/2023 at 20:11 UTC

0 upvotes, 1 direct replies (showing 1)

View submission: [deleted by user]

View parent comment

LLMs are being trained using Reddit data which is free in bulk via API.

The data is a valuable resource that other tech firms are leveraging to launch products with a huge hype cycle. Chaging for this is fair enough.

Unfortunately there are other more legitimate uses of the API such as 3rd party apps that will be impacted. Effectively collateral damage.

Replies

Comment by ojsan_ at 06/06/2023 at 20:49 UTC

2 upvotes, 0 direct replies

They could also just ban the data from being used in LLM datasets.