Comment by safrax on 31/05/2023 at 00:59 UTC

14 upvotes, 3 direct replies (showing 3)

View submission: Advancing Community-Led Moderation: An Update on How NCRI/Pushshift and Reddit, Inc. are Working Together

View parent comment

Camas itself does nothing beyond build an API call to pushshift that it then makes the results of look "pretty". The pushshift code is not open source despite repeated calls to make it so. Even if it was open sourced Reddit is killing the public API that pushshift uses so you cannot build a pushshift clone going forwards.

Replies

Comment by Watchful1 at 31/05/2023 at 01:11 UTC

8 upvotes, 1 direct replies

Ingesting reddit content is relatively simple. It would be nice if they opensourced their implementation, but anyone really interested can just build one themselves.

But replicating the database structure and api capable of handling the loads pushshift did is a lot of detailed server setup and configuration that isn't that easy to publish and wouldn't be that useful anyway unless you bought all the same hardware they did.

Comment by BlogSpammr at 31/05/2023 at 01:19 UTC

4 upvotes, 2 direct replies

thanks but i’m not interested in pushshift code but the camas code that makes the data pretty. for someone with extremely poor technical skills like me, it would be easier to use code already written than struggle with trying to understand the massive complexity of implementing a web interface like camas.

thank you very much for your helpful reply!

Comment by Yekab0f at 02/06/2023 at 19:56 UTC

0 upvotes, 1 direct replies

Pushshift API is indeed open source. The ingest engine is not