Artemis v2.3 Maple is now deployed! (with a replacement for Pushshift post aggregations)

https://www.reddit.com/r/AssistantBOT/comments/o7xlt7/artemis_v23_maple_is_now_deployed_with_a/

created by kungming2 on 25/06/2021 at 22:00 UTC

10 upvotes, 3 top-level comments (showing 3)

This version has few obvious changes from the outside, but there's a replacement for Pushshift's aggregations - for posts, to allow Artemis to resume providing certain statistics about the posts and submitters of a subreddit.

More than six months ago, the Pushshift database disabled aggregations[1] for everyone, which meant that some of the activity data that Artemis provided could not be provided. This data was:

1: https://www.reddit.com/r/pushshift/comments/jm8yyt/aggregations_have_been_temporarily_disabled_to/

At first, I (and other bot writers) figured this would be a temporary issue as the announcement post stated that "this won't be a permanent change," and that once aggregations went back up, everything would go back to normal. But of course, it's been a while now and aggregations show no sign of being re-enabled[2], and the "replacements" that have popped up don't seem reliable. Therefore, I decided to create my own local solution to replace Pushshift for* submissions* aggregations queries - Artemis Stream - something I outlined a few months ago as a solution[3].

2: https://www.reddit.com/r/pushshift/search?q=aggs+OR+aggregations&restrict_sr=on&sort=relevance&t=all

3: https://www.reddit.com/r/AssistantBOT/comments/kg3ndg/artemis_future_version_roadmap/

Artemis Stream

Artemis Stream is a separate script[4] that is run regularly and ingests/fetches all new posts from *only* the subreddits moderated by Artemis instances. A very small subset of those posts' *data* is cached in a local database that the statistics routine can query - in fact, Stream mimics Pushshift's reponse in providing that data back to the statistics function. Stream was actually deployed at the end of May and has been fetching posts for a few weeks now, and I upgraded Artemis's setup with an NVMe SSD to allow for faster reads/writes with Stream.

4: https://github.com/kungming2/AssistantBOT/blob/master/artemis_stream.py

Stream allows me to restore half of the data previously provided by Pushshift aggregations:

Artemis v2.3 Maple was deployed a few days ago, so you should have already seen this data re-appear in your June statistics! Stream also allows me to provide three new and simple, but useful information points for a subreddit's monthly statistics with v2.3:

There are other data points that Stream caches and I may write more tools to surface them in the future.

Notes

Comments

Comment by YoloMice at 23/10/2021 at 20:00 UTC

1 upvotes, 0 direct replies

AssistantBot is the top mod on my subreddit and is behaving badly. Can you remove it?

Comment by SolariaHues at 30/06/2022 at 00:41 UTC

1 upvotes, 1 direct replies

Hi

AssistantBot hasn't updated the stats in r/gardenwild recently (not since the 5th of May) - is there something wrong, do I need to do anything?

Also, I might be premature reporting this, but we've just added AssistantBot1 to r/NewToReddit, and it's passed the time it said it would populate the wiki page, and nothing has happened. Any help very much appreciated!

Comment by leneay at 16/08/2022 at 17:39 UTC

1 upvotes, 0 direct replies

Hi, sorry to bother. I'm not sure if I did something wrong, but AssistantBOT hasn't updated the user flair statistics since July 1. Could you tell me how I might fix that? Thank you.