Lemmy World outages

lwadmin@lemmy.world · edit-2 1 year ago

Lemmy World outages

KrisND@lemmy.world · edit-2 1 year ago

It sucks but there will always be some labor intensive queries to execute. Although, it can be limited and restricted which I’m sure they are already on top of it. Such as caching and security control put in place to make limits like “this type of request from this IP can only happen 1x per hour” or something along those lines.

If I had to guess, without looking into the source code yet and limited information provided I’d assuming it’s mass account creation, image uploading and/or exploiting how the instant syncs with the fediverse. It’s most certainly something that can be mostly prevented once the holes are made and then patched.

Also, I’m sure in the future something more efficient than SQL will be used.

solrize@lemmy.world · edit-2 1 year ago

I have to wonder what those queries actually do. Why is mass account creation a thing? Image uploading shouldn’t cause significant db activity (add a row saying where the image is, don’t put the image into a BLOB or anything like that). Syncing is no big deal either, given the quite low amount of traffic. I know that some websites use Postgres for fulltext search and I don’t know how well that works under heavy loads. I’ve mostly used Solr (solr.apache.org, thus my username) but I think that is now considered old fashioned.

PostgreSQL itself is quite performant and should be able to handle high loads once the queries and schemas are optimized, there is some caching of obvious things, etc. One antipattern I’ve noticed is pagination: saying “page=5” like Lemmy does to get to the 5th page of /all is done with an OFFSET clause which is expensive because it has to count off that many rows. It is better to use timestamps or other markers like Reddit does, that can be an indexed column that can be accessed quickly.

Anyway thanks.