Instance Meta - Is this instance getting flooded with spam bot accounts?

maegul@startrek.website · 2 years ago

Instance Meta - Is this instance getting flooded with spam bot accounts?

SysAdmin@startrek.website · 2 years ago

Just a quick update for everyone, yes OP is right and a bunch of bots signed up. We’ve purged them from our user count and enabled CAPTCHA. Email verification is coming soon as a secondary deterrent.

For the record nobody told us that it’s not safe out here. We were aware that self-hosting was wondrous, with treasures to satiate desires both subtle and gross; but has NO IDEA that it wasn’t for the timid. 😉

maegul@startrek.website · edit-2 2 years ago

Ooh … how did you purge them from your user numbers? Many other admins might not know how to do that … maybe worth sharing?

williams_482@startrek.website · 2 years ago

We deleted them from the local_user database table outright based on some sketchy shared attributes, and then manually updated the user count in site_aggregates to the correct figure so our stats wouldn’t look so sketchy.

Pretty simple for anyone comfortable in SQL who knows where to look (a helpful user DM’d and gave us a hand here), but not something anybody should try willy nilly if they don’t know what they are doing. Editing production data on the fly is not to be done casually.

maegul@startrek.website · 2 years ago

There are pricy probably admins who might appreciate this, as dangerous as it is.

Care if I post it into the lemmy community or even made the support community?

williams_482@startrek.website · 2 years ago

Sure, go for it.

Include a mention that even running queries against the database won’t necessarily be easy if you don’t know what you’re doing. where it is located and (separately) how best to access it will depend on how it was installed.

StillPaisleyCat@startrek.website · 2 years ago

Just a member here, but given the recent rapid growth in the past 10 days as folks migrate over from a 600k subreddit (myself included), and the normal 90% lurker rule-of-thumb, this is actually a fairly reasonable monthly active user ratio.

I genuinely understand the concern to avoid bots and trolls, but have admins in other instances actually documented a significant number of bots originating through this instance?

Value Subtracted@startrek.website · 2 years ago

There is some Verified Suspicious Activity^TM. We’re hoping to get it sorted out soon.

StillPaisleyCat@startrek.website · 2 years ago

Thank you for your care of this place, and due diligence.

With the instance growing so quickly, aggregate stats can flag but are vulnerable to aliasing.

I very much appreciate the path to lemmy membership that you offered to us in the final hours before many of the subreddits went dark.

The there is a ratcheting chain of new members to this and other Lemmy instances is still happening. I have the sense that there’s still a lot of private chat on Reddit as folks looking for a new place to be are enquiring of those who’ve already migrated.

All to say, when it is sorted, it would be great to have an updated sticky with the details so those of us here can support the less tech adept among us make the transition.

maegul@startrek.website · 2 years ago

You have a point, especially as lemmy defines “active” as a user that has at least posted once within the relevant time period. So yes, lurkers definitely wouldn’t count toward the active user count (mastodon and the like use different metrics AFAIU).

StillPaisleyCat@startrek.website · 2 years ago

‘Posted’ is even a higher bar than ‘commented’, a 10th of a 10th.

Not to say that there isn’t a need to filter out sockpuppets and bots at signup, but rather that the inference being made from the stats was not obvious.

maegul@startrek.website · 2 years ago

Yea I’m unclear on whether commenting counts as being active. I would guess that it does.

stux⚡ @geddit.social · 2 years ago

Yes, this is a big issue currently going on…

Enabeling captcha should fix the issue. Email alone not since they will abuse the email service

th3raid0r@tucson.social · 2 years ago

Admin of tucson.social here - I haven’t noticed an attack on my instance yet but I do have Captcha AND Email validation turned on.

Since my instance is for Arizonan’s only, I could do a geo-ip block if pressed, but obviously that won’t work for places like startrek.website.

If any admin needs assistance, I recommend enlisting some help over at programming.dev - likely the best instance for collaborating on our lemmy servers.

Faceman🇦🇺@discuss.tchncs.de · 2 years ago

geoblocking is also a bit of a blunt instrument, many people either use network wide VPNs or even sometimes the ISPs IP blocks are mislocated (my work ISP has my IP in a different state)

th3raid0r@tucson.social · 2 years ago

For sure! If I were running a more general and globally focused instance that would be a larger concern. I understand using a VPN in North America, but not so much from other countries. I guess my vision is that it’s only really locals accessing the site for the most part. If someone is travelling out of the country, they can equivalently use a U.S. based VPN server.

I suppose my example of Arizona came across as the proposed bounds of my geoblock. I’d probably just say “North America” to avoid the issues of remote workers using a company VPN to access the site (please don’t though, your company probably doesn’t like that - the current version of lemmy is VERY bandwidth inefficient )

Also, consider that I can use one Geoblock for my signup page, and different, more permissive one for the login page which should make things a bit more reasonable.

Freeman@lemmy.pub · 2 years ago

I just closed my registration, was onboarding it and syncing up communities in prep for a 7/1 rush. Haven’t seen any attempts yet. But will probably just work out a kbin instance and move on. Too much drama with the lemmy devs.

th3raid0r@tucson.social · edit-2 2 years ago

Agreed, and my one call to action post to get other Admins to give a crap fell on it’s face over on beehaw. It seems that many admins really think that every instance should use manual registration, or other tools. All in all, the message I got was “The devs don’t have to listen to anyone”.

I’m now of the opinion that most lemmy admins aren’t people I want to associate with, they seem to be all about “open source” until it collides with concepts like “collective responsibility” and you’ll get a response in the individualist line of reasoning of “Oh, just fix it yourself”.

Kbin is sure lookin’ pretty good these days now.

KNova@links.dartboard.social · 2 years ago

I think the tone of your other post and the call to essentially brigade GitHub and demand changes from the devs put a lot of other instances admins in an uneasy position. You also said that instance admins were “abdicating their responsibility” to demand things of the devs.

Isn’t jumping ship to kbin abdicating your responsibility to stay on and help grow Lemmy…?

To be fair I have no ill will with you, but that post stunk of open source entitlement. https://tommcfarlin.com/open-source-entitlements-users/

th3raid0r@tucson.social · 2 years ago

I find it odd that a lot of people communicating that there is a problem and that they are impacted equates to brigading. To me, how can they know how wide reaching a change was if there is only a couple of comments from more savvy admins? There’s a difference between knowing a decision created a problem, and knowing a decision created a BIG far-reaching problem. From my perspective, sometimes we forget to think about the bigger implications of changes and that’s where community pressure and action can come in.

Yeah, I could probably work on my tone a bit. The internet echo chamber has trended my communication style to be more bold, but the fediverse does appear to be ever so slightly different.

Also, haven’t moved to Kbin yet - it’s just tempting - and yes it would be abdicating my responsibility to grow Lemmy and contribute to that project, thus I haven’t done it yet.

As for open-source entitlement - here’s what irks me - Admins are users, indeed, but we’re not the type of user that blog is admonishing, and I’m not asking other admins to become that. I plan to contribute code to the lemmy code base, make some bots, and generally enrich the ecosystem. So when the ecosystem itself is threatened (by spam), I can’t help but sense that this is something bigger than “open-source entitlement”.

Freeman@lemmy.pub · edit-2 2 years ago

Yeah agree. Like I get their captcha is bad. But why rip out a piece of the puzzle without a solution? Doesn’t seem to be conflicts just “I guess it’s time”. It’s a weird hill to die on. Just defer the removal until a pr for a better alternative. Security is an onion, no one thing is gonna stop spammer and bots.

xuu@social.sour.is · 2 years ago

Yep. Been seeing this on mine and a few other instances. They are random word + numbers. They are using bogus emails which cause bounces. One instance that was hit got their mail service locked out because of bounced emails.

I enabled captcha and manually deleted the users that had signed up… Its very hard to list users in local instances.

EuphoricPenguin@normalcity.life · edit-2 2 years ago

I had this issue as well on my instance. Here’s how I fixed it with SQL commands included. TL;DR Turn on CAPTCHAs, don’t use email verification (as they will spam the shit out of it), and use SQL commands to ban all of them in one fell swoop.

death916@lemmy.death916.xyz · 2 years ago

Yo thanks for this. I’m not rlly good with dbs and was trying to figure out how to mass get ride of some boy accounts. Luckily I noticed B4 it got super bad

EuphoricPenguin@normalcity.life · 2 years ago

Yeah, I heavily modified and expanded upon someone else’s query to seek out and destroy more of the accounts. Theirs is basically pattern-matching some of the Gmail-with-numbers spam, but there’s a subset using junk@junk with no actual .TLD to try and get people’s email verification to bounce. Someone else said that ended up in people getting their email relay account suspended, hence why email verification (at least without CAPTCHAs) is a fairly bad idea. I added a table join and some extra matching to find some of those extra bogus “emails,” which typically results in quite a few more accounts being banned. There are two major caveats with my method: 1) it doesn’t delete the accounts, which is really just a simple modification to the query to “fix,” and 2) it doesn’t deal with spam accounts that have no email attached, although those seem to be a fairly small subset of the account spam. I’ll see if there is an easier way to deal with those, but getting most banned or deleted is still pretty easy.

Lockely@pawb.social · edit-2 2 years ago

@SysAdmin should definitely enable captcha at the very least.

Edit: Tagging @[email protected] to maybe get visibility here?

Value Subtracted@startrek.website · 2 years ago

We are on the case.

Briongloid@aussie.zone · 2 years ago

rabbitea.rs appears to be entirely made for spam accounts and I have suggested to my instance that we ban it only because it has no genuine activity that I can see.

russjr08@outpost.zeuslink.net · 2 years ago

They might’ve cleaned up their instance, from what I can see when I bring it up now according to the stats there is only one user there.

Date	Episode	Title
08-28	SNW 3x08	“Four-and-a-Half Vulcans”
09-04	SNW 3x09	“Terrarium”
09-11	SNW 3x10	“New Life and New Civilizations”
01-15	SFA 1x01	TBA
01-15	SFA 1x02	TBA