• 1 Post
  • 140 Comments
Joined 3 years ago
cake
Cake day: June 14th, 2023

help-circle
  • If you have an option in the BIOS or drivers to set the battery charging threshhold to stop charging at 80% or possibly even lower (I believe Lithium Ion/Polymer cells are happiest at around 40-60% state of charge, which is why they are usually shipped that way). Sometimes this feature exists, but only vaguely described as “battery life optimization” or some similar wankery that they never properly explain.

    You can also just remove the battery entirely if it will be always plugged in, but setting it to a low/modest maximum charge gives you a nice, reliable, built-in UPS in case of power interruptions.


  • This is FUD. Dell workstation class laptops are absolutely designed for running at full power with the lid closed (and for use with a docking station, in fact). That’s why it’s called a “workstation”. Business class laptops are a totally different beast than consumer-grade/“gaming” laptops. They are typically quite durable, quite tolerant, and quite repairable. Exceptions, obviously, exist, and we don’t know the exact model OP is talking about, but there’s no reason to assume they will have any issues running it as intended.

    I run several old consumer grade laptops as light servers, permanently closed, and they’ve been perfectly fine for years.


  • i18n is definitely a real thing and it’s often even used as an abbreviation in repos, folder structures and filenames related to internationalization/translation, I don’t love it, but it is definitely in real world use. l10n is one I’ve seen before and is also a real thing, but much less common (and even less acceptable in my opinion). a11y is not one I’ve ever seen before.


  • If this is about the bot I think it is, I haven’t personally complained but I have noticed it’s weird and often wrong, it seems to detect HTTP/HTTPS in every post (perhaps seeing links and URLs?) and it seems to maybe possibly be detecting any words with those strings of letters somewhere in them and presumably also doesn’t care about case sensitivity? The short ones in particular like “AP”, “CA”, “CF”, “HA” and “IP” seem to come up frequently almost every time it posts, and “NAT” and “IoT” seems common, and none of things seem to be actually mentioned in any of the comments that I see.


  • I am questionning using it only for myself and use something else to share with family

    I’m confused, you don’t want your family to be able to know each other’s email addresses? That seems like a bit… extreme level of profile paranoia.

    I have my family on my Nextcloud and I can guarantee 90% of them don’t even know where their own user profile is nevermind anybody else’s. Maybe you have a different family than mine.


  • Way too much. The Nvidia P40 I scavenged for my homemade AI server runs at 120W (throttled down from 250W default) on its own. Then I’ve got two more PCs running purely as redundant firewalls with automatic failover, pretty unnecessary but if that’s not the sort of thing homelabbing is for then I’m going to keep doing it wrong because I find it fun. Then there’s the minecraft server, which is pretty beefy and also eternally running at max CPU because my niece is a monster who loves spamming spawn eggs and should never be allowed access to creative mode. And I don’t even have the two rack units of disk arrays I bought at auction powered up yet because they need 240V which I don’t have handy. I guess someone could do the math on what 48 enterprise SAS drives will pull if they need to satisfy their curiosity, I’m not sure I want to. I will hook them up someday but for now ignorance is bliss. All I know is it’s a lot, and there’s stuff I’m not even including in this.



  • Short answer: You can’t. Longer answer: Accept it if/when it happens, but don’t make yourself an attractive target, and don’t put yourself in a position where it’s going to cost you money if it does. The hype of LLM scrapers is largely overblown for small personal static pages. LLM training wants fresh, data-heavy content. If they are scraping your smolweb site you’re either updating it with and hosting rich content far too frequently, or it’s an error on their part out of pure ignorance and laziness. That doesn’t mean it can’t happen, but also, what actual harm does it do to have a dozen scrapers hitting your site every second? (this is an exaggeration it’s likely not going to be that bad) How big is your smolweb page and images? A few dozen kilobytes? What’s your bandwidth limit, and what happens when you hit the cap? If you’re worried about hitting the cap too quickly, this can be straightforwardly managed by per-IP rate limiting and throttling if necessary to keep things under a cap and allow fair access to gentler users. But when you’re only hosting small files, most connections have plenty of bandwidth to handle scraping until they realize how pointless it is and give up, and it probably won’t be necessary.

    I run about 20 small websites, all public and searchable, with no protections at all. Most of them are rarely updated and have been static for years, I just checked my traffic logs for the last day: ~14,000 hits. That may sound like a lot, but for a request that takes milliseconds to deliver, a computer sitting around not doing anything for the many seconds in between each of those requests is probably bored. Many different scrapers are obviously buried in that traffic, but they’re not the overwhelming horror that people make them out to be, at least in my experience.

    Anubis potentially makes sense on social media sites like Lemmy that are hosting large numbers of users and user-generated content. This stuff is like manna from heaven for LLM bots. Same with code repositories like forgejo. They are very attractive targets for scrapers, with lots of frequent updates that require frequent scraping and also lots of very large files for it to download and ingest. This will absolutely hammer your bandwidth if the scrapers find you an attractive target and they are stupid (which they are).

    But smolweb? Honestly, I hate to break it to you but nobody cares that much, not even LLMs.



  • I believe you, and I feel for you. The saddest part about AI is how it has tainted all high-effort, carefully organized work to the point that it makes it hard to distinguish between the most trustworthy content and the least trustworthy. We need better tools for information provenance. Like I said, the first thing I did was look into your backgrounds to try to understand “is this some AI slop bot or a real person with a real brain” and everything I looked at suggested it being legit and that’s why I decided to give you the benefit of the doubt. But that doubt is everywhere nowadays. It’s rough out there.



  • That’s great for production AWS managed services, but that still sounds like the opposite of self-hosting to me, I don’t need scaling like that, I’m not lying when I admit I’m using sshfs (which was a slightly tongue-in-cheek counterpoint to s3) and despite everyone dunking on it, it is in fact working perfectly at my scale. I know I’ve been downvoted to purgatory but I still stand by my original comment. I don’t understand why you would need S3 or S3 compatibility in a self-hosting context. The closest someone has come to explaining it is the guy who said choice is good… like, yeah, it’s good to have the choice I guess, but… still doesn’t seem like a great choice for self-hosting. I appreciate you trying to explain it but I feel like everyone is missing the self-hosting context here. For a little home lab I simply don’t see the value. Why are people promoting AWS and AWS-adjacent services here?





  • cecilkorik@lemmy.ca
    cake
    toSelfhosted@lemmy.worldKitchenOwl Gone?
    link
    fedilink
    English
    arrow-up
    17
    arrow-down
    9
    ·
    2 months ago

    He’s still right that it’s weird people like you are going to bat to defend them. Microsoft sucks. It must get tiring if you have to call out every inaccurate thing everyone says to try to tear them down. The important take-home message is that we need to tear them down, they suck.

    You don’t see people bothering to defend Epstein, for example. Even though there’s lots of inaccurate stuff going around, there’s enough accurate stuff to be absolutely confident he was an absolute loathsome piece of shit not worth defending. Not worth the effort to defend. Why bother?

    What do you see in Microsoft that you think is worth defending? Github is shit, and it’s evil. Let it go.


  • Private trackers are like the Matrix’s “zion”. When civilization collapses into a dystopian surveillance capitalism hellscape and the AIs and fascist governments take over the net, the last free humans will be hiding in private tracker communities, sharing freely and building a resistance. Will we have mechs with gatling guns? I don’t know, all I can say is I hope so because it looks like we’re going to need it.