This is an unpopular opinion, and I get why – people crave a scapegoat. CrowdStrike undeniably pushed a faulty update demanding a low-level fix (booting into recovery). However, this incident lays bare the fragility of corporate IT, particularly for companies entrusted with vast amounts of sensitive personal information.

Robust disaster recovery plans, including automated processes to remotely reboot and remediate thousands of machines, aren’t revolutionary. They’re basic hygiene, especially when considering the potential consequences of a breach. Yet, this incident highlights a systemic failure across many organizations. While CrowdStrike erred, the real culprit is a culture of shortcuts and misplaced priorities within corporate IT.

Too often, companies throw millions at vendor contracts, lured by flashy promises and neglecting the due diligence necessary to ensure those solutions truly fit their needs. This is exacerbated by a corporate culture where CEOs, vice presidents, and managers are often more easily swayed by vendor kickbacks, gifts, and lavish trips than by investing in innovative ideas with measurable outcomes.

This misguided approach not only results in bloated IT budgets but also leaves companies vulnerable to precisely the kind of disruptions caused by the CrowdStrike incident. When decision-makers prioritize personal gain over the long-term health and security of their IT infrastructure, it’s ultimately the customers and their data that suffer.

  • yeehaw@lemmy.ca
    link
    fedilink
    English
    arrow-up
    5
    ·
    5 months ago

    This is a good solution for these types of scenarios. Doesn’t fit all though. Where I work, 85% of staff work from home. We largely use SaaS. I’m struggling to think of a good method here other than walking them through reinstalling windows on all their machines.

    • LrdThndr@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      5 months ago

      That’s still 15% less work though. If I had to manually fix 1000 computers, clicking a few buttons to automatically fix 150 of them sounds like a sweet-ass deal to me even if it’s not universal.

      You could also always commandeer a conference room or three and throw a switch on the table. “Bring in your laptop and go to conference room 3. Plug in using any available cable on the table and reboot your computer. Should be ready in an hour or so. There’s donuts and coffee in conference room 4.” Could knock out another few dozen.

      Won’t help for people across the country, but if they’re nearish, it’s not too bad.

      • yeehaw@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        5 months ago

        Not a lot of nearish. It would be pretty bad if this happened here.

    • John Richard@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      6
      ·
      edit-2
      5 months ago
      1. Configure PXE to reboot into recovery image, push out command to remove bad file. Reboot. Done. Workstation laptops usually have remote management already.

      or

      1. Have recovery image already installed. Have user reboot & push key to boot into recovery. Push out fix. Done.
      • yeehaw@lemmy.ca
        link
        fedilink
        English
        arrow-up
        3
        ·
        5 months ago

        I had no idea you could remotely configure pxe to reboot into a recovery image and run a script. How do you do this?

      • LrdThndr@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        edit-2
        5 months ago

        Fuck yeah. Even better than reimage. That’s creative as fuck and I love it.