Ugh, apparently yesterday a bot visited my Forgejo instance and queried everything, which caused Forgejo to create repo archives for everything. Git on the instance is 2.1 GB in size, but the repo archive filled up everything and is 120 GB. I really didn’t expect such a spike.

That meant that it filled up the whole hard drive and the server and all the services and websites on it went down while I was sleeping.

Luckily it seems that just deleting that directory fixes the problem temporarily. I also disabled the possibility of downloading archived from the UI but I’m not sure if this will prevent bots from generating those archives again. I also can’t just make the directory read only because it uses it for other things like mirroring, etc too.

For small instances like mine those archives are quite a headache.

  • harsh3466@lemmy.ml
    link
    fedilink
    English
    arrow-up
    31
    arrow-down
    1
    ·
    2 months ago

    Not saying this is an option for you, only that I kept my forgejo instance private to avoid dealing with this AI crawler bullshit. I hope you find a good solution.

    • Jeena@piefed.jeena.netOP
      link
      fedilink
      English
      arrow-up
      16
      ·
      2 months ago

      Yeah I understand, but the whole point of me hosting my instance was to make my code public.

      • omegabyte@piefed.zip
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 months ago

        I appreciate that you make your stuff public. I can’t find the specific repos right now but I know I’ve referenced your code for various fediverse things I’ve dabbled in over the last year or so.

        • Jeena@piefed.jeena.netOP
          link
          fedilink
          English
          arrow-up
          18
          ·
          2 months ago

          Codeberg is a instance of forgejo, I run my own instance because I don’t want to be dependent on others.

      • harsh3466@lemmy.ml
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        1 month ago

        I don’t know the specifics but forgejo is a gitea fork. There was/is some controversy around gitea governance and movent towards prioritizing a closed source paid/private versions of gitea.

        Again, I don’t know details, just very broad strokes. I chose forgejo because it’s under active Foss development and I didnt want to deal with potentially going with gitea and then having to abandon it later for whatever reason might develop.