feat(forgejo): Expire repository archives faster #7

Merged
histalek merged 1 commit from expire-archives-faster into main 2025-02-02 14:25:50 +01:00
Owner

This is needed to limit the amount of data on our forgejo host.

Currently our repositories take up ~2GB of storage while the repository
archives take up ~29GB.
This most likely happens because of scraper bots [1].

With this change we now expire repo archives older than 2 hours and
clean those up every hour. This should keep the disk space used at least
somewhat limited. Config ref [2]

In addition to this we might want to deploy a proper robots.txt [3] to
at least keep good bots out.

[1] https://codeberg.org/forgejo/forgejo/issues/923
[2] https://forgejo.org/docs/latest/admin/config-cheat-sheet/#cron---cleanup-old-repository-archives-cronarchive_cleanup
[3] https://code.forgejo.org/robots.txt

This is needed to limit the amount of data on our forgejo host. Currently our repositories take up ~2GB of storage while the repository archives take up ~29GB. This most likely happens because of scraper bots [1]. With this change we now expire repo archives older than 2 hours and clean those up every hour. This should keep the disk space used at least somewhat limited. Config ref [2] In addition to this we might want to deploy a proper robots.txt [3] to at least keep good bots out. [1] https://codeberg.org/forgejo/forgejo/issues/923 [2] https://forgejo.org/docs/latest/admin/config-cheat-sheet/#cron---cleanup-old-repository-archives-cronarchive_cleanup [3] https://code.forgejo.org/robots.txt
feat(forgejo): Expire repository archives faster
All checks were successful
ci/woodpecker/pr/ansible-lint Pipeline was successful
ci/woodpecker/push/ansible-lint Pipeline was successful
d13591d215
This is needed to limit the amount of data on our forgejo host.

Currently our repositories take up ~2GB of storage while the repository
archives take up ~29GB.
This most likely happens because of scraper bots [1].

With this change we now expire repo archives older than 2 hours and
clean those up every hour. This should keep the disk space used at least
somewhat limited. Config ref [2]

In addition to this we might want to deploy a proper robots.txt [3] to
at least keep good bots out.

[1] https://codeberg.org/forgejo/forgejo/issues/923
[2] https://forgejo.org/docs/latest/admin/config-cheat-sheet/#cron---cleanup-old-repository-archives-cronarchive_cleanup
[3] https://code.forgejo.org/robots.txt
requested review from saibotk 2025-02-02 09:04:54 +01:00
saibotk approved these changes 2025-02-02 14:22:11 +01:00
saibotk left a comment
Owner

LGTM.

For the robots.txt i created another issue #8

LGTM. For the robots.txt i created another issue #8
histalek deleted branch expire-archives-faster 2025-02-02 14:25:50 +01:00
Author
Owner

deployed and LOL
image

deployed and LOL ![image](/attachments/85af18c8-b1f1-4b83-95d6-d4921120500f)
Owner

Wild West Web 😆

Wild West Web 😆
Sign in to join this conversation.
No reviewers
No labels
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference: SipsOfCode/infrastructure#7
No description provided.