Does lemmy have any communities dedicated to archiving/hoarding data?

    • clif@lemmy.world
      link
      fedilink
      arrow-up
      45
      ·
      edit-2
      8 days ago

      Last time I updated it was closer to 120GB but if you’re not sweating 100 GB then an extra 20 isn’t going to bother anyone these days.

      Also, thanks for reminding me that I need to check my dates and update.

      EDIT: you can also easily configure a SBC like a Raspberry Pi (or any of the clones) that will boot, set the Wi-Fi to access point mode, and serve kiwix as a website that anyone (on the local AP wifi network) can connect to and query… And it’ll run off a USB battery pack. I have one kicking around the house somewhere

      • techwithjake@sh.itjust.works
        link
        fedilink
        arrow-up
        14
        ·
        8 days ago

        Just built one of those using Dietpi as the OS and NVME M.2 for the storage. I have many different ZIMs and running different services and only using about 270GB.

        Works great for offline use. Probably should add an ISO or 2 as well.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          edit-2
          8 days ago

          What other services are you running?

          @fmstrat@lemmy.world asked what else I was running in a sibling comment to yours and I didn’t have an answer because I’m not… yet : )

          • techwithjake@sh.itjust.works
            link
            fedilink
            arrow-up
            9
            ·
            8 days ago

            DietPi makes it dead simple to run most of these things as their “software suite” is pretty robust and simple to setup.

            For “user facing” applications:

            • Homer Dashboard as the landing page when going to the .local address in a browser
            • Kiwix for the ZIMs
            • Hedgedoc for personal note taking/wiki
            • Lychee photos for a very lightweight photo album maker/viewer for keepsake photos.

            For “admin side” stuff:

            • Portainer to manage the containers/stacks
            • Watchtower to auto-update the containers while they’re still network connected
            • Transmission daemonized to download and seed the ZIMs or anything else non-pirate related
            • Use jojo2357’s ZIM updater to auto-update ZIMs via cron job while they’re still network connected
            • DietPi-Dashboard as an all-in-one dashboard to monitor and control the RPi from a web interface. (Yeah I know I can do everything SSH’ing in but I’m lazy.)
            • File Browser just in case I want other people to have access to files but since it’s in maintenance mode and I’m unsure I want others to have access, might strip it out

            I try to use containers from LinuxServer.io whenever possible. Mostly just cause it’s what I do on my main server.

            I’m still looking at adding/removing things as I get more time to sit down but I’m pretty happy with it’s current state.

      • Fmstrat@lemmy.world
        link
        fedilink
        arrow-up
        4
        ·
        edit-2
        8 days ago

        Do you recommend adding anything else to it?

        For instance, OSM maps?

        I’ve been thinking about running the Kiwix app + OSMAnd on an old Android phone and auto updating it once a year.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          2
          ·
          8 days ago

          That’s a good question (and good idea) that I hadn’t really thought about past a collection of ZIMs. The one I built advertises it’s own AP SSID that anyone can connect to and then access the ZIMs that are served via kiwix-serve on HTTP/80. That is, I wanted a single, low power, headless device that multiple people could use simultaneously via wifi and browser rather than a personal device.

          I hadn’t really thought about other helpful services past that. I mean, we’ve got a (wee) server so why not use it? I like the idea of OSM and their website is open source but has a lot of dependencies :

          openstreetmap-website is a Ruby on Rails application that uses PostgreSQL as its database, and has a large number of dependencies for installation

          A fully-functional openstreetmap-website installation depends on other services, including map tile servers and geocoding services, that are provided by other software. The default installation uses publicly-available services to help with development and testing.

          I wonder how hard it would be to host everything it needs locally/offline… and what that would do to power consumption : )

          Thanks for the idea - something to look into, for sure.

            • clif@lemmy.world
              link
              fedilink
              arrow-up
              2
              ·
              7 days ago

              Yeah, I feel the same in that it’s assuredly doable, but how hard is it?

              If you’re able to dig into and make some progress, please tag me because I’m interested but don’t have much time these days.

          • Fmstrat@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            8 days ago

            I might beat you to it. I’ve got Kiwix running in docker, just did a PR to the kiwix-zim-updater so it can run in Docker on a cron schedule next to the server, and have spun those up with Karakeep (self-hosted web archive I use for bookmarking).

            Right now I’m adding a ZIM list feature to the updater to list available ZIMs by language, and then I’ll move on to OSM.

    • mistermodal@lemmy.ml
      link
      fedilink
      arrow-up
      29
      ·
      8 days ago

      Yeah also if you make a Zim wiki or convert a website into Zim then you can run that stuff too. If you use Emacs it’s easy to convert some pages to wikitext for Zim too

    • Fmstrat@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      edit-2
      8 days ago

      120GB not including Wikimedia 😉

      Also, I wish they included OSM maps, not just the wiki.

    • Random Dent@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 days ago

      You can also offline the whole of Project Gutenberg with Kiwix, it’s about 70GB IIRC.

    • Gigasser@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      8 days ago

      I wonder if there’s anyways to edit these files afterwards? They tend to be read only, right? I must confess, I don’t have too much experience with this myself.

      • Prathas@lemmy.zip
        link
        fedilink
        arrow-up
        4
        ·
        8 days ago

        It’s probably hundreds of thousands of HTML files, no? What is the fear about being able to edit or not?

          • Prathas@lemmy.zip
            link
            fedilink
            arrow-up
            1
            ·
            7 days ago

            Okay, I’m unfamiliar with both. Well, I still don’t understand why read-only state matters; are you concerned about tampering?

            • Gigasser@lemmy.world
              link
              fedilink
              arrow-up
              1
              ·
              6 days ago

              Well I think it would be cool to be able to fix/edit any inaccurate articles, or pages that may have been messed with by trolls, or to update with more up to date info.

              • Prathas@lemmy.zip
                link
                fedilink
                arrow-up
                1
                ·
                5 days ago

                Oh, well, yeah, you can do that with Wikipedia as it is. I would be surprised if you couldn’t edit a local file.

  • Dr. Bluefall@toast.ooo
    link
    fedilink
    English
    arrow-up
    109
    ·
    9 days ago

    The English Language Wikipedia probably wouldn’t be hard, or Debian Stable.

    All of Debian’s packages might be a tad more expensive, though.

  • pyrflie@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    83
    ·
    edit-2
    8 days ago

    Welcome to datahoarders.

    We’ve been here for decades.

    Also follow 3-2-1 people. 3 Backups, 2 storage mediums, 1 offsite.

  • Pumpkin Escobar@lemmy.world
    link
    fedilink
    English
    arrow-up
    53
    ·
    8 days ago

    I stumbled across this sort of fascinating area of doomsday prepping a few weeks back.

    https://prepperpress.com/usb/

    A nice addition to that, don’t just make it a USB, but a raspberry pi. So you’d have a reasonably low-powered computer you could easily take with you.

    Not suggesting this one as it seems a bit expensive to me, but https://www.prepperdisk.com/products/prepper-disk-premium-over-512gb-of-survival-content?view=sl-8978CA41

    • techwithjake@sh.itjust.works
      link
      fedilink
      arrow-up
      21
      ·
      edit-2
      8 days ago

      Just built one of these myself. I went NVME M.2 instead of SD Card to avoid data corruption. I know SD Cards are fine if you don’t write to them a lot but if you wanna update or add your own stuff, scares me. Plus NVME is just so much faster.

        • Bob Robertson IX @discuss.tchncs.de
          link
          fedilink
          English
          arrow-up
          18
          ·
          8 days ago

          You find a generator, or solar panels, or wind mill, or water turbine, or a bicycle hooked up to a generator.

          If electricity permanently goes out then we’re in a scavenger situation and it is time to start taking apart things that are no longer necessary to build the things that are.

          • Trainguyrom@reddthat.com
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            7 days ago

            start taking apart things that are no longer necessary to build the things that are

            Hey finally a good use for all of those cars, grab the alternators out for small generators (since bicycles are the ultimate apocalypse vehicle: simple, small, easy to maintain and don’t require complex fuels)

        • Sinthesis@lemmy.today
          link
          fedilink
          English
          arrow-up
          12
          ·
          8 days ago

          You only need 20 watts of power. One of those dinky fold up solar panels would work. Add a USB power brick for cloudy days.

                • notabot@piefed.social
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  ·
                  8 days ago

                  Eink displays are pretty awesome for this sort of thing, I repuposed a kobo ereader as a household info display and it worked nicely. Those PaPiRus screens look easier to interface with, but a little small for reading wikipedia articles. They’d do in a pinch, but the eyestrain would have me looking for a bigger solution.

        • techwithjake@sh.itjust.works
          link
          fedilink
          arrow-up
          11
          ·
          8 days ago

          Pretty much what Sinthesis said; USB power brick and/or solar panels. Both at the ready and tested. Also got a big ass battery backup that will charge off solar panels.

      • boonhet@sopuli.xyz
        link
        fedilink
        arrow-up
        3
        ·
        8 days ago

        You’d first have to buy a phone that can run postmarketos and these are much rarer than I wish they were. Is there even anything new that can run it? Pine64 stopped making phones and said they’ll make a new one when they can make it RISC-V.

        Fairphone maybe I guess. 4 is listed as a supported device, but someone has gotten it working on 6 too.

        • Jumuta@sh.itjust.works
          link
          fedilink
          arrow-up
          1
          ·
          7 days ago

          there’s lots of devices it runs on iirc, something like the pixel 3a can be had for less than a new rpi3b+ where I live

          • boonhet@sopuli.xyz
            link
            fedilink
            arrow-up
            1
            ·
            7 days ago

            But hardware ages and dies. Will you trust a pixel 3a for the next 10 years? I’d rather have a new device for this.

      • techwithjake@sh.itjust.works
        link
        fedilink
        arrow-up
        2
        ·
        8 days ago

        Cause if ya wanna go overboard like I did, 1TB of NVME storage, can add with SD Card if necessary. 16GB RAM. Very little learning curve for my part as I use SBCs often. Plus almost every Docker container and program I want works on RPi without any hassle.

        There’s also more robust guides and community for RPi.

        Just my thoughts.

  • AnimalsDream@slrpnk.netOP
    link
    fedilink
    English
    arrow-up
    51
    arrow-down
    5
    ·
    8 days ago

    Curious about the mindset of the one (so far) person who has downvoted this post. What is there to dislike about archiving Linux and Wikipedia? 🤔

  • Gerowen@lemmy.world
    link
    fedilink
    English
    arrow-up
    41
    ·
    edit-2
    8 days ago

    Neither are that bad honestly. I have jigdo scripts I run with every point release of Debian and have a copy of English Wikipedia on a Kiwix mirror I also host. Wikipedia is a tad over 100 GB. The source, arm64 and amd64 complete repos (DVD images) for Debian Trixie, including the network installer and a couple live boot images, are 353 GB.

    Kiwix has copies of a LOT of stuff, including Wikipedia on their website. You can view their zim files with a desktop application or host your own web version. Their website is: https://kiwix.org/

    If you want (or if Wikipedia is censored for you) you can also look at my mirror to see what a web hosted version looks like: https://kiwix.marcusadams.me/

    Note: I use Anubis to help block scrapers. You should have no issues as a human other than you may see a little anime girl for a second on first load, but every once and a while Brave has a disagreement with her and a page won’t load correctly. I’ve only seen it in Brave, and only rarely, but I’ve seen it once or twice so thought I’d mention it.

    • trashboat@midwest.social
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      8 days ago

      I rarely get bounced by Anubis, but oddly enough it has happened to me a couple times in FF, I suspect it’s the fingerprinting resistance settings that cause this to happen? Hasn’t happened in a while though

  • GreenKnight23@lemmy.world
    link
    fedilink
    arrow-up
    41
    arrow-down
    1
    ·
    8 days ago

    I have been archiving Linux builds for the last 20 years so I could effectively install Linux on almost any hardware since 1998-ish.

    I have been archiving docker images to my locally hosted gitlab server for the past 3-5 years (not sure when I started tbh). I’ve got around 100gb of images ranging from core images like OS to full app images like Plex, ffmpeg, etc.

    I also have been archiving foss projects into my gitlab and have been using pipelines to ensure they remain up-to-date.

    the only thing I lack are packages from package managers like pip, bundler, npm, yum/dnf, apt. there’s just so much to cache it’s nigh impossible to get everything archived.

    I have even set up my own local CDN for JS imports on HTML. I use rewrite rules in nginx to redirect them to my local sources.

    my goal is to be as self-sustaining on local hosting as possible.

    • Foster Hangdaan@lemmy.hangdaan.com
      link
      fedilink
      arrow-up
      5
      ·
      edit-2
      7 days ago

      Everyone should have this mindset regarding their data. I always say to my friends and family, “If you like it, download it.”. The internet is always changing and that piece of media that you like can be moved, deleted, or blocked at any time.

  • Retro_unlimited@lemmy.world
    link
    fedilink
    arrow-up
    33
    ·
    8 days ago

    I also recommend downloading “Flashpoint archive” to have flash games and animations to stay entertained.

    There is a 4gb version and a 2.3TB version.

  • Maroon@lemmy.world
    link
    fedilink
    arrow-up
    32
    ·
    8 days ago

    I thought the whole point of torrenting was to decentralise distribution. I use torrents to get my distros.

    In my own little bubble, I thought that’s how most people got their distro.

    • prole@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      15
      arrow-down
      1
      ·
      edit-2
      8 days ago

      What happens when they just cut the underwater cables? Torrent over carrier pigeon for a linux distro would take ages

      • hayvan@feddit.nl
        link
        fedilink
        arrow-up
        20
        ·
        8 days ago

        Sneakernet to the rescue. Some of you are too young to know about walking around with boxes full of disks.

        • oppy1984@lemdro.id
          link
          fedilink
          English
          arrow-up
          6
          ·
          8 days ago

          It was trading CD-R’s during my high school days… good times. Napster was just starting to take off by the time we had a CD-R trading network set up, Napster just increased the amount of CD’s that got passed around.

      • Sestren@lemmy.world
        link
        fedilink
        arrow-up
        10
        ·
        8 days ago

        Pigeon latency is horrible, but the bandwidth is pretty great. You could probably load up an adult pigeon with at least 12TB of media.

      • DarkAri@lemmy.blahaj.zone
        link
        fedilink
        arrow-up
        9
        ·
        8 days ago

        A good way to see what the future of places like the U.S are is to look at places like North Korea, where they do exactly this, move files around on flash media to avoid the state censors.

  • utopiah@lemmy.world
    link
    fedilink
    arrow-up
    31
    ·
    8 days ago

    FWIW :

    fabien@debian2080ti:/media/fabien/slowdisk$ ls -lhS offline_prep/
    total 341G
    -rw-r--r-- 1 fabien fabien 103G Jul  6  2024 wikipedia_en_all_maxi_2024-01.zim
    -rw-r--r-- 1 fabien fabien  81G Apr 22  2023 gutenberg_mul_all_2023-04.zim
    -rw-r--r-- 1 fabien fabien  75G Jul  7  2024 stackoverflow.com_en_all_2023-11.zim
    -rw-r--r-- 1 fabien fabien  74G Mar 10  2024 planet-240304.osm.pbf
    -rw-r--r-- 1 fabien fabien 3.8G Oct 18 06:55 debian-13.1.0-amd64-DVD-1.iso
    -rw-r--r-- 1 fabien fabien 2.6G May  7  2023 ifixit_en_all_2023-04.zim
    -rw-r--r-- 1 fabien fabien 1.6G May  7  2023 developer.mozilla.org_en_all_2023-02.zim
    -rw-r--r-- 1 fabien fabien 931M May  7  2023 diy.stackexchange.com_en_all_2023-03.zim
    -rw-r--r-- 1 fabien fabien 808M Jun  5  2023 wikivoyage_en_all_maxi_2023-05.zim
    -rw-r--r-- 1 fabien fabien 296M Apr 30  2023 raspberrypi.stackexchange.com_en_all_2022-11.zim
    -rw-r--r-- 1 fabien fabien 131M May  7  2023 rapsberry_pi_docs_2023-01.zim
    -rw-r--r-- 1 fabien fabien 100M May  7  2023 100r-off-the-grid_en_2022-06.zim
    -rw-r--r-- 1 fabien fabien  61M May  7  2023 quantumcomputing.stackexchange.com_en_all_2022-11.zim
    -rw-r--r-- 1 fabien fabien  45M May  7  2023 computergraphics.stackexchange.com_en_all_2022-11.zim
    -rw-r--r-- 1 fabien fabien  37M May  7  2023 wordnet_en_all_2023-04.zim
    -rw-r--r-- 1 fabien fabien  23M Jul 17  2023 kiwix-tools_linux-armv6-3.5.0-1.tar.gz
    -rw-r--r-- 1 fabien fabien  16M Oct  6 21:32 be-stib-gtfs.zip
    -rw-r--r-- 1 fabien fabien 3.8M Oct  6 21:32 be-sncb-gtfs.zip
    -rw-r--r-- 1 fabien fabien 2.3M May  7  2023 termux_en_all_maxi_2022-12.zim
    -rw-r--r-- 1 fabien fabien 1.9M May  7  2023 kiwix-firefox_3.8.0.xpi
    
    

    but if you want the easier version just get Kiwix on whatever device in front of you right now (yes, even mobile phone assuming you have the space) then get whatever content you need.

    If need a bit of help I recorded TechSovereignty at home, episode 11 - Offline Wikipedia, Kiwix and checksums with a friend just 3 weeks ago.

    I also wrote randomly update https://fabien.benetou.fr/Content/Vademecum and coded https://git.benetou.fr/utopiah/offline-octopus but tbh KDE-Connect is much better now.

    The point though is having such a repository takes minutes. If you don’t have the space, buy a 512Go microSD for 50EUR then put that on, stuff it in a drawer then move on. If you want to every 3 months or whenever you feel like it, updated it.

    TL;DR: takes longer to write such a meme than actually do it.

    • LH0ezVT@sh.itjust.works
      link
      fedilink
      arrow-up
      9
      ·
      8 days ago

      Watch out for flash data corruption. Lots of cheap flash (USB sticks, SD cards, SSDs) lose data after just a few years of offline storage. Something something quantum tunnel bullshit, iirc.

      So either look for media that guarantee long cold storage retention (lots of businesses need to keep shit for 10 years for tax reasons), or occasionally plug it in and let do the housekeeping.

      • b000rg@midwest.social
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        8 days ago

        It’s more that flash NAND uses a small electric charge to keep the NAND gates in the correct configuration. Over time, that charge dissipates. If you power the storage device every once in a while, you minimize these chances.

        Here’s a video explaining why it happens to Wii U’s after being powered off for a while. https://youtu.be/JHME4zLs6Qs

      • DarkAri@lemmy.blahaj.zone
        link
        fedilink
        arrow-up
        2
        ·
        8 days ago

        User older flash tech can be useful here. You might not always need the highest density storage if you want to maintain files for a long time. Getting stuff built in a much larger process node makes for a much more stable form of storage.

        • LH0ezVT@sh.itjust.works
          link
          fedilink
          arrow-up
          2
          ·
          5 days ago

          Or look for industrial / business grade stuff with long retention times. Old flash also means less sophisticated controllers etc

      • utopiah@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        8 days ago

        Thanks but even though it’s on a plugged HDD I don’t even care for any of that data. What I mean is that none of that data is sensitive. It might be useful, potentially, but it’s not unique. What I mean is that if somehow my .zim file for Wikipedia was corrupted I could download it again from https://library.kiwix.org/#lang=eng&category=wikipedia or elsewhere in ~30min (just checked).

        What I’m trying to highlight here is more the process than the actual outcome.

        TL;DR: yes, if one is actually serious about just getting and storing, they should verify periodically if the data is indeed fine. What I do want to highlight though is to first know how to do it at all. Anyway, you are right that for a proper solution on the long run one must understand how (cold) storage actually works. My heuristic is that it’s like can food (which I don’t use much), it might last a while, but not forever.

        • prole@lemmy.blahaj.zone
          link
          fedilink
          arrow-up
          3
          ·
          8 days ago

          I thought the point of backing stuff up was to have things in case just downloading it again isn’t a viable option?

          • utopiah@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            edit-2
            8 days ago

            It can be but not to me. To me the point is to test what’s actually feasible and usable. It can be Wikipedia on my HDD but it could also be SO on a microSD or a RPi … or it could be something totally different on another piece of hardware with another piece of storage. It will depend on the context.

            So again, sure, having the data itself feels nice but in practice I never really needed it. If tomorrow my HDD would die I would shrug. If tomorrow Kiwix library wouldn’t work anymore, I’d be disappointed but I could rely on .zim file elsewhere, e.g. on torrent trackers.

            IMHO the point isn’t files, the point is usable knowledge.

            Edit : to be clear this isn’t philosophy, you can see exactly what I mean and even HOW I do it (and even when) with the edits of my public wiki or my git repositories.

      • utopiah@lemmy.world
        link
        fedilink
        arrow-up
        5
        ·
        8 days ago

        Commenting inline :

        -rw-r--r-- 1 fabien fabien 103G Jul  6  2024 wikipedia_en_all_maxi_2024-01.zim
        # encyclopedia Wikipedia English with images and more
        -rw-r--r-- 1 fabien fabien  81G Apr 22  2023 gutenberg_mul_all_2023-04.zim
        # Project Gutenberg, book collection in multiple languages
        -rw-r--r-- 1 fabien fabien  75G Jul  7  2024 stackoverflow.com_en_all_2023-11.zim
        # StackOverflow, programming questions and answers
        -rw-r--r-- 1 fabien fabien  74G Mar 10  2024 planet-240304.osm.pbf
        # OpenStreetMap low resolution for the whole World
        -rw-r--r-- 1 fabien fabien 3.8G Oct 18 06:55 debian-13.1.0-amd64-DVD-1.iso
        # Debian base ISO
        -rw-r--r-- 1 fabien fabien 2.6G May  7  2023 ifixit_en_all_2023-04.zim
        # iFixit colection of guides to fix appliances
        -rw-r--r-- 1 fabien fabien 1.6G May  7  2023 developer.mozilla.org_en_all_2023-02.zim
        # Web development documentation
        -rw-r--r-- 1 fabien fabien 931M May  7  2023 diy.stackexchange.com_en_all_2023-03.zim
        # Do It Yourself Q&A
        -rw-r--r-- 1 fabien fabien 808M Jun  5  2023 wikivoyage_en_all_maxi_2023-05.zim
        # WikiVoyage, the version of Wikipedia for traveling
        -rw-r--r-- 1 fabien fabien 296M Apr 30  2023 raspberrypi.stackexchange.com_en_all_2022-11.zim
        # Raspberry Pi Q&A
        -rw-r--r-- 1 fabien fabien 131M May  7  2023 rapsberry_pi_docs_2023-01.zim
        # Rasspberry Pi documentation
        -rw-r--r-- 1 fabien fabien 100M May  7  2023 100r-off-the-grid_en_2022-06.zim
        # Off the grid documents
        -rw-r--r-- 1 fabien fabien  61M May  7  2023 quantumcomputing.stackexchange.com_en_all_2022-11.zim
        # Quantum computer Q&A
        -rw-r--r-- 1 fabien fabien  45M May  7  2023 computergraphics.stackexchange.com_en_all_2022-11.zim
        # Computer graphics Q&A
        -rw-r--r-- 1 fabien fabien  37M May  7  2023 wordnet_en_all_2023-04.zim
        # Graph of words in English
        -rw-r--r-- 1 fabien fabien  23M Jul 17  2023 kiwix-tools_linux-armv6-3.5.0-1.tar.gz
        # Kiwix to read .zim files
        -rw-r--r-- 1 fabien fabien  16M Oct  6 21:32 be-stib-gtfs.zip
        # public transport database in Brussels, Belgium
        -rw-r--r-- 1 fabien fabien 3.8M Oct  6 21:32 be-sncb-gtfs.zip
        # train transport database in Belgium
        -rw-r--r-- 1 fabien fabien 2.3M May  7  2023 termux_en_all_maxi_2022-12.zim
        # Termux, Linux tooling on Android, documentation in English
        -rw-r--r-- 1 fabien fabien 1.9M May  7  2023 kiwix-firefox_3.8.0.xpi
        # Kiwix Web Extension for the Firefox browser
        
        • u/lukmly013 💾 (lemmy.sdf.org)@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 days ago

          By the way, there’s now a Wikipedia 2025 snapshot.
          I am currently trying to fit that on my phone somehow. I wish I could just omit the index database at the end that can’t be split it seems. I have to keep it, but when it’s split up, it doesn’t work anyway (search is broken that way) (https://github.com/openzim/zim-tools/issues/295).
          My phone can only do FAT32 for SD cards…

          For 2024 Wikipedia, that seems to be around 18GiB of wasted space.

          • utopiah@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            3 days ago

            Thanks, updating (~20min) accordingly.

            FWIW I have a CMF Nothing 1 and I can put a 500Go microSD in it.

              • utopiah@lemmy.world
                link
                fedilink
                arrow-up
                1
                ·
                3 days ago

                Weird, assuming you have Android 13 it should be usable at least as exFAT and thus can be large enough

                • u/lukmly013 💾 (lemmy.sdf.org)@lemmy.sdf.org
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  edit-2
                  3 days ago

                  Unfortunately, this is rather dependent on manufacturer (or rather how much they can fuck up).
                  Android 14, but without exFAT support.
                  I tried multiple, exFAT, ext4, f2fs, NTFS, nothing else works.

  • juipeltje@lemmy.world
    link
    fedilink
    arrow-up
    30
    ·
    8 days ago

    Yeah not gonna lie, i think i heard someone in a youtube video a while back talk about how the entirety of wikipedia takes up like 200 gigs or something like that, and it got me seriously considering to actually make that offline backup. Shit is scary when countries like the uk are basically blocking you from having easy access to knowledge.

    • mic_check_one_two@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      10
      ·
      8 days ago

      Yeah, it’s surprisingly small when it’s compressed if you exclude things like images and media. It’s just text, after all. But the high level of compression requires special software to actually read without uncompressing the entire archive. There are dedicated devices you can get, which pretty much only do that. Like there are literal Wikipedia readers, where you just give it an archive file and it’ll allow you to search for and read articles.

          • mfed1122@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            3
            ·
            8 days ago

            If my experience with mashing the random article button is any indicator, you could reduce the size by 30% just by removing articles on sports players. I doubt I’ll need those

            • potoooooooo ☑️@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              8 days ago

              “Fellow survivors, oh my God! What are your names?”

              “I’m OJ Simpson. This is my friend Aaron Hernandez. And this is his car, Christine.”

    • palordrolap@fedia.io
      link
      fedilink
      arrow-up
      7
      ·
      8 days ago

      UKGOV haven’t started on things like Wikipedia yet. They know kids use it for school and blinded by ideology though they are, even they can see there’d be an enormous backlash if they blocked it any time soon.

      If that’s going to happen at all, I doubt it would be before the next election. That’s whether Labour get re-elected or the Tories make an unexpected comeback. You can tell how far Labour have fallen in the eyes of their party faithful when they’ve taken a Tory-drafted policy and made it their own.

      Ironically, the up and coming third option fascist party, have said they’re going to repeal the Online Safety Act. They have other fish to fry if they get in, and they’ll want to keep their preferred demographic(s) happy while they do it.

      I assume that eventually something like the OSA would come back to “protect the children”. They love the current US President.

      None of this is hopeful. Take this as more of a rant.

        • Isthisreddit@lemmy.world
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          3
          ·
          edit-2
          8 days ago

          Conservatives hate knowledge, learning is toxic to them. Also the people who start with burning books usually end up burning people eventually

          • TangledHyphae@lemmy.world
            link
            fedilink
            arrow-up
            3
            arrow-down
            18
            ·
            edit-2
            7 days ago

            Removing books about sucking cum out of anuses from public schools isn’t really “burning books.” You can still buy them whenever you want, just not putting them in taxpayer funded schools with children.

            EDIT: Had to add some details of the “books being burned [but really just removed from public school]”:

            During public comment, one woman read a passage from “Yolo” by Lauren Miracle which is found in Freedom High School.

            “I climbed onto of him and started kissing him in a way that said very clearly here I am, I’m ready to have sex,” the speaker read.

            Another title, “Anatomy of a Single Girl” by Daria Snadowsky, was also read by a speaker.

            “Guy tries rubbing my clitoris with his fingers, he wiggles his pelvis back and forth,” another woman read from the book.

            “This is ridiculous that this school – any school – has this book,” the woman said to the board.

            Julie Gebhards, the woman seen in the first video of our story, is a Hillsborough County mom of six children.

            Gebhards read an excerpt from the book “Invisible Monsters Remix” by Chuck Palahniuk. According to the district’s online book library, the title is found in Steinbrenner High School.

            “He shoots his load, and then plants his mouth on your anus and sucks out his own warm sperm, plus whatever lubricant and feces are present. That’s felching. It may or may not, I add, include kissing you to pass the sperm and fecal matter into your mouth,” Gebhards said.

            • WoodScientist@lemmy.world
              link
              fedilink
              arrow-up
              8
              ·
              7 days ago

              We should ban mention of Christianity in public. We should also make it illegal for anyone to teach their children Christianity. Practicing Christians should be declared mentally ill, and if they practice their faith in front of children, they should be put on the sex offender registry.

              These freaks actually put giant statues of a naked bleeding man up on full public display in buildings. And they believe the most holy book in the world is one that features incest, murder, rape, genocide, and often fully endorses these horrors. Their main ritual is a form of public ritual cannibalism.

              Christians are too dangerous to be allowed near children.

              • lightnsfw@reddthat.com
                link
                fedilink
                arrow-up
                3
                ·
                7 days ago

                As if kids aren’t finding shit way worse on the internet on a daily basis. Well… maybe not felching that’s pretty vile. But still.

            • fibojoly@sh.itjust.works
              link
              fedilink
              arrow-up
              4
              ·
              7 days ago

              Oh no! What have you done! Now, I want to go try felching because I saw a message about it online with no context and I just. Have. To. Try. It.
              … Oh wait, no. No, I don’t. Pfew!

              So what was your point again?