I have backups on a backup hard drive and also synced to B2, but I am thinking about backing up to some format to put in the cupboard.

The issue I see is that if I don’t have a catastrophic failure and instead just accidentally delete some files one day while organising and don’t realise, at some point the oldest backup state is removed and the files are gone.

The other thing is if I get hit by a bus and no one can work out how to decrypt a backup or whatever.

So I’m thinking of a plain old unencrypted copy of photos etc that anyone could find and use. Bonus points if I can just do a new CD or whatever each year with additions.

I have about 700GB of photos and videos which is the main content I’m concerned about. Do people use DVDs for this or is there something bigger? I am adding 60GB or more each year, would be nice to do one annual addition or something like that.

  • schizo@forum.uncomfortable.business
    link
    fedilink
    English
    arrow-up
    20
    ·
    edit-2
    3 days ago

    I’m using blu-ray disks for the 3rd copy, but I’m not backing up nearly as much data as you are.

    The only problem with optical media is that you should only expect it to be readable for a couple of years, best case, at this point and probably not even that as the tier 1 guys all stop making it and you’re left with the dregs.

    You almost certainly want some sort of tape option, assuming you want long retention periods and are only likely to add incremental changes to a large dataset.

    Edit: I know there’s longer-life archival optical media, but for what that costs, uh, you want tape if at all possible.

    • shyguyblue@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      ·
      3 days ago

      Same.

      Bought a Blu-ray burner and “archive grade” disks for third location backups.

      I made a list of files that is just a text document (3MB!) that sits on the root of the Blu-ray. There’s probably a better way of doing that, but it works for me.

    • Dave@lemmy.nzOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      3 days ago

      Hmm I am keen for something that could be left in the cupboard for 50 years and still works when brought out.

      What does it take me to do home tape storage? Do the tapes needs to be stored with climate control or are they pretty stable? Is it feasible for the average person to load the contents?

      I’m thinking of pulling a suitcase out of the cupboard of all the baby photos, but digital files or photo and video.

      • schizo@forum.uncomfortable.business
        link
        fedilink
        English
        arrow-up
        7
        ·
        3 days ago

        So, 50 years isn’t a reasonable goal unless you have a pretty big budget for this. Essentially no media is likely to survive that long and be readable unless they’re stored in a vault, under perfect climate controlled conditions. And even if the media is fine, finding an ancient drive to read a format that no longer exists is not a guaranteed proposition.

        You frankly should be expecting to have to replace everything every couple of years, and maybe more often if your routine tests of the media show it’s started rotting.

        Long term archival storage really isn’t just a dump it to some media and lock it up and never look at ever again.

        Alternately, you could just make someone else pay for all of this, and shove all of this to something like Glacier and make the media Amazon’s problem. (Assuming Amazon is around that long and that nothing catches fire.)

        • Dave@lemmy.nzOP
          link
          fedilink
          English
          arrow-up
          2
          ·
          3 days ago

          Hmm damn. I don’t really think cloud is the right answer for what I’m trying to do.

          I disagree that formats like JPEG won’t be readable in 50 years. I feel like there would be big demand for being able to read the format even if it’s been superceded, on account of all the JPEGs that still living people have.

          Maybe I get a big drive. Each year I copy over files from the last year. Every X years I swap the hard drive for a new one, copy all data.

          How can I tell if individual files get corrupted? Like the hard drive failed in that section, then I copy the corrupted file to the new drive, and I’d never know. Can I test in bulk? 50k+ photos and videos so far.

          • sugar_in_your_tea@sh.itjust.works
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            2 days ago

            How can I tell if individual files get corrupted?

            Checksums. A good filesystem will do this for you, but you can do it yourself if you want.

            If you sync a drive with rsync or something periodically, it’ll replace files with different checksums, fixing any corruption as you go. Then smart tests should tell you if there’s any corruption the drive is aware of. I’m sure automated backup tools have options for this.

            • Dave@lemmy.nzOP
              link
              fedilink
              English
              arrow-up
              1
              ·
              2 days ago

              I specifically don’t want to be touching previous files on the drive, it should be addition only. So I may need to write a script to do the checks, or compare against a mirror drive. I can do this with the right filesystem, but I’m worried that if I use a filesystem not readable by Windows then it may not be layman-proof enough.

              • sugar_in_your_tea@sh.itjust.works
                link
                fedilink
                English
                arrow-up
                2
                ·
                2 days ago

                Then I’d go with FAT on a USB, which should be plenty portable into the future. You’ll want to replace it every 5-10 years, and check on it every other year or so.

                That’s about as easy to use as I can think of. Decades down the road, physical media like DVDs and tapes may be difficult to find readers for, but USB is versatile enough that someone is bound to have access. Micro SD cards may also be a good option, as long as you keep a couple USB readers around.

                • Dave@lemmy.nzOP
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  2 days ago

                  I have a terrible track record with USB sticks, including completely losing a stack of photos because of a USB stick.

                  I’m now thinking the benefits of a nice error-correcting file system probably outweigh the benefits of using a widely supported one. So I might use a pair of mirrored hard drives with SATA->USB cable, then include instructions along the lines of “plug into my linux laptop to access, or take to a computer repair show if you can’t work it out”.

                  • sugar_in_your_tea@sh.itjust.works
                    link
                    fedilink
                    English
                    arrow-up
                    2
                    ·
                    1 day ago

                    I’m not talking about USB sticks, I’m talking about USB drives, like a HDD or SSD. If you want to go with flash memory, I recommend SD cards because they’re small and cheap, so keeping a few copies isn’t particularly burdensome.

                    I wouldn’t trust any of these options to last a long time on a shelf though. Check them every year or two and replace every 5-10 years, maybe a little longer if you buy higher quality.

                    So I might use a pair of mirrored hard drives with SATA->USB cable, then include instructions along the lines of “plug into my linux laptop to access, or take to a computer repair show if you can’t work it out”.

                    That’s basically what I’m planning too. But my use case is disaster recovery, basically as a cheaper alternative to paying for hosted backup for important, but recoverable information (e.g. ripped media). Everything truly important (pictures and documents) goes to hosted backup as well.

                    I’m largely relying on documents explaining how to access the backups. If I pass, I expect my survivors to either figure it out themselves or hire someone who can figure it out from my documentation.

              • Krik@lemmy.dbzer0.com
                link
                fedilink
                English
                arrow-up
                2
                ·
                2 days ago

                Any file systems Windows can read out-of-the-box are no good file systems. What Windows read? FAT and NTFS. Former is so basic it has no mechanisms to detect errors and bitrot and the later one is a mess.
                You should stick to ext4, btrfs and zfs.

                If you want to make if fool-proof then add a sticker with ‘bring me to a computer shop to access my content’.

                • Dave@lemmy.nzOP
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  2 days ago

                  I have considered that exact message. It does seem making it easily plug and play may be out of the question if I want the error correction capabilities.

                  • Krik@lemmy.dbzer0.com
                    link
                    fedilink
                    English
                    arrow-up
                    2
                    ·
                    2 days ago

                    Btrfs and zfs are self-healing.

                    You can make a script to check for errors and autocorrection yourself but that needs at least a second hdd. On both drives are the same data and a file or database with the checksums of the data. The script then compares the actual checksums of the two copies and the db checksum. If they match -> perfect. If they don’t match the file where there are two matching checksum is the good one and replaces the faulty one or corrects the db entry, whichever is defect. That’s it. It doesn’t have to be more complicated.

          • schizo@forum.uncomfortable.business
            link
            fedilink
            English
            arrow-up
            7
            ·
            3 days ago

            The format is the tape in the drive, or the disk or whatever.

            Tape existed 50 years ago: nothing modern and in production can read those tapes.

            The problem is, given a big enough time window, the literal drives to read it will simply no longer exist, and you won’t be able to access even non-rotted media because of that.

            As for data integrity, there’s a lot of options: you can make a md5 sum of each file, and then do it again and see if anything is different.

            The only caveat here is you have to make sure whatever you’re using to make the checksums gets stored somewhere that’s not JUST on the drive because if the drive DOES corrupt itself, and your only record of the “good” hashes is on the drive, well, you can’t necessarily trust those hashes either.

            • Dave@lemmy.nzOP
              link
              fedilink
              English
              arrow-up
              1
              ·
              2 days ago

              Ah good thinking. I am thinking a spare drive that I update once a year with new content and replace every few years with a new drive is a good idea.

              • schizo@forum.uncomfortable.business
                link
                fedilink
                English
                arrow-up
                2
                ·
                2 days ago

                That could probably work.

                Were it me, I’d build a script that would re-hash and compare all the data to the previous hash as the first step of adding more files, and if the data comes out consistent, I’d copy the files over, hash everything again, save the hash results elsewhere and then repeat as needed.

                • Dave@lemmy.nzOP
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  2 days ago

                  Yeah I think I should do something like this. I really want to make sure the files are not getting corrupted in storage without me knowing.

      • Moonrise2473@feddit.it
        link
        fedilink
        English
        arrow-up
        2
        ·
        2 days ago

        Tape isn’t readable by normal people even if they found it tomorrow with a drive already configured to be used.

        In 50 years good luck finding a working drive compatible with LTO4 when LTO32 is out (it’s backwards compatible only with previous gen).

        Unless you write on the box “here there are the keys for 100k bitcoins” they’ll just trash the tape

        • Dave@lemmy.nzOP
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 days ago

          Yeah from some other comments I think my initial plan (that I’ll research some more) will be:

          • buy a new HDD, format with ZFS or btrfs for error correction
          • copy data onto drive
          • store in cupboard with sata-> USB cable and instructions about what it is, how to access .
          • every year, load the previous year’s data onto the drive
          • about every 5 years, replace the drive by copying onto a brand new one (timeframe will likely depend on when my other HDD drives die)

          This way I should get a chance to update storage medium as technology changes as well.

            • Dave@lemmy.nzOP
              link
              fedilink
              English
              arrow-up
              1
              ·
              20 hours ago

              Yeah since then I’ve been convinced I need two drives mirrored under zfs, which should handle that scenario.

              • philpo@feddit.org
                link
                fedilink
                English
                arrow-up
                2
                ·
                17 hours ago

                Hard drives loose their data fast if not powered (within a few years),so do SSD based media. Furthermore the former are very suspectable to mechanical destruction, electromagnetic interference,etc. And even if for some reason your drives last that long there will be nothing to connect them to - you know how we connected hard drives 25 years ago? Via SCSI/IDE. Good luck finding a converter to these now. If you go back further you need ISA controllers for the drives.

                This is a really bad idea. Really really bad, especially with the goal you want to achieve. Your data will be gone within 5 to 10 years.

                • Dave@lemmy.nzOP
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  16 hours ago

                  The idea is that I’d swap out drives every 5 years or so. If USB A is no longer in use I’d swap out at that point for something newer. Plus the drives would be powered on every year for the update, it’s just the point that I stop doing it (too old/hit by bus/etc) that the clock would start ticking.

                  I do like the M-Disc idea though. Probably a similar price, and more in line with the shelf-stable solution I was looking for.

                  • philpo@feddit.org
                    link
                    fedilink
                    English
                    arrow-up
                    2
                    ·
                    8 hours ago

                    There are still problems with the hard drive solutions:

                    • Powering up the drives for a short period does not help with error correction when sectors get compromised

                    • As said before it is relatively risky as mechanical parts of HDs do not like to be moved only occasionally. While this problem has become less severe over the last years it still exists.

                    • The updating will include copying from one drive to another - this process is highly suspectable to errors that might be correct with the right file systems - but it’s not a guarantee.

                    • And the main problem: You want to achieve a long shelf life - which means you must consider periods of time when you might not be able to maintain the data. What happens when you are not able to do so? And your next of kin are not quite ready to go through your things? To give you an example: You copy your data on the HDs today, maintain the disk’s for four years and want to change disk’s in 5, which means in 2030. Sadly a weeks before you are able to do so, John,your neighbourhood’s stupid school bus driver hits you and you suffer a major traumatic brain injury. Even worse,you don’t die right away but suffer for another 5 years in a nursing home before a infection gets you. Your family meanwhile is not quite ready to get through your things as you are still alive, aren’t you? (For real,this is the case a lot) After your funeral it takes them another year to finally get through all your things. Now your drives haven’t been used for 7 years. Even worse,one of them slips through your next of kind hand and hits the ground hard. How big do you think the chances are the data is still available? I think we both know the answer. While M-Disks are also suspectable to damage there are hardened multi-disk cases that make them pretty much indestructible - nothing any HD case can ever achieve.

          • Moonrise2473@feddit.it
            link
            fedilink
            English
            arrow-up
            4
            ·
            2 days ago

            you need to use fat32 if you want normal people to access the files

            Otherwise, they will get the “You need to format the disk in drive D: before using it. Do you want to format it?” dialog, they blindly click “yes”, then they will mumble to themselves “weird, he left behind a massive collection of blank drives…”

            • PassingThrough@lemm.ee
              link
              fedilink
              English
              arrow-up
              2
              ·
              2 days ago

              This is why I can’t/don’t have a lot of the “best practices” in my family archive. I’m not encrypting local drives, I’m not using BTRFS, or a ZFS pool. If I did I’d have to ensure my Will provided for the lawyer to hire a tech shop to help recover them. No, exFAT and NTFS, in the clear so those left behind can just plug them in and get to making their own copies. Otherwise the archive would die with me.

              Does that mean someone could steal my drives and go through my family photos? Sure. I hope it brings them much guilt, something a garbled encrypted drive could never do.

            • Dave@lemmy.nzOP
              link
              fedilink
              English
              arrow-up
              1
              ·
              2 days ago

              Oh shit you’re right. Argh ok I’m going to have to rethink that. Two drives and something to compare against each other to check for errors. I’m not sure about FAT32 as there are some multi-GB video files. Shit.

              • PassingThrough@lemm.ee
                link
                fedilink
                English
                arrow-up
                2
                ·
                2 days ago

                exFAT is a newer and viable alternative to FAT32, with better size limits and some pretty good cross-platform capabilities. That said, if your primary access is through Windows, NTFS may have some better features and is at least read-only on other platforms.

                • Dave@lemmy.nzOP
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  2 days ago

                  I don’t use Windows, I’m just thinking of someone needing to be able to pick up and use a drive, and for most people it’s going to be Windows.

                  Maybe I just need to leave instructions that specify it needs to be my laptop they use to get the photos off.