These are all the torrents currently managed and released by Anna’s Archive. For more information, see “Our projects” on the Datasets page. For Library Genesis and Sci-Hub torrents, the Libgen.li torrents page maintains an overview.

These torrents are not meant for downloading individual books. They are meant for long-term preservation.

Torrents with “aac” in the filename use the Anna’s Archive Containers format. Torrents that are crossed out have been superseded by newer torrents, for example because newer metadata has become available. Some torrents that have messages in their filename are “adopted torrents”, which is a perk of our top tier “Amazing Archivist” membership.

You can help out enormously by seeding torrents that are low on seeders. If everyone who reads this chips in, we can preserve these collections forever. This is the current breakdown:

Status Torrents Size Seeders
🔴 54 154.0TB <4
🟡 183 92.5TB 4–10
🟢 111 17.2TB >10

IMPORTANT: If you seed large amounts of our collection (50TB or more), please contact us at AnnaArchivist@proton.me so we can let you know when we deprecate any large torrents.

  • DengueDucky@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 months ago

    It’s unclear to me how these torrents are used. If individual books are not downloaded from them, is this only to make it possible to create similar sites in the future, in case this one is taken down?

    • empireOfLove@lemmy.one
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      8 months ago

      They are meant for long-term preservation.

      This is basically a “distributed backup” of the entire database. The torrents are not actively serving files- they’re there to store multiple copies of the main database across the globe so that the entire database can be recovered (by anyone with the requisite knowledge, mind you) in the event that something happens to the original Anna’s Archive team or the main database is lost/seized by “law enforcement”.

      It’s equivalent to how backup managers in ye olden days would make broken up piece files of a certain size that could fit onto a CD or DVD, so you could fit the entire contents of a large 20+GB hard drive onto multiple smaller media. The backup itself is not accessed unless your main hard drive crashes, in which case you reassemble all the individual pieces back into your complete OS environment after replacing the hard drive.

  • onlinepersona@programming.dev
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 months ago

    Bruh, this is a terrible way to share this. Why not torrents of the raw material in predefined categories that won’t change? Like “1984 - Sci-Fi - English - A-N”, “1984 - Sci-Fi - English - O-Z”, “1990 - Biology”, “2012 - Physics”. Then people would actually even download this to use it themselves, instead of some archive that has to be extracted and will take a multitude of the space again.

    The hell am I going to do with a 300GB archive file that I cannot even look into? I might as well be storing an encrypted blob 300GB large or just reducing the size of my partition by 300GB.

    It’s great that people want to preserve human knowledge, but there surely are better ways to do this.

      • WallEx@feddit.de
        link
        fedilink
        English
        arrow-up
        3
        ·
        8 months ago

        This is just bad communication, beating down on people that are delivering constructive criticism.

      • onlinepersona@programming.dev
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 months ago

        Way to gatekeep. Don’t you think it would be better if more people could contribute bandwidth and storage with what they have instead of buy a new hardrive? Wouldn’t you want more redundancy, instead of less?

        • fatboy93@lemm.ee
          link
          fedilink
          English
          arrow-up
          2
          ·
          8 months ago

          I don’t think they use an indexable compression as well, right? That essentially kills stuff for me.

          The easiest way to host is not TB/PB sized archives but indices and slices for those.

          It easier for a lot of us to download a few gigs and share that, rather than download TB/PB sized archives.

    • Yote.zip@pawb.social
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 months ago

      The torrents are broken up into smaller pieces - don’t be intimidated by the big TB numbers from the sum total. Otherwise donations are always useful.

  • Adam@doomscroll.n8e.dev
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 months ago

    It seems the majority of the torrents with poor seeder count are in the 1.5TB+ range. I just simply don’t have the storage for that. Most everything in the 0-300GB range is pretty well covered.

    • Appoxo@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 months ago

      Agreed. I’d like to share a bit of disk storage but I only have 2 TB and I need that for my own consumption.
      Give us smaller torrents (e.g. 50GB parts) instead.

      • MachineFab812@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        5 months ago

        Yes. When downloading the torrent, go to the files list for that torrent and un-check boxes. Resist the urge to leave only the first files in the list(as most people will do this leaving few-if-anyone seeding the rest), **and try instead to grab files from the middle or end, or just be random about it.

        When the torrent finishes downloading the files you’ve selected, it will automatically seed those portions of the torrent which you have downloaded.

        EDIT: I just remembered, some torrent programs will actually show you the seed ratio per file in the torrent. There are reasons hardly anyone is (sincerely) trying to reinvent this wheel.

        • interdimensionalmeme@lemmy.ml
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          1
          ·
          edit-2
          8 months ago

          This is too much work, someone automate this with the smartest algorithm and I will preserve with the power of 10x 2tb 10gbe seedbox.

          Automatically auto seed on a rotation the 10% least available file of each torrent Percentage should be adjustable globally and per torrent.

  • Bread@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    0
    ·
    8 months ago

    I have a spare 100 or so Terabytes and I can fit roughly another quarter petabyte in my server. I would like to help. I will look more into this potentially tomorrow when I have some free time. The preservation of knowledge is too important.

      • Bread@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        8 months ago

        I have a few questions as you appear to be part of the archive or at least very familiar with it.

        Roughly how often are the archives updated?

        Do you guys already have a proper backup method or are your seeds acting as that backup?

        Any idea realistically how much bigger the archive can get data wise in the next few years? Estimates or educated guesses are fine. I want to know how much I need to plan in advance.

        If I take the whole archive, must I deploy it or can it be searched through if I have the whole thing and I want something specific out of it?