• PKscope@lemmy.world
    link
    fedilink
    English
    arrow-up
    282
    arrow-down
    2
    ·
    5 days ago

    Tackling the problems that really matter. Good job, FBI.

    Fucking clowns.

  • rekabis@lemmy.ca
    link
    fedilink
    English
    arrow-up
    108
    ·
    4 days ago

    The FBI is probably going nuts here because someone inadvertently archived the Epstein files and everyone at HQ is panicking. They need to purge it for the Internet before someone discovers that archived content, and so they’re using CP as an excuse.

  • Balldowern@lemmy.zip
    link
    fedilink
    English
    arrow-up
    137
    ·
    5 days ago

    Why isn’t the FBI doing anything about Epstein island list ? That’s more important than some archive website.

    • conorab@lemmy.conorab.com
      link
      fedilink
      English
      arrow-up
      10
      ·
      4 days ago

      It occasionally catches things that archive.org misses too. Also really nice to have an alternative.

      It’d be nice to have a way of doing decentralised archiving while still keeping the trust. If you’re trying to prove that a site really said something at a certain date to another person, pointing to your own archive is kinda useless.

  • dan1101@lemmy.world
    link
    fedilink
    English
    arrow-up
    26
    ·
    4 days ago

    The news sites are trying to have it both ways. Serving the news articles to visitors and then covering them up with a paywall with browser tricks.

  • Knock_Knock_Lemmy_In@lemmy.world
    link
    fedilink
    English
    arrow-up
    70
    ·
    4 days ago

    The archive runs Apache Hadoop and Apache Accumulo. All data is stored on HDFS, textual content is duplicated 3 times among servers in 2 datacenters and images are duplicated 2 times. Both datacenters are in Europe, with OVH hosting at least one of them.

    To avoid detection, archive.today runs via a botnet that cycles through countless IP addresses, making it quite difficult for grumpy webmasters to stop their sites getting scraped. Access to paywalled sites is through logins secured via unclear means, which need to be replenished constantly: here’s the creator asking for Instagram credentials. Finally, the serving of the website is also subject to a perpetual game of cat and mouse: “I can only predict that there will be approximately one trouble with domains per year and each fifth trouble will result in domain loss.” As of today, archive.today still works, but users are redirected to archive.md.

      • Optional@lemmy.world
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        3
        ·
        4 days ago

        So basically you need to spam me. Because a donation plea every so often . . .doesn’t get enough addresses to sell?

        I’m saying it’s a flawed implementation is all.

        • NotSteve_@piefed.ca
          link
          fedilink
          English
          arrow-up
          26
          ·
          4 days ago

          Purely anecdotal but they’re the only news site that I’ve ever given my email to and I actually enjoy seeing their emails. They send entire (interesting) articles that can be read with no CSS/tracking images enabled and their monetisation is a small text ad that breaks a single couple of paragraphs.

          I’ve never gotten an email from them that was begging for money or anything like that, just basically an RSS feed of interesting articles

        • Prove_your_argument@piefed.social
          link
          fedilink
          English
          arrow-up
          11
          arrow-down
          2
          ·
          4 days ago

          The idea that forcing a signup (building a web of information about a user through the use of cookies and other browser metadata) to protect against AI (that is gonna use tooling, mirrors, proxies and any number of fully working methodologies) is ludicrous.

          They just want to track who you are, what you do, and then sell that data which should never have been gathered in the first place as part of their advertising revenue.

          • DesertCreosote@piefed.blahaj.zone
            link
            fedilink
            English
            arrow-up
            9
            ·
            4 days ago

            Normally I would agree with you, but given how much they care about privacy (as indicated by what they write about and talk about on their podcast), I don’t think tracking is what they’re after in this specific case.

            And they know that the signup won’t completely block AI, but it does help.

    • brbposting@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      13
      ·
      4 days ago

      Softest paywall ever - they do such good work, they can have an anonymous email of mine no problem

      Magic link’s so annoying though, just wanna password (they’re journalists not techies though is the long and short of it)

  • snoons@lemmy.ca
    link
    fedilink
    English
    arrow-up
    80
    arrow-down
    2
    ·
    5 days ago

    Friends of tech Bros Incorporated.

    Regulatory capture is complete in the states.

  • Treczoks@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    3 days ago

    Shouldn’t they focus on the no. 1 law breaker and court ignorer in the country?

  • girlthing@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    31
    ·
    edit-2
    4 days ago

    The owner should release the source code / configuration, in whatever state it’s in, before things escalate further. It’d suck for all their work to go down the drain. I’m sure there’d be people willing to adopt the project and host instances.

    If you agree and you have Tumblr, would you consider asking them anonymously?

    https://blog.archive.today/ask

        • Pup Biru@aussie.zone
          link
          fedilink
          English
          arrow-up
          3
          ·
          edit-2
          3 days ago

          voyager automatically opens links in reader mode for me and it works about 80% of the time

          (but this article it doesn’t work for)

          • Cricket [he/him]@lemmy.zip
            link
            fedilink
            English
            arrow-up
            2
            ·
            3 days ago

            Interesting, my experience with reader mode to get around paywalls is just about the opposite - it works may 20% of the time. Probably different sites that we’re visiting.

    • punkibas@lemmy.zip
      link
      fedilink
      English
      arrow-up
      8
      ·
      4 days ago

      I have JavaScript disabled by default on all pages, I only activate it if I need to, as per the privacyguides recommendations, but on this site at least, it still won’t load the article. If I want to read it I’d have to either register or use the archive.

  • Broadfern@lemmy.world
    link
    fedilink
    English
    arrow-up
    30
    ·
    5 days ago

    That would explain why adguard’s public DNS started blocking it (labeled vaguely as “legal request”).