The Internet Archive just lost its appeal over ebook lending

fossilesque@mander.xyz · edit-2 6 months ago

The Internet Archive just lost its appeal over ebook lending

Grass@sh.itjust.works · 6 months ago

what does warrior do? The git readme seems to just be setup instructitons

fossilesque@mander.xyz · edit-2 6 months ago

click wiki link in readme: https://wiki.archiveteam.org/index.php?title=ArchiveTeam_Warrior

zzx@lemmy.world · 6 months ago

I had the same question. Here’s the answer:

The Archive Team Warrior is a virtual archiving appliance. You can run it to help with the Archive Team archiving efforts. It will download sites and upload them to our archive—and it’s really easy to do!

The warrior is a container running inside a virtual machine, so there is almost no security risk to your computer. (“Almost”, because in practice nothing is 100% secure.) The warrior will only use your bandwidth and some of your disk space, as well as some of your CPU and memory. It will get tasks from and report progress to the Tracker.

antonim@lemmy.dbzer0.com · 6 months ago

Yeah I’m wondering as well. It seems to save webpages, whereas the issue is with scanned books which may be removed from IA…

ZILtoid1991@lemmy.world · 6 months ago

They need to rename themselves “Intelligent Archive” then claim they’re an AI service that can just happen to regenerate whole books.

masterspace@lemmy.ca · 6 months ago

Fuck Copyright.

A system for distributing information and rewarding it’s creators should not be one based on scarcity, given that it costs nothing to copy and distribute information.

snooggums@midwest.social · 6 months ago

It was fine when the limited duration was a reasonable number of years. Anything over 30 years max before being in the public domain is too long.

Fuzzy_Red_Panda@lemm.ee · 6 months ago

Yeah. In a better world where the US court system doesn’t get weaponized and rulings aren’t delayed for years or decades, I would argue 8 to 15 years is the reasonable number, depending on the type of information being copyrighted.

drislands@lemmy.world · 6 months ago

My understanding is that the IA had implemented a digital library, where they had (whether paid or not) some number of licenses for a selection of books. This implementation had DRM of some variety that meant you could only read the book while it was checked out. In theory, this means if the IA has 10 licenses of a book, only 10 people have a usable copy they borrowed from the IA at a time.

And then the IA disabled the DRM system, somehow, and started limitlessly lending the books they had copies of to anyone that asked.

I definitely don’t like the obnoxious copyright system in the USA, but what the IA did seems obviously wrong. Like if your local library got a copy of Book X and then when someone wanted to borrow it they just copied it right there and let you keep the copy.

finitebanjo@lemmy.world · 6 months ago

Wrong? No.

Against the terms of agreements they made? Yes.

Actions also protected by laws exempting nonprofits and archives from copyright restrictions? Also supposed to be yes.

drislands@lemmy.world · 6 months ago

Against the terms of agreements they made? Yes.

To be fair, this is what I meant when I said wrong. Enough people have taken umbrage with my wording that I think I should update it, though. Thank you for your reply.

eskimofry@lemmy.world · 6 months ago

Like if your local library got a copy of Book X and then when someone wanted to borrow it they just copied it right there and let you keep the copy.

That’s how it works in the rest of the world.

dave@feddit.uk · 6 months ago

What part of the rest of the world are you in?

MonkderVierte@lemmy.ml · 6 months ago

Some university library probably.

MrScottyTay@sh.itjust.works · 6 months ago

They disabled drm during lockdown so people had something to do

accideath@lemmy.world · 6 months ago

Which was nice of them, but that doesn’t mean they should’ve done that, especially in the eyes of the law. (Also, if you’re after free ebooks, why are you pirating them on archive.org instead of libgen?)

CondensedPossum@lemmy.world · 6 months ago

“especially in the eyes of the law” bootlicking loser, get lost

accideath@lemmy.world · 6 months ago

Where did I say that find it good that they got sued or lost their appeal? I just said that the reason why they lost the appeal is because according to the law they’re bound to, what they did was wrong. And maybe they should’ve left that to a platform that enjoys a little more immunity from said law, because there are plenty of those. It was stupid of them. They painted an unnecessary target on their back that doesn’t help their cause and I‘d prefer them not to have to shut down at some point because I’m all for the Internet archive archiving anything and everything. They should’ve stayed a legitimate library and everything would have been fine and would have served their cause sufficiently well.

CondensedPossum@lemmy.world · 6 months ago

Thank you for fully documenting your bootlicking loser logic so that others can shudder at your fate

accideath@lemmy.world · 6 months ago

Ah, so you‘re one of those people that would be well at home at lemmygrad. And what fate are you talking about? Not getting sued?

CondensedPossum@lemmy.world · 6 months ago

The time for arguing with people like you is over. If you want to mewl about “wrong” when these corpos and states are trying to restrict our access to culture, fine. Keep crying. You aren’t a contributor, you aren’t in charge, go cry.

huiccewudu@lemmy.ca · edit-2 6 months ago

I definitely don’t like the obnoxious copyright system in the USA, but what the IA did seems obviously wrong.

The publisher-plaintiffs did not prove the “obvious wrong” in this case, however US-based courts have a curious standard when it comes to the application of Fair Use doctrine. This case ultimately rested on the fourth, most significantly-weighted Fair Use standard in US-based courts: whether IA’s digital lending harmed publisher sales during the 3-month period of unlimited digital lending.

Unfortunately, when it comes to this standard, the publisher-plaintiffs are not required to prove harm, rather only assert that harm has occurred. If they were required to prove harm they’d have to reveal sales figures for the 27 works under consideration–publishers will do anything to conceal this information and US-based courts defer to them. Therefore, IA was required to prove a negative claim–that digital lending did not hurt sales–without access to the empirical data (which in other legal contexts is shared during the discovery phase) required to prove this claim. IA offered the next best argument (see pp. 44-62 of the case document to check for yourself), but the data was deemed insufficient by the court.

In other words, on the most important test of Fair Use doctrine, which this entire case ultimately pivoted upon, IA was expected to defend itself with one arm tied behind its back. That’s not ‘fair’ and the publishers did not prove ‘obvious’ harm, but the US-based courts are increasingly uninterested in these things.

edited: page numbers on linked court document.

azuth@sh.itjust.works · 6 months ago

The decision is that even lending out ebooks against owned copies is illegal

What the IA may be illegal but is certainly not wrong.

bitwolf@lemmy.one · 6 months ago

Easy solution. Update the web-scraper they use to include an LLM. Then its for “training”

xenoclast@lemmy.world · 6 months ago

As long as they have a tech billionaire in charge they should be fine.

They could also rename the project to: “The AI Archive” and add lots of buttons with multicolor gradients.

bitwaba@lemmy.world · 6 months ago

Need to give it a quirky name.

The AIkive

JackGreenEarth@lemm.ee · 6 months ago

Another sad day for pro-preservation advocates

/home/pineapplelover@lemm.ee · 6 months ago

A sad day for intellects

DrCake@lemmy.world · 6 months ago

So when’s the ruling against OpenAI and the like using the same copyrighted material to train their models

index@sh.itjust.works · 6 months ago

stop asking questions and go back to work

norimee@lemmy.world · 6 months ago

Ah, I see you got that all wrong.

Open IA uses that content to generate billions in profit on the backs of The People. The Internet Archive just does it for the good of The People.

We can’t have that. “Good for The People” is not how the economy works, pal. We need profit and exploitation for the world to work…

RogueBanana@lemmy.zip · 6 months ago

“Good for the people”? You mean COMMUNISM?

finitebanjo@lemmy.world · 6 months ago

I think you accidentally swapped OpenAI and Open IA which happens to initialize Internet Archive, a little confusing.

norimee@lemmy.world · 6 months ago

I didn’t even realise. Thank you for pointing it out, I fixed it.

v_krishna@lemmy.ml · 6 months ago

OpenAI is burning billions of dollars not making profit.

buddascrayon@lemmy.world · 6 months ago

Wrong

https://futurism.com/the-byte/openai-copyrighted-material-parliament

v_krishna@lemmy.ml · 6 months ago

Eh? That article says nothing about their profit margins. Today they have something like $3.5B in ARR (not really, that’s annualized from their latest peak, in Feb they had like $2B ARR). Meanwhile they have operating costs over $7B. Meaning they are losing money hand over fist and not making a profit.

I’m not suggesting anything else, just that they are not profitable and personally I don’t see a road to profitability beyond subsidizing themselves with investment.

buddascrayon@lemmy.world · 6 months ago

It’s in the first bloody paragraph. 😮‍💨

OpenAI is begging the British Parliament to allow it to use copyrighted works because it’s supposedly “impossible” for the company to train its artificial intelligence models — and continue growing its multi-billion-dollar business — without them.

And if you follow the link the title of the article says it all:

#OpenAI is set to see its valuation at $80 billion—making it the third most valuable startup in the world

dan@upvote.au · edit-2 6 months ago

Just because the company has a high valuation, doesn’t mean they’re making a profit. They’re indeed losing a lot of money and will go bankrupt if they don’t get new investment and/or increase their ARR soon. Right now, they’ve only got 12 months left before they’re out of money. https://www.windowscentral.com/software-apps/openai-could-be-on-the-brink-of-bankruptcy-in-under-12-months-with-projections-of-dollar5-billion-in-losses

buddascrayon@lemmy.world · 6 months ago

The valuation is based on the expectation of the company to make massive profits. And if you think investor money is not profit for the people running Open AI, you’re crazy. We could only hope that they run out of money and go out of business. But that’ll never happen now with the amount of faith these corporations are putting in “AI” research.

Agret@lemmy.world · 6 months ago

Sounds like they are operating the same as all the other big tech companies then

ShaggySnacks@lemmy.myserv.one · 6 months ago

Burn a ton a cash to become the only major player in the market and the proceed to enshitify as no one else has anywhere to go.

irotsoma@lemmy.world · 6 months ago

But OpenAI not being allowed to use the content for free means they are being prevented from making a profit, whereas the Internet Archive is giving away the stuff for free and taking away the right of the authors to profit. /s

Disclaimer: this is the argument that OpenAI is using currently, not my opinion.

Anyolduser@lemmynsfw.com · 6 months ago

Hot on the heels of this one, I’d imagine.

wizblizz@lemmy.world · 6 months ago

Aaaaaany minute now.

shrugs@lemmy.world · 6 months ago

So, let’s say we create an llm that will be fed will all the copyrighted data and we design it, so that it recalls the originals when asked?! Does that count as piracy or as the kind of legal shananigans openai is doing?

iAmTheTot@sh.itjust.works · 6 months ago

Fat chance. Line must go up.

Lettuce eat lettuce@lemmy.ml · 6 months ago

Artificial scarcity at its finest. Imagine recording a song digitally, then pretending there are a limited amount of copies of that song in existence. Then you sell an agreement to another person that says they have to pretend there is only a certain made up number of copies that they bought, and if they allow more than that number of people to listen to those copies at rhe same time, they will get sued for “stealing” additional pretend copies?

I hope everybody can see how this is the insane and pathetic result of Capitalism’s unrelenting drive to commodify everything it possibly can in the pursuit of profit.

As always, the solution is sailing the high seas. Throughout history, those who created or saved illegal copies/translations of literature and art were important to preserving and furthering human knowledge.

Many incredibly powerful people, empires, and countries have tried very hard to suppress that, but they keep failing. You cannot suppress the human drive for curiosity and knowledge.

Ming@lemmy.dbzer0.com · 6 months ago

True, and the fleet is big and strong. There are many people seeding hundreds of terabytes of books/research papers/etc. The knowledge will not be lost. Yarr, can’t catch me in the high seas…

Redruth@feddit.nl · 6 months ago

sharing is theft

AngryCommieKender@lemmy.world · 6 months ago

Hoo, boy. You’re not gonna have a good time on Lemmy if you really believe that.

xenoclast@lemmy.world · 6 months ago

It’s ok. They don’t.

AngryCommieKender@lemmy.world · 6 months ago

I wasn’t gonna go digging on a less than a day old account.

Redruth@feddit.nl · 6 months ago

No really. I mean it. sharing is actually theft.

Frost-752@lemmy.world · 6 months ago

Quite the take

fpslem@lemmy.world · 6 months ago

Not a surprise, but still somehow crushing. It’s a loss for us all.

HexesofVexes@lemmy.world · 6 months ago

Ah, I see we’re burning the Library of Alexandria again… Just as with last time, the survival of texts will rely upon copies.

DancingBear@midwest.social · edit-2 6 months ago

In the future, armed with burning pencil writing fingers, books will be scanned and photographed, page by page. Before they are read.

metaStatic@kbin.earth · 6 months ago

“We are reviewing the court’s opinion and will continue to defend the rights of libraries to own, lend, and preserve books.”

Unpopular opinion: They stepped out of their fucking lane. There are already laws that protect actual libraries, in fact most nations have laws to ensure libraries have access to all locally published works.

One good thing to come of this is I’ve now joined my national and local libraries.

SkaveRat@discuss.tchncs.de · 6 months ago

Agreed. While a noble cause, it was honestly predictable.

I don’t understand why they did that. Their status was already quite shaky. They really shot themselves and their users in the foot

fossilesque@mander.xyz · 6 months ago

Direct link to the court document: https://storage.courtlistener.com/recap/gov.uscourts.ca2.60988/gov.uscourts.ca2.60988.306.1.pdf

Aatube@kbin.melroy.org · 6 months ago

Really unfortunate. I wonder why nobody foresaw this when they started the stupid NEL thing.

AlexWIWA@lemmy.ml · 6 months ago

Yeah they flew too close to the sun