The lawsuit would be a small blow to Meta but an absolutely massive one to open source. Google, Meta and Microsoft are essentially the only companies that can actually afford to pay for this data.
It’s these lawsuits that will pave the way towards a soft monopoly with a limited choice of censored models all behind pricey subscription services.
So your argument is that… I’m having trouble seeing your argument, actually. There’s no way in hell that any company will ever pay for rights to train on books; they will try to find a workaround. The most expensive part currently is the energy, and companies barely stomach that. They’re definitely not going to deal with publishers, who will most definitely charge much, much more than that for what amounts to eternal use of their books. If this lawsuit succeeds, it will kill all training of AI on books. Big companies won’t be excluded.
Google paid $60 million for Reddit’s data. They will pay whatever price the five big publishing houses ask, and they will happily do it because it gives them a monopoly.
Google’s revenue in 2022 was $280 billion. They can easily afford this and aren’t close to “barely stomaching” anything.
Wishful thinking imo.
$60 million is what data costs when it comes from users who have no say over it. The cost will be astronomically higher if it has to be negotiated with actual authors and publishers. The publishers will in no way allow bulk usage; just look at how they treat libraries for an example of this. It will be cost per use in some manner, not a one-time fee or even a yearly fee.