
TheRainbowegoSweet007
2206
74
6

TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI
https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/

Oct 6, 2024 2:17 AM
TheRainbowegoSweet007
2206
74
6
TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI
https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/
lucivjov
Time to start taking their data lol
zanaria
It's simple, the money they will get from selling the data will cover any costs they rack up in fines.
KarateCanine
*American corps: HEY! THEYRE DOING WHAT WE DO BUT BETTER!
*Insert star wars gif "she can't do that"*
We11thatwasC00L
Zymo
Im doing my part of the job with dozens of bots: spreading disinformation.
UhSadHJ84
Prove it pony boy
Zymo
The answer can be found above my dear.
thisiswhyicanthaveanythingnice
Somebody’s gonna invent an AI that’s gonna steal everything in everyone’s accounts, including bitcoin, in a fraction of a second.
thisiswhyicanthaveanythingnice
Oops, sorry I meant the AI that AI is going to create from AI.
sleete
Yea.. it's webhosting 101. Tons of bots out there that don't respect robots.txt. Should stay away from common slugs and don't add those pages to robot.txt, sitemap, and xml.
SquidBaitBadgerDroid
It’s a good way hide from bots but it will also remove your content from legitimate search engines
sleete
But that's the point, to keep that part of the site (like admin login page, logs, analytics, etc...) off everyone's radar.
SquidBaitBadgerDroid
You’re talking about security, which is different than scraping. What you’re suggesting will definitely destroy your SEO. You can password protect or redirect your admin without changing your site maps. Who adds their login page to a site map anyway? No one does that.
AI bots can index and scrape your site. In fact anything you put on a public facing website will get indexed and can be scraped. That’s how the internet works. If you don’t want that to happen then don’t put it on the internet
mrthewhitee
Open.ai already has all the data, so not sure why the scaremongering. Every AI company has already reached the end of the internet and are struggling with feeding on themselves.
Why is one more company do that a bigger deal?
GoddessPurpleFrost
Because CHINA and COMMUNISM! *spooky scary noises*
yamamasyamaha
Let’s ask the Uighur people if this is all just blown out if proportion
RedWingedBlackbirds
https://en.m.wikipedia.org/wiki/1989_Tiananmen_Square_protests_and_massacre
Fritzy19
Because it's China. They like to do pretty shady things. And are doing the data collection as their app is likely to be banned out of concern the data will be shared with the Chinese government, so if nothing else, its shady timing.
RuijiRiku
People are going to call out whataboutism but USA has done so much more damage. Millions of dead Iraq's because of a lie. For all lipservice that the US loves democracy and freedom, has no problem couping countries and installing/supporting dictators as long as it serves and kowtows to it's interests. Radiofree Asia, Radio Free Europe etc, the US is number one in global propaganda.
mrthewhitee
I don't trust American corporations any more than I trust China.
idonthaveauser
All the big, rich tech company did (and do) equally shady thing. And all are contributing to legislation to make whatever they did illegal for future companies. Which is noble, like abandoning slavery and atom bomb bevelopment, but they are keeping the riches they gathered while they did it for themselves.
idonthaveauser
No one is giving op the customer base they built by copying all our address books, nobody is giving up their AI model they developed in ways they themselves now advocate of being unethical ways. They all got themselves an advantage and are working on making it impossible for competitors to join in.