TikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAI

Oct 6, 2024 2:17 AM

https://fortune.com/2024/10/03/bytedance-tiktok-bytespider-scraper-bot/

social_media

tiktok

news

data

current_events

Time to start taking their data lol

10 months ago | Likes 3 Dislikes 0

It's simple, the money they will get from selling the data will cover any costs they rack up in fines.

10 months ago | Likes 1 Dislikes 0

*American corps: HEY! THEY'RE DOING WHAT WE DO BUT BETTER!
*Insert star wars gif "she can't do that"*

10 months ago | Likes 5 Dislikes 2

I'm doing my part of the job with dozens of bots: spreading disinformation.

10 months ago | Likes 1 Dislikes 0

Prove it pony boy

10 months ago | Likes 1 Dislikes 0

The answer can be found above my dear.

10 months ago | Likes 2 Dislikes 0

Somebody’s gonna invent an AI that’s gonna steal everything in everyone’s accounts, including bitcoin, in a fraction of a second.

10 months ago | Likes 1 Dislikes 3

Oops, sorry I meant the AI that AI is going to create from AI.

10 months ago | Likes 1 Dislikes 2

Yeah, it's web hosting 101. Tons of bots out there don't respect robots.txt. You should stay away from common slugs and not add those pages to robots.txt, the sitemap, or XML files.
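
For anyone wondering what "respecting robots.txt" actually means in practice, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `example.com` URLs, and the `is_allowed` helper are made up for illustration; `Bytespider` is the bot named in the linked article. The check is entirely voluntary on the crawler's side, which is why a bot can simply skip it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only (not from any real site).
ROBOTS_TXT = """\
User-agent: Bytespider
Disallow: /

User-agent: *
Disallow: /admin/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    This is the check a polite crawler runs before each request.
    Nothing on the server enforces it.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("Bytespider", "Googlebot"):
        for path in ("/blog/post", "/admin/login"):
            url = "https://example.com" + path
            print(agent, path, is_allowed(agent, url))
```

Note that listing `/admin/` in robots.txt also tells every reader of the file that the path exists, which is the reason for keeping sensitive pages out of it.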

10 months ago | Likes 28 Dislikes 0

It’s a good way to hide from bots, but it will also remove your content from legitimate search engines.

10 months ago | Likes 4 Dislikes 1

But that's the point, to keep that part of the site (like admin login page, logs, analytics, etc...) off everyone's radar.

10 months ago | Likes 1 Dislikes 0

You’re talking about security, which is different than scraping. What you’re suggesting will definitely destroy your SEO. You can password protect or redirect your admin without changing your site maps. Who adds their login page to a site map anyway? No one does that.

AI bots can index and scrape your site. In fact, anything you put on a public-facing website will get indexed and can be scraped. That’s how the internet works. If you don’t want that to happen, then don’t put it on the internet.

10 months ago | Likes 1 Dislikes 0

OpenAI already has all the data, so I'm not sure why the scaremongering. Every AI company has already reached the end of the internet, and they are struggling with feeding on themselves.

Why is one more company doing that a bigger deal?

10 months ago | Likes 16 Dislikes 3

Because CHINA and COMMUNISM! *spooky scary noises*

10 months ago | Likes 6 Dislikes 11

Let’s ask the Uighur people if this is all just blown out of proportion

10 months ago | Likes 5 Dislikes 2

Because it's China. They like to do pretty shady things. And they are doing this data collection just as their app is likely to be banned out of concern that the data will be shared with the Chinese government, so if nothing else, it's shady timing.

10 months ago | Likes 6 Dislikes 1

People are going to call out whataboutism, but the USA has done so much more damage. Millions of dead Iraqis because of a lie. For all the lip service about the US loving democracy and freedom, it has no problem staging coups and installing or supporting dictators as long as that serves and kowtows to its interests. Radio Free Asia, Radio Free Europe, etc.: the US is number one in global propaganda.

10 months ago | Likes 2 Dislikes 0

I don't trust American corporations any more than I trust China.

10 months ago | Likes 1 Dislikes 0

All the big, rich tech companies did (and do) equally shady things. And all are contributing to legislation to make whatever they did illegal for future companies. Which is noble, like abandoning slavery and atom bomb development, but they are keeping for themselves the riches they gathered while doing it.

10 months ago | Likes 1 Dislikes 0

No one is giving up the customer base they built by copying all our address books, and nobody is giving up the AI models they developed in ways they themselves now call unethical. They all got themselves an advantage and are working on making it impossible for competitors to join in.

10 months ago | Likes 1 Dislikes 0