Disrupting the largest residential proxy network

progbits · 2026-01-30T22:36:34 1769812594

I'm surprised by the negative takes...

Yes, proxies are good. Ones which you pay for and which are running legitimately, with the knowledge (and compensation) of those who run them.

Malware in random apps running on your device without your knowledge is bad.

throwoutway · 2026-01-31T03:08:46 1769828926

> Malware in random apps running on your device without your knowledge is bad.

And ones that have all the indicators of compromise of Russia, Iran, DPRK, PRC, etc

bdcravens · 2026-01-30T23:41:27 1769816487

Many are "compensated" (in the way of software they didn't pay for), so the real question is that of disclosure (in which case many software vendors check the box in the most minimal way possible by including it as fine print during the install)

happyopossum · 2026-01-31T00:31:36 1769819496

No, the question is not just disclosure. People have their bandwidth stolen, and sometimes internet access revoked due to this kind of fraud and misuse - disclosure wouldn’t solve that

the_fall · 2026-01-31T00:59:18 1769821158

Also, as a website owner, these residential proxies are a real pain. Tons and tons of abusive traffic, including people trying to exploit vulnerabilities and patently broken crawlers that send insane numbers of requests, and no real way to block it.

It's just nasty stuff. Intent matters, and if you're selling a service that's used only by the bad guys, you're a bad guy too. This is not some dual-use, maybe-we-should-accept-the-risks deal that you have with Tor.

bigfatkitten · 2026-01-31T04:45:20 1769834720

If they're lucky. Sometimes people have their doors kicked in by armed police.

CodeMage · 2026-01-31T01:47:02 1769824022

Getting rid of malware is good. A private for-profit company exercising its power over the Internet, not so much. We should have appropriate organizations for this.

vachina · 2026-01-31T04:01:59 1769832119

The proxies is the reason why you get spam in your Google search result, spam in your Play store (by means of fake good reviews), basically spam in anything user generated.

It directly affects Google and you, I don’t see why they should not do this.

Nextgrid · 2026-01-31T04:28:17 1769833697

Spam in Google search results is due to Google happily taking money from the spammers in exchange for promoting their spam, or that the spam sites benefit Google indirectly by embedding Google Ads/Analytics.

I don't see any spam in Kagi, so clearly there is a way to detect and filter it out. Google is simply not doing so because it would cut into their profits.

UqWBcuFx6NV4r · 2026-01-31T03:48:38 1769831318

Okay. You get right on that. In the meantime, would you rather they did nothing? What do you actually want, in concrete terms?

xyzzy_plugh · 2026-01-30T20:48:39 1769806119

> These efforts to help keep the broader digital ecosystem safe supplement the protections we have to safeguard Android users on certified devices. We ensured Google Play Protect, Android’s built-in security protection, automatically warns users and removes applications known to incorporate IPIDEA SDKs, and blocks any future install attempts.

Nice to see Google Play Protect actually serving a purpose for once.

trollbridge · 2026-01-30T22:00:42 1769810442

Yeah, it serves the purpose of blocking this kind of proxy traffic that isn't in Google's personal best interests.

Only Google is allowed to scrape the web.

1vuio0pswjnm7 · 2026-01-31T04:49:36 1769834976

"Only Google is allowed to scrape the web."

If I m not mistaken, the plaintiffs in the US v Google antitrust case in the DC Circuit tried to argue that website operators are biased toward allowing Google to crawl and against allowing other search engines to do the same

The Court rejected this argument because the plaintiffs did not present any evidence to support it

For someone who does not follow the web's history, how would one produce direct evidence that the bias exists

vachina · 2026-01-31T04:09:29 1769832569

This is demonstrably false by the success of many scrapers from AI companies.

Nextgrid · 2026-01-31T04:32:37 1769833957

LLMs aren't a good indicator of success here because an LLM trained on 80% of the data is just as good as one trained on 100%, assuming the type/category of data is distributed evenly. Proxies help when you do need to get access to 100% of the data.

a456463 · 2026-01-30T22:47:46 1769813266

Yup exactly. Google must be the only one allowed to scrape the web. Google can't have any other competition. Calling it in "user's best interest" is just like their other marketing cons: "play integrity for user's security" etc

viraptor · 2026-01-31T00:43:11 1769820191

Have you got any proof of Google scraping from residential proxies users don't know about, rather than from their clearly labelled AS? Otherwise you're mixing entirely different things into one claim.

misir · 2026-01-31T01:23:55 1769822635

That's the whole point. Websites that try to block scraping attempts will let google scrape without any hurdle because of google's ads and search network. This gives google some advantage over new players because as a new name brand you are hardly going to convince a website to allow scraping even if your product may actually be more advantageous to the website (for example assume you made a search engine that doesn't suck like google, and aggregates links instead of copying content from your website).

Proxies in comparison can allow new players to have some playing chance. That said I doubt any legitimate & ethical business would use proxies.

idiotsecant · 2026-01-31T01:18:45 1769822325

I don't think parent post is claiming that Google is using other people's networks to scrape the web only that they have a strong incentive to keep other players from doing that.

viraptor · 2026-01-31T01:26:20 1769822780

No, there are other scrapers that Google doesn't block or interact with. You can even run scraping from GCP. This has nothing to do with "only Google is allowed to scrape". They even host apps which exist for scraping data, like https://play.google.com/store/apps/details?id=com.sociallead...

direwolf20 · 2026-01-31T00:11:31 1769818291

Does it also block unwanted traffic from Google apps or does it have a particular hatred for companies that interfere with Google's business model?

tgsovlerkhgsel · 2026-01-31T00:26:53 1769819213

Play Protect blocks malicious apps, not network traffic, so no, it obviously doesn't interfere with Google's apps.

AFAIK it also left SmartTube (an alternative YouTube client) alone until the developer got pwned and the app trojanized with this kind of SDK, and the clean versions are AFAIK again being left alone. No guarantee that it won't change in the future, of course, but so far they seem to not be abusing it.

direwolf20 · 2026-01-31T01:42:53 1769823773

Does malicious mean interfering with Google's business model, or does it include intrusive advertising?

whartung · 2026-01-30T21:16:52 1769807812

My understanding is that routing through residential IPs is a part of the business of some VPN providers. I don't know how above board they are on this (as in notifying customers that this may happen, however buried in the usage agreement, or even allowing them to opt out).

But, my main point, is that the whole business is "on the up and up" vs some dark botnet.

kawsper · 2026-01-31T01:46:26 1769823986

Oxylabs sells proxies for scrapers, I suppose you can use the socks-proxy as a VPN, and they claim to use Honeygain.

Honeygain is a platform where people sell their residential internet connection and bandwidth to these companies for money.

For comparison Honeygain pays someone 10 cents per GB, and Oxylabs sells it for $8/GB.

aussieguy1234 · 2026-01-31T04:19:10 1769833150

That takes buying low and selling high to a whole new level

nielsbot · 2026-01-30T21:22:27 1769808147

FTA

> While operators of residential proxies often extol the privacy and freedom of expression benefits of residential proxies, Google Threat Intelligence Group’s (GTIG) research shows that these proxies are overwhelmingly misused by bad actors

direwolf20 · 2026-01-30T21:32:32 1769808752

Google's definition of a "bad actor" is someone who wants to use Google without seeing the ads. Or Kagi. Or an AI other than Gemini.

scirob · 2026-01-30T21:59:25 1769810365

so that only google and anthropic are allowed to scrape the web. No one else may have workarounds

a456463 · 2026-01-30T22:48:17 1769813297

Exactly. This is just google building a "moat" around their shady business.

cvalka · 2026-01-31T03:46:55 1769831215

chatmasta · 2026-01-31T02:13:48 1769825628

Why are they leaving Bright Data (aka Illuminati aka Hola VPN) untouched? They are doing this exact scheme on an industrial scale.

7thpower · 2026-01-31T03:38:59 1769830739

They have a robust KYC that appears to serve, at least in large part, as a way to stay off the shit list of companies with the resources to pursue recourse.

Source: went through that process, ended up going a different route. The rep was refreshingly transparent about where they get the data, why the have the kyc process (aside from regulatory compliance).

Ended up going with a different provider who has been cheaper and very reliable, so no complaints.

chatmasta · 2026-01-31T04:20:37 1769833237

Yeah, they make you do a Skype interview (or probably Zoom interview nowadays). You could call this KYC or collateral, depending on your view of the company. It does limit the nefariousness of their clientele but I doubt they do much, or any, monitoring of actual traffic after onboarding (not for compliance reasons, anyway).

londons_explore · 2026-01-30T21:23:40 1769808220

We need more residential proxies, not less.

I've had enough of companies saying "you're connecting from an AWS IP address, therefore you aren't allowed in, or must buy enterprise licensing". Reddit is an example which totally blocks all data to non-residential IP's.

I want exactly the same content visible no matter who you are or where you are connecting from, and a robust network of residential proxies is a stepping stone to achieving that.

ndiddy · 2026-01-30T21:44:37 1769809477

If you look at the article, the network they disrupted pays software vendors per-download to sneakily turn their users into residential proxy endpoints. I'm sure that at least some of the time the user is technically agreeing to some wording buried in the ToS saying they consent to this, but it's certainly unethical. I wouldn't want to proxy traffic from random people through my home network, that's how you get legal threats from media companies or the police called to your house.

londons_explore · 2026-01-30T21:51:22 1769809882

> that's how you get legal threats from media companies or the police called to your house.

Or residential proxies get so widespread that almost every house has a proxy in, and it becomes the new way the internet works - "for privacy, your data has been routed through someone else's connection at random".

Imustaskforhelp · 2026-01-30T22:20:17 1769811617

> Or residential proxies get so widespread that almost every house has a proxy in, and it becomes the new way the internet works - "for privacy, your data has been routed through someone else's connection at random".

Is this a re-invention of tor, maybe I2P?

rolph · 2026-01-30T23:53:04 1769817184

IP8 address tumbler? to wit, playing the shell game, to obstruct direct attribution.

dataviz1000 · 2026-01-30T22:36:10 1769812570

They provide an SDK for mobile developers. Here is a video of how it works. [0] They don't even hide it.

[0] https://www.youtube.com/watch?v=1a9HLrwvUO4&t=15s

ndiddy · 2026-01-30T23:09:26 1769814566

Of course they're pitching it like everything's above board, but from the article:

> While many residential proxy providers state that they source their IP addresses ethically, our analysis shows these claims are often incorrect or overstated. Many of the malicious applications we analyzed in our investigation did not disclose that they enrolled devices into the IPIDEA proxy network. Researchers have previously found uncertified and off-brand Android Open Source Project devices, such as television set top boxes, with hidden residential proxy payloads.

direwolf20 · 2026-01-31T00:12:26 1769818346

If popup ads that open the play store are ethical, this is ethical.

JDye · 2026-01-30T22:35:32 1769812532

I live in the UK and can't view a large portion of the internet without having to submit my ID to _every_ site serving anything deemed "not safe the for the children". I had a question about a new piercing and couldn't get info on it from Reddit because of that. I try using a VPN and they're blocked too. Luckily, I work at a copmany selling proxies so I've got free proxies whenever I want, but I shouldn't _need_ to use them.

I find it funny that companies like Reddit, who make their money entirely from content produced by users for free (which is also often sourced from other parts of the internet without permission), are so against their site being scraped that they have to objectively ruin the site for everyone using it. See the API changes and killing off of third party apps.

Obviously, it's mostly for advertising purposes, but they love to talk about the load scraping puts on their site, even suing AI companies and SerpApi for it. If it's truly that bad, just offer a free API for the scrapers to use - or even an API that works out just slightly cheaper than using proxies...

My ideal internet would look something like that, all content free and accessible to everyone.

Aurornis · 2026-01-30T22:53:31 1769813611

> that they have to objectively ruin the site for everyone using it. See the API changes and killing off of third party apps.

Third party app users were a very small but vocal minority. The API changes didn't drop their traffic at all. In fact, it's only gone up since then.

The datacenter IP address blocks aren't just for scrapers, it's an anti-bot measure across the board. I don't spend much time on Reddit but even the few subreddits I visited were starting to become infiltrated by obvious bot accounts doing weird karma farming operations.

Even HN routinely gets AI posting bots. It's a common technique to generate upvote rings - Make the accounts post comments so they look real enough, have the bots randomly upvote things to hide activity, and then when someone buys upvotes you have a selection of the puppet accounts upvote the targeted story. Having a lot of IP addresses and generating fake activity is key to making this work, so there's a lot of incentive to do it.

JDye · 2026-01-30T23:16:01 1769814961

I agree that write-actions should be protected, especially now when every other person online is a bot. As for read-actions, I'll continue to profit off those being protected too but I wouldn't be too bothered if something suddenly changed and all content across the internet was a lot easier to access programmatically. I think only harm can come from that data being restricted to the huge (nefarious) companies that can pay for that data or negotiate backroom deals.

direwolf20 · 2026-01-31T00:12:57 1769818377

Reddit's traffic is almost exclusively propaganda bots.

what · 2026-01-31T04:14:43 1769832883

Have you considered that it’s because a new industry popped up that decided it was okay to slurp up the entire internet, repackage it, and resell it? Surely that couldn’t be why sites are trying to keep non humans out.

201984 · 2026-01-31T00:37:21 1769819841

Fix your government.

JDye · 2026-01-31T01:07:52 1769821672

Thanks lad. Will get right on it.

Aurornis · 2026-01-30T22:04:02 1769810642

> I want exactly the same content visible no matter who you are or where you are connecting from

The reason those IP addresses get blocked is not because of "who" is connecting, but "what"

Traffic from datacenter address ranges to sites like Reddit is almost entirely bots and scrapers. They can put a tremendous load on your site because many will try to run their queries as fast as they can with as many IPs as they can get.

Blocking these IP addresses catches a few false positives, but it's an easy step to make botting and scraping a little more expensive. Residential proxies aren't all that expensive, but now there's a little line item bill that comes with their request volume that makes them think twice.

> We need more residential proxies, not less

Great, you can always volunteer your home IP address as a start. There are services that will pay you a nominal amount for it, even.

direwolf20 · 2026-01-30T21:33:58 1769808838

You can run one, something like ByteLixir, Traffmonetizer, Honeygain, Pawns, there are lots more, just google "share my internet for money"

What will you be proxying? Nobody knows! I haven't had the police at my house yet.

Seems a great way to say "fuck you" to companies that block IP addresses.

You may see a few more CAPTCHAs. If you have a dynamic IP address, not many.

dist-epoch · 2026-01-30T22:32:05 1769812325

How much can you make if you run all of them at the same time?

Doesn't the ISP detect them?

direwolf20 · 2026-01-31T00:05:40 1769817940

like $3 a month

and why would they

tokyobreakfast · 2026-01-30T22:27:02 1769812022

> I've had enough of companies saying "you're connecting from an AWS IP address

I run a honeypot and the amount of bot traffic coming from AWS is insane. It's like 80% before filtering, and it's 100% illegitimate.

yuliyp · 2026-01-31T01:30:18 1769823018

The end game of that is no useful content being accessible without login, or needing some sort of other proof-of-legitimacy.

Nextgrid · 2026-01-31T04:40:15 1769834415

That's already the case (irrespective of residential proxies) because content only serves as bait for someone to hand over personal information (during signup/login) and then engage with ads.

Proxies actually help with that by facilitating mass account registration and scraping of the content without wasting a human's time "engaging" with ads.

supertrope · 2026-01-31T02:04:01 1769825041

Amazon.com now only shows you a few reviews. To see the rest you must login. Social media websites have long gated the carrots behind a login. Anandtech just took their ball and went home by going offline.

nine_k · 2026-01-31T00:27:44 1769819264

There's a company that pays you to keep their box connected to your residential router. I assume it sells residential proxy services, maybe also DDoS services, I don't know. It's aptly named Absurd Computing.

crtasm · 2026-01-30T23:08:05 1769814485

I'm reading reddit.com from a Tor node, they also have a .onion domain you could use.

Jblx2 · 2026-01-30T23:58:57 1769817537

Anyone know how to create a usable reddit account from the .onion domain?

phyzome · 2026-01-31T00:05:59 1769817959

I've tried it, and my account was shadowbanned a few hours after I created it. It's very obnoxious.

cluckindan · 2026-01-31T00:28:33 1769819313

Reddit bots shadowban almost everyone who post before they have enough comment karma. Nothing to do with Tor or VPN.

xg15 · 2026-01-30T21:32:38 1769808758

Also, nevermind the tech companies building their own proxy networks, such as Find My or Amazon Sidewalk.

a456463 · 2026-01-30T22:40:49 1769812849

Agreed. With things people paid for and using our wifi data to build their "positioning dbs" that you can't block or turn off on your phone, without "rooting" your own device.

enneff · 2026-01-30T22:57:17 1769813837

How is Find My a proxy network?

direwolf20 · 2026-01-31T00:14:05 1769818445

In the literal sense. Your traffic is proxied through devices belonging to unwilling strangers.

enneff · 2026-01-31T00:22:23 1769818943

By “your traffic” you mean device location reports? Or something else?

DANmode · 2026-01-31T03:32:54 1769830374

The data that powers the app tracking your devices, shown on your devices, yes.

(What else?)

enneff · 2026-01-31T03:53:04 1769831584

I don’t know. I wouldn’t have thought of myself as proxying other people’s traffic by carrying my iPhone around. (For one thing, it’s my own phone that initiates all the activity- it monitors for Apple devices, the devices don’t reach out to my phone.) I can see how you could frame it that way, though. I just thought they might be referring to something else that I didn’t know about.

MBCook · 2026-01-31T04:16:56 1769833016

I remain skeptical. I can understand how one would might see it that way, but I think it’s stretching the word proxy too far.

Devices on Apple’s Find My aren’t broadcasting anything like packets that get forwarded to a destination of their choosing. I would think that would be a necessity to call it “proxying”.

They’re just broadcasting basic information about themselves into the void. The phones report back what they’ve picked up.

That doesn’t fit the definition to me.

I absolutely don’t mind the fact that my phone is doing that. The amount of data is ridiculously minuscule. And it’s sort of a tit for tat thing. Yeah my phone does it, but so does theirs. So just like I may be helping you locate your AirTag, you would be helping me locate mine. Or any other device I own that shows up on Find My.

It’s a very close to a classic public good, with the only restriction being that you own a relevant device.

packetslave · 2026-01-30T21:30:25 1769808625

> Reddit is an example which totally blocks all data to non-residential IP's.

No, we don't.

direwolf20 · 2026-01-30T21:31:49 1769808709

Have you tried it? Every new account will be shadowbanned and if it's shared you often get blank page 429. None of this was true before the API shutdown.

3rodents · 2026-01-30T22:14:15 1769811255

That’s not my experience, using various VPNs, public networks, Cloudflare and Apple private relays. A captcha is common when logged out but that’s about it, I have not encountered any shadow bans. I create a new account each week.

gruez · 2026-01-30T21:51:51 1769809911

>Every new account will be shadowbanned

That's not the same as "blocks all data to non-residential IP's"?

>if it's shared you often get blank page 429. None of this was true before the API shutdown.

See my other comment. I agree there's a non-zero amount of VPNs that are banned from reddit, but it's also not particularly hard to find a VPN that's not banned on reddit.

interloxia · 2026-01-30T22:01:40 1769810500

Probably not hard but my poor little innocent VPS at Hetzer that I have had for years is denied and that makes me sad.

piskov · 2026-01-30T21:56:49 1769810209

Yes you do.

Private VPS for personal VPN in Netherlands (digital ocean), then Hungary (some small local DC) — both are blocked from day one.

> You've been blocked by network security. To continue, log in to your Reddit account or use your developer token. If you think you've been blocked by mistake, file a ticket below and we'll look into it.

what · 2026-01-31T04:23:54 1769833434

Sounds like you just need to sign in or use the api?

Imustaskforhelp · 2026-01-30T22:21:55 1769811715

Proton VPN sometimes (mostly?) has this issue too. It's a bit of an hit or miss in there iirc but I have definitely seen the last message of your comment.

hackeman300 · 2026-01-30T21:46:59 1769809619

Try browsing from any Mullvad vpn. You will be "blocked by network security"

edoceo · 2026-01-30T22:35:20 1769812520

I use mullvad regularly & visit reddit from that connection - it works. But! You have to sign-in.

gruez · 2026-01-30T21:48:37 1769809717

That's just mullvad's IP pool being banned. The other VPN providers I use aren't banned, or at least are only intermittently banned that I can easily switch to another server.

yuliyp · 2026-01-31T01:31:22 1769823082

... if you're logged out. Log in so they don't have to lump you in with every scraper you're sharing a subnet with.

thot_experiment · 2026-01-30T22:09:02 1769810942

I have never interacted with a reddit employee who wasn't actively gaslighting me about the platform. Do you even use the site? I talked to a PM recently who genuinely thought the phone app was something people liked.

MBCook · 2026-01-31T04:20:29 1769833229

There are people who actively like it.

I don’t. But they 100% exist.

direwolf20 · 2026-01-31T00:14:47 1769818487

They probably get paid by how many people believe their nonsense.

leftouterjoins · 2026-01-31T00:28:44 1769819324

everything on Reddit is so locked down it’s useless. even if you do get to post something useful some basement dwelling mod will block it for an arcane interpretation of one of the subreddits 14 rules.

dvngnt_ · 2026-01-30T22:03:35 1769810615

there are several times where I've had to disable PIA to access reddit's login page

a456463 · 2026-01-30T22:41:20 1769812880

Have you tried using it logged out on a vpn? It is impossible.

a456463 · 2026-01-30T22:38:05 1769812685

This blog post from the company that used promise "don't be evil", one that steals water for data centers from vilages and towns via shady deals, whose whole premise it stealing other people's stuff and claiming it as their own and locking them out and selling their data.. Who made them the arbiter of the internet? No one!!!

They just stole this and get on their high horse to tell people how to use internet? You can eff right off Google.

BoredPositron · 2026-01-30T22:03:33 1769810613

I still "run" a small ISP with a few thousand residential ips from my scraping days. The requirements are laughable and costs were negligible in the early 2000s.

IhateAI · 2026-01-31T01:17:42 1769822262

How do you stop mobile proxies operating through similar nefarious business models... CGNAT prevents you from easily identifying the exit nodes.

UqWBcuFx6NV4r · 2026-01-31T03:57:35 1769831855

Working with network operators.

Nextgrid · 2026-01-31T04:48:11 1769834891

Network operators have zero reason to care, they get paid per the GB for the bandwidth.

ExpertAdvisor01 · 2026-01-31T03:00:42 1769828442

Of course brightdata doesn't get touched.

direwolf20 · 2026-01-30T21:35:20 1769808920

All of this sounds legal, so on what basis did they get them shut down?

SOTGO · 2026-01-30T21:58:13 1769810293

I haven't looked at any court documents, but the WSJ article from Wednesday reported that "Last year, Google sued the anonymous operators of a network of more than 10 million internet-connected televisions, tablets and projectors, saying they had secretly pre-installed residential proxy software on them... an Ipidea spokeswoman acknowledged in an email that the company and its partners had engaged in “relatively aggressive market expansion strategies” and “conducted promotional activities in inappropriate venues (e.g., hacker forums)...”"

There was also a botnet, Kimwolf, that apparently leveraged an exploit to use the residential proxy service, so it may be related to Ipidea not shutting them down.

direwolf20 · 2026-01-31T01:43:48 1769823828

Google does much worse in Google–branded devices and apps, like the wifi location data harvesting.

kotaKat · 2026-01-30T20:56:51 1769806611

I'm actually a little shocked seeing that there was a WebOS variant of the residential proxying SDK endpoint. Does that mean there might be a bit more unchecked malware lurking behind the scenes in the LG ecosystem?

Personally I'm surprised they didn't have a Samsung option.

wincy · 2026-01-30T21:31:45 1769808705

I keep my brand new LG C5 totally disconnected from the internet and use my Apple TV for movie watching. I’m not going to trust a company like LG to secure their devices.

xnx · 2026-01-30T21:53:30 1769810010

> trust a company like LG to secure their devices.

They have an interest in securing their devices so they can sell proxy service themselves.

htx80nerd · 2026-01-30T22:35:06 1769812506

nice to see in the comments how many people didnt even do a 30 second scan of the article before clicking `add comment`

samsullivan · 2026-01-30T21:18:26 1769807906

The need for proxies in any legitimate context became obsolete with starlink being so widespread. Throw up a few terminals and you have about 500-2k cgnat IP addresses to do whatever you like.

JDye · 2026-01-30T22:22:24 1769811744

2k IPs is not enough to do most enterprise scale scraping. Starlink's entire ASN doesn't seem to have enough V4 addresses to handle it even.

chatmasta · 2026-01-31T02:16:13 1769825773

The actual secret is to use IPv6 with varied source IPs in the same subnet, you get an insane number of IPs and 90% of anti-scraping software is not specialized enough to realize that any IP in a /64 is the same as a single IP in a /32 in IPv4.