thevoiceoftruth
New member
TLDR: if you knew every product sold on the internet, what would you do with that info?
Backstory-
I have spent the last 6-12 months building a whole series of web crawlers which crawl and collect 90% of all products on the web. Sounds far fetched right? Well it’s true and I’m able to do this because I have built up a very large homelab to support it all. For the technical folks it’s a petabyte hdd, 100tb flash, 160cores, 3tb of ddr4, 4 3090 GPUs, 10gb sfp, and cost me $15k.
In total it’s about 900 million product pages across 2 million websites and that does not factor in Amazon or eBay new products which I can pull in. It also doesn’t include Alibaba, aliexpress or other country specific marketplaces.
I’m able to do price watching, availability monitoring, product grouping, image similarity search and many other things with this data.
I have tested this out using cloudflare tunnels and hosting it all out of my house, the net result is that a user can preform any of the above searches and get results and images within 2 seconds.
Why did I build it-
Originally I started this because I was looking for some very specific baby clothes (a raccoon baby onesie). I couldn’t find anything on Amazon, eBay didn’t have what I wanted and I questioned the quality of stuff shipped from China, Etsy didn’t have the styles I wanted and was pricey, I ultimately went through 10 pages on google until I found a store that featured what I wanted. Would have been great if I didn’t have to spend 4 hours finding what I wanted.
My problem-
Google and others have a renewed focus in their shopping tools and they have implemented some of the features I originally thought made me unique. This was to be expected, I just didn’t think they would move so quick.
What I need help with-
I need some ideas from other folks in this sub about what they would do with this data. I have a strong preference in wanting to provide a platform for everyday people to use, but I want to put users first and don’t want to bastardize it with sponsored listings/ads; maybe one ad per page or every 5 minutes but nothing more as I’m anti-enshitificaion. I could also go with a b2b SaaS route, however there are so many different ones out there I’m not sure where the fit would be.
So the question is, if you had the details of every product on the internet, what business would you start to leverage that info?
Backstory-
I have spent the last 6-12 months building a whole series of web crawlers which crawl and collect 90% of all products on the web. Sounds far fetched right? Well it’s true and I’m able to do this because I have built up a very large homelab to support it all. For the technical folks it’s a petabyte hdd, 100tb flash, 160cores, 3tb of ddr4, 4 3090 GPUs, 10gb sfp, and cost me $15k.
In total it’s about 900 million product pages across 2 million websites and that does not factor in Amazon or eBay new products which I can pull in. It also doesn’t include Alibaba, aliexpress or other country specific marketplaces.
I’m able to do price watching, availability monitoring, product grouping, image similarity search and many other things with this data.
I have tested this out using cloudflare tunnels and hosting it all out of my house, the net result is that a user can preform any of the above searches and get results and images within 2 seconds.
Why did I build it-
Originally I started this because I was looking for some very specific baby clothes (a raccoon baby onesie). I couldn’t find anything on Amazon, eBay didn’t have what I wanted and I questioned the quality of stuff shipped from China, Etsy didn’t have the styles I wanted and was pricey, I ultimately went through 10 pages on google until I found a store that featured what I wanted. Would have been great if I didn’t have to spend 4 hours finding what I wanted.
My problem-
Google and others have a renewed focus in their shopping tools and they have implemented some of the features I originally thought made me unique. This was to be expected, I just didn’t think they would move so quick.
What I need help with-
I need some ideas from other folks in this sub about what they would do with this data. I have a strong preference in wanting to provide a platform for everyday people to use, but I want to put users first and don’t want to bastardize it with sponsored listings/ads; maybe one ad per page or every 5 minutes but nothing more as I’m anti-enshitificaion. I could also go with a b2b SaaS route, however there are so many different ones out there I’m not sure where the fit would be.
So the question is, if you had the details of every product on the internet, what business would you start to leverage that info?