Grass Network
Chris Nguyen
CTO and Co-founder, Wynd Labs
daily upload volume
egressed/month
egress fees
Grass, a decentralized platform for securely sharing unused internet bandwidth to support web data collection for AI, initially relied on a well-known public cloud provider to store datasets. They were approached by a number of large enterprise prospects interested in their product, but the cost structure of their cloud vendor, including egress fees, made it difficult to be competitive and capitalize on serving those enterprises’ needs.
In proof of concept testing, Backblaze B2 Cloud Storage delivered on upload and download speeds as well as S3 API compatibility, making it an easy choice. Grass integrated Backblaze as their primary storage, with their previous provider running in parallel briefly during the migration.
Backblaze hit the sweet spot of high performance, affordability, and compatibility, delivering the rapid upload and unrestricted download speeds that Grass’s use case demanded. The Backblaze pricing model, with no delete fees and free egress up to 3x data stored, enabled Grass to unlock growth and reinvest cycles in product development.
Grass enables the curation of massive, internet-scale datasets through a global network of opt-in users sharing unused internet bandwidth.
With three million users, Grass effectively has a network of 3 million nodes worldwide with a network bandwidth of around 100 Gbits per second and growing. These nodes receive web requests from Grass’ validators and scrape data from websites by passing these requests through as normal traffic. The scraped data is then sent back to Grass’ servers where it is cleaned and reformatted before being stored on Backblaze. This ethically sourced, massive dataset is then curated by Grass and sold to enterprises for AI training and other data-driven applications.
The founders of Wynd Labs envisioned a future where AI is fueled by ethically sourced, high-quality, unbiased data. The path to that future looked like Grass, a global, distributed, opt-in network that allows users to contribute their unused bandwidth and compute resources to scrape the internet. Grass initially stored the datasets it gathered with a traditional cloud provider but hit a blocker when approached by large enterprise prospects interested in the product: the cloud provider’s cost structure would effectively kill the deals, especially when clients demanded faster data retrieval at scale.
The storage provider fell short on three counts:
We looked at our cost basis at the time, and it was difficult for us to be competitive with our previous storage system. We couldn’t keep growing at that price.
Chris Nguyen, CTO and Co-founder, Wynd Labs
Grass explored multiple storage providers, comparing large, general-purpose cloud providers to more storage-specialized options. Backblaze won out with its affordability, usability, and positive reviews on platforms like Reddit—Wynd Labs CTO, Chris Nguyen, particularly trusted the reviews from engineers about the functionality of the product.
Nguyen then kicked off a proof of concept by starting with a pay-as-you-go plan to test upload speeds and integration via the Backblaze S3 compatible API, which he found to be straightforward. Convinced by the results, he engaged with Backblaze for a larger commitment and migrated data while continuing to run the previous provider parallel to mitigate downtime.
We were surprised by the level of feature support we received. It's a lot nicer than someone saying 'sorry this feature isn't supported’ and you never know when it’s going to be ready.
Chris Nguyen, CTO and Co-founder, Wynd Labs
Backblaze's scalable object storage solution has been instrumental in supporting the Grass network’s rapid growth and ambitious data collection goals. Specifically, Backblaze delivers:
By eliminating the high costs and limitations faced with their previous provider, Backblaze has freed up resources Grass can reinvest into further developing its infrastructure and technology. The partnership empowers Grass to pursue its goal of not only competing with industry giants but ultimately reshaping the AI landscape by making ethically sourced, real-time data accessible to all.
Backblaze excels at retrieval—we have yet to hear any complaints.
Chris Nguyen, CTO and Co-founder, Wynd Labs