Expanso – a startup built to help enterprises manage their ever-growing data needs with a distributed approach to big data processing powered by its open-source software Bacalhau – recently announced it has raised $7.5 million in seed funding led by General Catalyst and Hetz Ventures, along with Array Ventures.
Based out of Seattle, Expanso was co-founded by alums of Google, AWS, and Microsoft and will focus on open-source solutions and targeting enterprises to address what CEO David Aronchick believes is currently an enormous but overlooked challenge – Actually making use of enterprise data.
Distributed big data processing is often complex and challenging. And one of the biggest challenges is the time and cost of transferring data between different nodes to a centralized data lake. And this can make it difficult to be responsive to new data inflows in real-time. Plus, many platforms (while powerful) require converting existing code to new frameworks to access the data – let alone get insights. And distributed big data processing systems are often a rich target for security issues, such as leaking personally identifiable information (PII), regulatory concerns, and data breaches.
The open-source software Bacalhau – developed and backed by Expanso – is built on the principle of “Compute Over Data,” which means that it brings the processing jobs to where the data is, rather than moving it to the cloud first. This has several advantages, including:
1.) Reduced costs: Moving large amounts of data to and from the cloud is expensive.
2.) Enhanced speed: Bacalhau processes data locally, removing cloud transfer latency and boosting performance for data-heavy applications.
3.) Increased security: Not moving the data reduces the risk of data breaches and other security incidents.
With Bacalhau, users can streamline their existing workflows without extensive rewriting by running arbitrary Docker containers and WebAssembly (WASM) images as tasks. The software can run on-premises or inside any cloud, including Amazon Web Services (AWS), Microsoft Azure, Google Cloud, Oracle Cloud, etc.
KEY QUOTES:
“Infrastructure built to meet data where it is, even if distributed around the world, is long overdue. What Expanso is building with Bacalhau is intended to revolutionize the way big data is processed and global compute jobs are executed, while unlocking an entirely new class of applications. We’re excited to partner with General Catalyst, Hetz Ventures, and Array Ventures and use this funding to accelerate the development of Bacalhau and Expanso, and bring it to even more users.”
— Expanso CEO David Aronchick
“Expanso brings compute to the data, enabling businesses to operate securely at their operational pace and maximize the utility of valuable data. In less than a year, Dave and his team of exceptional technologists and entrepreneurs, have achieved significant milestones, with the platform now in use with various sectors, including some of the world’s largest defense organizations. We are proud to support Expanso as they work to enhance the impact of distributed data for businesses worldwide.”
— Quentin Clark, Managing Director of General Catalyst
“A missing part of the modern data stack is the ability to process data where it is being created rather than have to centralize everything first. Bacalhau fills in that missing link, allowing large numbers of remote workers to use DuckDB to filter, summarize, and transform data at the edge before communicating results to MotherDuck in the cloud.”
— Jordan Tigani, CEO and co-founder of MotherDuck