LlamaIndex: $8.5 Million Closed For Building A Data Framework For LLMs

By Annie Baker • Jun 28, 2023

LlamaIndex – a data framework for Large Language Models (LLMs) – announced it raised $8.5M in seed funding led by Greylock. Angel investors in the round include Jack Altman, Lenny Rachitsky, Mathilde Collin (CEO of Front), Raquel Urtasun (CEO of Waabi), Joey Gonzalez (Berkeley) among others. The company will use the funds to build an enterprise offering on top of the popular open-source project with the same name.

LLMs have a major number of potential use cases, yet they are limited to the data they are trained on. And while exploring the possibilities of GPT-3 this fall, ML researcher and engineer Jerry Liu discovered limitations specifically around the models’ ability to work with data private to an individual or enterprise, from files (PDFs, Powerpoints), to workplace apps (Notion, Slack, Salesforce), to databases (Postgres, MongoDB). The open-source project resonated quickly with the AI community. And after just 6 months, the project has 15K Github Stars, 19K Twitter followers, 200K Monthly downloads, and 6K Discord users.

Observing teams at companies like Uber, Instabase, and Front using LlamaIndex to prototype LLM-powered features over their data, LlamaIndex CEO and co-founder Jerry Liu recognized the project could be incredibly helpful to enterprises who often want to ingest and structure their private data in a way that can be used with LLMs, and perform various LLM tasks over that data and obtain high-quality responses with verifiable sources. And in March, Liu joined forces with Simon Suo (Co-founder and CTO) who he met while working on ML research at Uber, to start LlamaIndex.

Now LlamaIndex’s open-source toolkit can be used with any LLM and offers an easy-to-use and optimized experience. And the toolkit can handle a diverse range of data sources, from structured to semi-structured to unstructured text or even image data, and includes data ingestion, data indexing, and a query engine layer on top. Plus the company’s enterprise solution – which will be available later this year – will be built on top of the popular open-source project. This solution will help enterprises eliminate the technical and security barriers to data usage, and provide a range of services, from scalable/reliable data source connectors to security features such as access control and user management.

KEY QUOTES:

“Many users building applications on top of LLMs want to unlock new use cases with their own private data. To solve for this, I created an open-source project called LlamaIndex to help unlock the full capabilities and use cases of any LLM for both myself and other developers.”

— Jerry Liu, CEO & Co-founder of LlamaIndex

“LlamaIndex’s early traction within the open source community is a reflection of the project’s simplicity and short time-to-value. Jerry and Simon are strong AI technologists who have worked across both research and engineering at companies like Uber, Robust Intelligence, and Quora, and are exactly the type of founders we love to back.”

— Jerry Chen, Partner at Greylock