Your browser doesn't support javascript.
Milvus: A Purpose-Built Vector Data Management System
2021 International Conference on Management of Data, SIGMOD 2021 ; : 2614-2627, 2021.
Article in English | Scopus | ID: covidwho-1299241
ABSTRACT
Recently, there has been a pressing need to manage high-dimensional vector data in data science and AI applications. This trend is fueled by the proliferation of unstructured data and machine learning (ML), where ML models usually transform unstructured data into feature vectors for data analytics, e.g., product recommendation. Existing systems and algorithms for managing vector data have two

limitations:

(1) They incur serious performance issue when handling large-scale and dynamic vector data;and (2) They provide limited functionalities that cannot meet the requirements of versatile applications. This paper presents Milvus, a purpose-built data management system to efficiently manage large-scale vector data. Milvus supports easy-to-use application interfaces (including SDKs and RESTful APIs);optimizes for the heterogeneous computing platform with modern CPUs and GPUs;enables advanced query processing beyond simple vector similarity search;handles dynamic data for fast updates while ensuring efficient query processing;and distributes data across multiple nodes to achieve scalability and availability. We first describe the design and implementation of Milvus. Then we demonstrate the real-world use cases supported by Milvus. In particular, we build a series of 10 applications (e.g., image/video search, chemical structure analysis, COVID-19 dataset search, personalized recommendation, biological multi-factor authentication, intelligent question answering) on top of Milvus. Finally, we experimentally evaluate Milvus with a wide range of systems including two open source systems (Vearch and Microsoft SPTAG) and three commercial systems. Experiments show that Milvus is up to two orders of magnitude faster than the competitors while providing more functionalities. Now Milvus is deployed by hundreds of organizations worldwide and it is also recognized as an incubation-stage project of the LF AI & Data Foundation. Milvus is open-sourced at https//github.com/milvus-io/milvus. © 2021 Owner/Author.

Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 2021 International Conference on Management of Data, SIGMOD 2021 Year: 2021 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 2021 International Conference on Management of Data, SIGMOD 2021 Year: 2021 Document Type: Article