Semantic Memory

Semantic Memory is an open-source library and service specializing in the efficient indexing of datasets through custom continuous data pipelines.

Utilizing advanced embeddings and LLMs, the system enables natural language querying for obtaining answers from the indexed data, complete with citations and links to the original sources.

Designed for seamless integration with Semantic Kernel, Semantic Memory enhances data-driven features in applications built using SK.

ℹ️ NOTE: the documentation below is a work in progress and will evolve quickly, as the project is not fully functional yet.

Examples

Importing memory, locally, without deployments

Importing files into your Semantic Memory can be as simple as this:

var memory = new MemoryPipelineClient();

await memory.ImportFileAsync("file1.docx",
    new ImportFileOptions("user-id-1", "memory-collection"));

await memory.ImportFilesAsync(new[] { "file2.docx", "file3.pdf" },
    new ImportFileOptions("user-id-1", "memory-collection"));

The code leverages the default data ingestion pipeline:

1. Extract text
2. Partition the text into small chunks
3. Generate an embedding for each chunk
4. Save the embeddings into a vector index
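
The partition, embedding, and indexing steps above can be illustrated with a minimal, self-contained sketch. This is not the code used by Semantic Memory: the class and method names (DefaultPipelineSketch, Partition, ToyEmbedding, Index) are hypothetical, text extraction is assumed to have already happened, the "embedding" is a toy character count standing in for a real embedding model, and the "vector index" is a plain in-memory list.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Illustrative only: hypothetical names, not part of the Semantic Memory API.
public static class DefaultPipelineSketch
{
    // Partition: split already-extracted text into chunks of at most maxWords words.
    public static List<string> Partition(string text, int maxWords)
    {
        if (maxWords <= 0) throw new ArgumentOutOfRangeException(nameof(maxWords));

        var separators = new[] { ' ', '\n', '\r', '\t' };
        var words = text.Split(separators, StringSplitOptions.RemoveEmptyEntries);

        var chunks = new List<string>();
        for (int i = 0; i < words.Length; i += maxWords)
        {
            chunks.Add(string.Join(" ", words.Skip(i).Take(maxWords)));
        }
        return chunks;
    }

    // Embedding (toy stand-in): a real pipeline would call an embedding model.
    // Here each letter or digit increments one of a few buckets.
    public static float[] ToyEmbedding(string chunk, int dimensions = 8)
    {
        var vector = new float[dimensions];
        foreach (char c in chunk.ToLowerInvariant())
        {
            if (char.IsLetterOrDigit(c)) vector[c % dimensions] += 1f;
        }
        return vector;
    }

    // Index (toy stand-in): an in-memory list instead of a vector database.
    public static List<(string Chunk, float[] Vector)> Index(string text, int maxWords)
    {
        return Partition(text, maxWords)
            .Select(chunk => (Chunk: chunk, Vector: ToyEmbedding(chunk)))
            .ToList();
    }
}
```

For example, DefaultPipelineSketch.Index(someText, 200) would return one (chunk, vector) pair for every 200 words of input. In a real deployment each stage would be replaced by the corresponding handler of the ingestion pipeline.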

Import memory using Semantic Memory Web Service

Depending on the configuration, the code above can run locally, inside your process, or remotely through a service.

Local execution is fine if you are importing small files, need only C# or Python, and can block the process during the import.

However, if you are in one of these scenarios:

I'd just like a web service to import data and answer queries
My app is written in TypeScript, Java, Rust, or some other language
I want to define custom pipelines mixing multiple languages like Python, TypeScript, etc.
I'm importing big documents that can require minutes to process, and I don't want to block the user interface
I need memory import to run independently, supporting failures and retry logic

then you can deploy Semantic Memory as a web service, plugging in the default handlers or your custom Python/TypeScript/Java/etc. handlers, leveraging the asynchronous queues automatically available.

If you deploy the default web service available in the repo, you only need to change the configuration, and use the same code above.

To import files using the Semantic Memory web service, use MemoryWebClient:

var memory = new MemoryWebClient("http://127.0.0.1:9001"); // <== URL where the web service is running

await memory.ImportFileAsync("file1.docx",
    new ImportFileOptions("user-id-1", "memory-collection"));

await memory.ImportFilesAsync(new[] { "file2.docx", "file3.pdf" },
    new ImportFileOptions("user-id-1", "memory-collection"));

Custom import pipelines

If you need a custom data pipeline, you can define your own steps, each handled by your custom business logic:

var app = AppBuilder.Build();
var storage = app.Services.GetService<IContentStorage>();

// Use a local, synchronous, orchestrator
var orchestrator = new InProcessPipelineOrchestrator(storage);

// Define custom .NET handlers
var step1 = new MyHandler1("step1", orchestrator);
var step2 = new MyHandler2("step2", orchestrator);
var step3 = new MyHandler3("step3", orchestrator);
await orchestrator.AddHandlerAsync(step1);
await orchestrator.AddHandlerAsync(step2);
await orchestrator.AddHandlerAsync(step3);

// Instantiate a custom pipeline
var pipeline = orchestrator
    .PrepareNewFileUploadPipeline("mytest", "user-id-1", new[] { "memory-collection" })
    .AddUploadFile("file1", "file1.docx", "file1.docx")
    .AddUploadFile("file2", "file2.pdf", "file2.pdf")
    .Then("step1")
    .Then("step2")
    .Then("step3")
    .Build();

// Execute in process, process all files with all the handlers
await orchestrator.RunPipelineAsync(pipeline);
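
The snippet above does not show what a handler looks like or how an orchestrator dispatches steps to it. The following self-contained sketch illustrates the general idea only. Every type in it (PipelineState, IStepHandler, LoggingHandler, MiniOrchestrator) is hypothetical and is not the Semantic Memory API; the actual handler interface and orchestrator in the repo may differ.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;

// Hypothetical types for illustration only; not the Semantic Memory API.
public sealed class PipelineState
{
    public List<string> Files { get; } = new List<string>();
    public List<string> Log { get; } = new List<string>();
}

public interface IStepHandler
{
    string StepName { get; }
    Task InvokeAsync(PipelineState state);
}

// A trivial handler: records that it processed each file.
public sealed class LoggingHandler : IStepHandler
{
    public LoggingHandler(string stepName) => StepName = stepName;

    public string StepName { get; }

    public Task InvokeAsync(PipelineState state)
    {
        foreach (var file in state.Files)
        {
            state.Log.Add($"{StepName}:{file}");
        }
        return Task.CompletedTask;
    }
}

// Runs the requested steps in order, in process, one handler per step name.
public sealed class MiniOrchestrator
{
    private readonly Dictionary<string, IStepHandler> _handlers =
        new Dictionary<string, IStepHandler>();

    public void AddHandler(IStepHandler handler) => _handlers[handler.StepName] = handler;

    public async Task RunAsync(PipelineState state, IEnumerable<string> steps)
    {
        foreach (var step in steps)
        {
            if (!_handlers.TryGetValue(step, out var handler))
            {
                throw new InvalidOperationException($"No handler registered for step '{step}'");
            }
            await handler.InvokeAsync(state);
        }
    }
}
```

In this sketch, each step sees the shared state left by the previous one, which is the property that lets later steps (for example, embedding generation) build on the output of earlier ones (for example, text extraction).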
