@labrenbe commented Jan 15, 2026

Adds a new demo that shows Retrieval-Augmented Generation (RAG) with OpenSearch.

Stack:

  • OpenSearch 3.1.0 with k-NN plugin
  • Ollama running two models: nomic-embed-text:v1.5 (embeddings) & llama3.1:8b (response generation)
  • JupyterLab with interactive RAG pipeline notebook
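How these components might fit together in a RAG pipeline can be sketched as follows. This is a minimal illustration, not the notebook's actual code: it only builds the JSON request bodies for Ollama's `/api/embed` and `/api/generate` endpoints and for an OpenSearch k-NN search. The vector field name `embedding`, the prompt wording, and all function names are assumptions made for this example.

```python
# Model names are taken from the stack listed above.
EMBED_MODEL = "nomic-embed-text:v1.5"
CHAT_MODEL = "llama3.1:8b"


def embed_request(question):
    """Body for Ollama's POST /api/embed endpoint."""
    return {"model": EMBED_MODEL, "input": question}


def knn_query(vector, k=3, field="embedding"):
    """OpenSearch k-NN query body for POST /<index>/_search.

    `field` is a hypothetical name for the knn_vector field.
    """
    return {"size": k, "query": {"knn": {field: {"vector": vector, "k": k}}}}


def build_prompt(question, chunks):
    """Combine the retrieved chunks and the question into one prompt."""
    context = "\n\n".join(f"[{i}] {c}" for i, c in enumerate(chunks, 1))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


def generate_request(prompt):
    """Body for Ollama's POST /api/generate endpoint (non-streaming)."""
    return {"model": CHAT_MODEL, "prompt": prompt, "stream": False}
```

In a full pipeline, the embedding returned for `embed_request` would be passed to `knn_query`, and the text of the top hits would go into `build_prompt` before calling the generation endpoint.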

Data

  • Documentation for the Stackable Data Platform crawled from docs.stackable.tech
  • Pre-generated JSON file containing ~4200 documentation chunks (93 MB, stored in git)
  • Loaded into OpenSearch with a Kubernetes Job
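As a rough sketch of what the loading step could look like, the snippet below builds an index definition with a `knn_vector` field and converts chunks into the NDJSON body expected by the OpenSearch `_bulk` API. It is not the Job's actual implementation: the field names `text` and `embedding`, the index name in the test, and the dimension of 768 (assumed to match the output size of nomic-embed-text) are all assumptions.

```python
import json


def index_body(dimension=768, field="embedding"):
    """Index settings and mappings that enable k-NN search.

    The dimension must equal the embedding model's output size;
    768 is an assumption for nomic-embed-text.
    """
    return {
        "settings": {"index": {"knn": True}},
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                field: {"type": "knn_vector", "dimension": dimension},
            }
        },
    }


def to_bulk_ndjson(chunks, index):
    """Serialize chunks into a body for POST /_bulk.

    Each document is preceded by an action line, and the body
    must end with a newline.
    """
    if not chunks:
        return ""
    lines = []
    for chunk in chunks:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(chunk))
    return "\n".join(lines) + "\n"
```

With a file of this size, sending the chunks in several smaller bulk requests instead of one large body is likely the safer choice.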

The data file should be regenerated after each Stackable release so that the demo never uses outdated documentation. Regeneration can take more than an hour, which is why the data is committed to git rather than generated during demo installation.

@labrenbe self-assigned this Jan 15, 2026
@labrenbe linked an issue Jan 15, 2026 that may be closed by this pull request
Development

Successfully merging this pull request may close these issues.

Add OpenSearch AI demo