# llm-workbench
A RAG-enabled workbench to index files and chat with an LLM about their contents.
## Installation

Download and install [Ollama](https://ollama.com/), a framework for interacting with LLMs.
After installation, run Ollama:

```
ollama serve
```
To configure the model and the application, run the following in a new terminal:

```
make setup
```
You can copy `sample.env` as `.env` in the app root to customize application settings.
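For example, from the app root:

```
cp sample.env .env
```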
To start the application, run:

```
make
```

A page to load context files and interact with the LLM will open in your browser.
## Features

- One-shot prompts to the LLM.
- File indexing for context querying.
- Prompt tools to assist with prompt construction, context gathering, and response generation.
- Replay of a set of prompts, either from the current prompt history or from a text file.
- Display of all prompts and responses in the chat container.
- Download of all chat messages, or only the last one, as text or HTML files.
## Prompt tools

Tools are used directly in the chat message input box.
| Tool | Usage |
|---|---|
| `:<label>` | Add a label to a prompt for later reference. Labels should contain only lowercase alphanumeric characters and hyphens. |
| `{response:last}` | Replaced by the last response in the chat history. |
| `{response:label:<label>}` | Replaced by the labeled response in the chat history. |
| `/context` | Query chunks from uploaded files. |
| `/context?top-k=<number>` | Set the number of chunks to return. |
| `/context?file="<file name with extension>"` | Query chunks only from the specified file. |
| `/rag <prompt>` | A shortcut to query the context and ask the LLM to use it to answer the prompt. |
| `/endpoint <url>` | Perform a GET request to the provided URL. |
| `/echo` | Echo the prompt without sending it to the LLM. `{response:*}` placeholders can be used for replacements. |
| `/template` | Get the last response as JSON and apply it to a Jinja-based template, allowing custom formatting of the response without relying on the LLM. The JSON data is available in the `context` variable. Refer to the Template usage section for details. |
Tools can be combined in a single prompt using the general syntax:

```
:<label> /<tool> <prompt text, can contain {response:*} for replacement>
```
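For instance, a hypothetical prompt that labels a RAG query for later reference:

```
:sales-summary /rag Summarize the main findings in the uploaded sales reports.
```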
## Template usage

Given a previous JSON response, it's possible to use the `/template` tool to create a template that will be processed using the JSON data.
Using the following JSON as the last response in the prompt history:

```json
{"name": "User"}
```
The prompt below will generate a response using the JSON as input data in the `context` variable:

```
/template Name: {{context.name}}
```
The response to the prompt will be:

```
Name: User
```
Other Jinja constructs can also be used in templates, for example:

```
{% set variable_name = context %}
{{context.field_date|parse_date|format_date("%d/%m/%y %H:%M:%S")}}
{% if context.field_boolean %}Value if True{% else %}Value if False{% endif %}
{% for item in context.list %} {{item.field}} {% endfor %}
```
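As an illustration, assuming the last response was the hypothetical payload `{"field_date": "2024-01-15T10:30:00"}`, the prompt below would reformat the ISO 8601 date using the filters shown above:

```
/template {{context.field_date|parse_date|format_date("%d/%m/%y %H:%M:%S")}}
```

The response would be `15/01/24 10:30:00`.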
Please refer to the [Jinja](https://jinja.palletsprojects.com/) and jinja2_iso8601 documentation for more details on templating.
## API mocking

If you want to use API mocking to test context retrieval from endpoints, you can use the [JSON Server](https://github.com/typicode/json-server) package.
With Node.js/NPM installed, run the following command to install dependencies:

```
make setup/server
```
Add the JSON you want to use as mock data to the `db.json` file and run the server in a new terminal with the command below:

```
make run/server
```
The server will be accessible at http://localhost:3000/, with the root nodes of the JSON file available as URL paths (e.g. the demo `db.json` file has a `data` root node, which is accessible through http://localhost:3000/data).
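A minimal `db.json` sketch with illustrative content (the actual demo file may differ):

```json
{
  "data": [
    { "id": 1, "value": "Example entry" }
  ]
}
```

The mocked endpoint can then be queried from the chat with `/endpoint http://localhost:3000/data`.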
## Changing the LLM model

By default, the workbench uses the Llama3.1 LLM model.

To change the LLM model used by the workbench, update the `FROM` parameter in the `contextualized_assistant.model` file to a model available in the [Ollama library](https://ollama.com/library).
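For example, assuming the file follows Ollama's Modelfile syntax, switching to Mistral would be a one-line change:

```
# contextualized_assistant.model (illustrative excerpt)
FROM mistral
```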
## Using OpenRouter

It's possible to use [OpenRouter](https://openrouter.ai/) for requesting LLM generation from prompts, which replaces the default Ollama generator.

To set up OpenRouter, update the `config.py` settings below:

- `OPEN_ROUTER_KEY`: Enter your OpenRouter API key.
- `MODEL_GENERATOR`: Change to `OPENROUTER`.
- `MODEL_LLM`: Enter the model name from OpenRouter.
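A sketch of the resulting `config.py` values; the setting names come from this README, while the key and model name are placeholders:

```python
# Illustrative values only; replace with your own key and model choice.
OPEN_ROUTER_KEY = "sk-or-..."                    # your OpenRouter API key
MODEL_GENERATOR = "OPENROUTER"                   # replaces the default Ollama generator
MODEL_LLM = "meta-llama/llama-3.1-8b-instruct"   # any model name from OpenRouter
```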
> **Note:** The embedding model still requires Ollama.
## Known issues

- The buttons on the screen are not always disabled during operations. Be aware that clicking different buttons while an action is running may lead to unintended consequences.
- The download of the chat history may not work on the first attempt.
- Complex Excel/.xlsx files may not be loadable due to format incompatibility with `openpyxl`.
- During replay, scrolling may not be automatic.