roadmap: Jan has threads based RAG #4817

Open
1 task
dan-menlo opened this issue Mar 20, 2025 · 6 comments

Comments


dan-menlo commented Mar 20, 2025

Goal

  • Have a simple "Chat with Docs"
    • Only for new conversations
    • LanceDB
  • But still support legacy RAG (let it be)

Tasks

  • Sync up w/ Louis
@dan-menlo dan-menlo added this to Menlo Mar 20, 2025
@github-project-automation github-project-automation bot moved this to Investigating in Menlo Mar 20, 2025

kalle07 commented Mar 23, 2025

hey...
please start as simple as possible with nomic (as embedder) and it will work well.
if I'm right, you can find some code here:
https://github.com/nomic-ai/gpt4all
It's important that the embedding chunk size and the number of snippets are configurable; a good starting point is 512 tokens and 4 snippets.
Also create a collection, with an option to see my docs, how many words, and how many vectors are embedded (maybe live embedding).
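The chunk-size/snippet flow suggested above (512-token chunks, top-4 snippets) can be sketched in plain TypeScript. Everything below is illustrative: the whitespace "tokenizer" and hash-based embedder are toy stand-ins for a real tokenizer and an embedding model such as nomic-embed-text, and none of the names come from Jan's codebase.

```typescript
const CHUNK_TOKENS = 512; // embedding chunk size
const TOP_K = 4;          // number of snippets fed to the LLM

// Crude "tokenizer": whitespace-separated words stand in for real tokens.
function chunkText(text: string, chunkTokens: number = CHUNK_TOKENS): string[] {
  const words = text.split(/\s+/).filter(Boolean);
  const chunks: string[] = [];
  for (let i = 0; i < words.length; i += chunkTokens) {
    chunks.push(words.slice(i, i + chunkTokens).join(" "));
  }
  return chunks;
}

// Toy hashed bag-of-words embedding so the example runs without a model.
function embed(text: string, dim = 64): number[] {
  const v: number[] = new Array(dim).fill(0);
  for (const w of text.toLowerCase().split(/\s+/).filter(Boolean)) {
    let h = 0;
    for (const c of w) h = (h * 31 + c.charCodeAt(0)) % dim;
    v[h] += 1;
  }
  return v;
}

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

// Retrieve the k chunks most similar to the query.
function topSnippets(query: string, chunks: string[], k: number = TOP_K): string[] {
  const q = embed(query);
  return chunks
    .map(c => ({ c, score: cosine(q, embed(c)) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map(x => x.c);
}
```

Word counts and vector counts for the "collection view" idea fall out of the same data: words per document from the tokenizer, one vector per chunk.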

After the model answers (and maybe as an option before, too), show the retrieved snippets as plain text.

Next step: let every LLM model, embedding model, and document be paired with its own settings (system prompt, token limits, ...everything).
Take a look at AnythingLLM; they create a workspace for each set of settings.
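The per-workspace settings idea above could be modeled as a plain config object. All field names and default values here are hypothetical illustrations, not anything from Jan or AnythingLLM:

```typescript
// Hypothetical per-workspace RAG settings, modeled on the AnythingLLM-style
// "workspace" idea described above. All names and defaults are illustrative.
interface WorkspaceSettings {
  llmModel: string;      // chat model used in this workspace
  embedModel: string;    // embedding model used for its documents
  systemPrompt: string;  // per-workspace system prompt
  contextTokens: number; // token budget for the LLM context
  chunkTokens: number;   // embedding chunk size (e.g. 512)
  topK: number;          // snippets retrieved per query (e.g. 4)
}

const DEFAULTS: WorkspaceSettings = {
  llmModel: "llama-3.2-3b",
  embedModel: "nomic-embed-text-v1.5",
  systemPrompt: "Answer using only the provided document snippets.",
  contextTokens: 8192,
  chunkTokens: 512,
  topK: 4,
};

// Each workspace starts from the defaults and overrides only what it needs.
function makeWorkspace(overrides: Partial<WorkspaceSettings> = {}): WorkspaceSettings {
  return { ...DEFAULTS, ...overrides };
}
```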

Next step: implement more embedder models. The BERT-based ones all work more or less the same way, but there are others: Qwen, Jina, and some ranking embedders...

And one nice option from LM Studio (not open source):
among other things, they have a User / Power User / Developer switch (a button at the bottom; the layout and the available settings change instantly).


kalle07 commented Mar 28, 2025

One option for the embedding mode: "chat" or "query", i.e. more free-form chat vs. forcing the model to keep all answers grounded in the doc!

And one option (a small button near the chat) to keep the snippets in VRAM after the first answer, or delete them.

It should be set to delete by default when I ask a new question afterwards...
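The keep-or-delete toggle described above is really a cache policy. A real implementation would free GPU memory; this stand-in (all names hypothetical) only models the policy of dropping snippets on each new question unless the user pins them:

```typescript
// Sketch of the "keep snippets in VRAM or delete them" toggle described above.
// Illustrative only: a real implementation would release GPU buffers here.
class SnippetCache {
  private snippets: string[] = [];
  keepAcrossQuestions = false; // the "small button near the chat"; off by default

  store(snippets: string[]): void {
    this.snippets = snippets;
  }

  // Called whenever the user asks a new question.
  onNewQuestion(): void {
    if (!this.keepAcrossQuestions) this.snippets = []; // delete by default
  }

  current(): string[] {
    return this.snippets;
  }
}
```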


kalle07 commented Apr 11, 2025

Maybe as a next step, an option to send prefixes to the embedder:
https://www.youtube.com/watch?v=76EIC_RaDNw
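For context on the prefix idea: some embedders, nomic-embed-text among them, expect a task prefix on every input, so indexed passages are embedded as `search_document: ...` and queries as `search_query: ...`. A minimal helper (hypothetical name) could look like:

```typescript
// Task prefixes for embedders that require them (e.g. nomic-embed-text
// distinguishes indexed passages from queries via a leading instruction).
type EmbedTask = "search_document" | "search_query";

function withTaskPrefix(text: string, task: EmbedTask): string {
  return `${task}: ${text}`;
}
```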

And re-ranker models (though I don't know what the difference is). I have a collection here:
https://huggingface.co/kalle07/embedder_collection
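On the difference asked about above: an embedder (bi-encoder) encodes the query and each document independently, so document vectors can be precomputed and searched quickly; a re-ranker (cross-encoder) reads the query and one candidate together and scores the pair, which is slower but more accurate, so it is typically applied only to a small shortlist from the first stage. A toy sketch of that two-stage pipeline, with a word-overlap score standing in for both real models:

```typescript
// Two-stage retrieval sketch: a fast first stage shortlists candidates, then a
// (mocked) re-ranker rescores query+candidate pairs. Both scorers here are
// toy stand-ins for real models.
type Scorer = (query: string, doc: string) => number;

// Stage-1 stand-in: cheap word-overlap count (a real system uses vector search).
const overlapScore: Scorer = (q, d) => {
  const qs = new Set(q.toLowerCase().split(/\s+/));
  return d.toLowerCase().split(/\s+/).filter(w => qs.has(w)).length;
};

function rerank(
  query: string,
  docs: string[],
  shortlist: number,
  firstStage: Scorer,
  reranker: Scorer,
): string[] {
  const candidates = docs
    .map(d => ({ d, s: firstStage(query, d) }))
    .sort((a, b) => b.s - a.s)
    .slice(0, shortlist); // only the shortlist reaches the expensive re-ranker
  return candidates
    .map(({ d }) => ({ d, s: reranker(query, d) }))
    .sort((a, b) => b.s - a.s)
    .map(x => x.d);
}
```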

@louis-menlo louis-menlo pinned this issue Apr 12, 2025
@louis-menlo louis-menlo moved this to Todo in Jan Apr 12, 2025
@louis-menlo louis-menlo changed the title roadmap: Jan's to transition legacy RAG roadmap: Jan has threads based RAG Apr 21, 2025
@louis-menlo
Contributor

Thank you for sharing this, @kalle07. It's truly helpful.

@ramonpzg
Contributor

@louis-menlo -- I started working on a simple document parsing implementation with a bit of RAG using llamaindex.ts and LanceDB, on a separate branch based on the Tauri branch. This is mainly to let users add documents, images, videos, and audio, with everything kept in a db.lance file for storing and querying as needed.

@dan-menlo mentioned you have specific plans for implementing this via MCP. Can I confirm with you whether you'd like me to continue adding support for the things I mentioned, or drop this since it will be done in a different way?
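The db.lance storage described above boils down to a table of chunk rows (id, text, vector) queried by nearest vector. As a sketch of that access pattern only, here is an in-memory stand-in; the actual branch uses LanceDB via llamaindex.ts, and none of these names are its real API:

```typescript
// In-memory stand-in for a LanceDB-style table of embedded document chunks:
// one row per chunk, queried by vector similarity. Illustrative names only.
interface Row { id: string; text: string; vector: number[]; }

class ChunkTable {
  private rows: Row[] = [];

  add(row: Row): void {
    this.rows.push(row);
  }

  // Return the k rows nearest to the query vector (squared L2 distance).
  query(vector: number[], k: number): Row[] {
    const dist = (a: number[], b: number[]) =>
      a.reduce((s, x, i) => s + (x - b[i]) ** 2, 0);
    return [...this.rows]
      .sort((r1, r2) => dist(r1.vector, vector) - dist(r2.vector, vector))
      .slice(0, k);
  }
}
```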

@menloresearch menloresearch deleted a comment from david-menloai Apr 22, 2025
@louis-menlo
Copy link
Contributor

cc @ramonpzg, who will likely drive this.
