hok@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 6 days ago

How do I get started with RAG (ideally with llama.cpp)?

1

12

How do I get started with RAG (ideally with llama.cpp)?

hok@lemmy.dbzer0.com to

LocalLLaMA@sh.itjust.worksEnglish · 6 days ago

1

I would like my model to know the code libraries I use and help me write code with them. I use llama.cpp’s server and web UI for inference, but I have no clue how to get started with RAG, since it seems it is not natively supported with llama.cpp’s server implementation. It almost looks like I would need to code my own agent.

I am not interested in commercial offerings or APIs. If you use RAG, how do you do it?

Chat

Sandbar_Trekker@lemmy.today
link
fedilink
English
arrow-up
4·
6 days ago
You can use something like Anything LLM for RAG:

https://github.com/Mintplex-Labs/anything-llm

It works with local models.

https://docs.anythingllm.com/agent/usage#what-is-rag-search-and-how-to-use-it

LocalLLaMA@sh.itjust.works

localllama@sh.itjust.works

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@sh.itjust.works

Welcome to LocalLLaMA! Here we discuss running and developing machine learning models at home. Lets explore cutting edge open source neural network technology together.

Get support from the community! Ask questions, share prompts, discuss benchmarks, get hyped at the latest and greatest model releases! Enjoy talking about our awesome hobby.

As ambassadors of the self-hosting machine learning community, we strive to support each other and share our enthusiasm in a positive constructive way.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

21 users / day
134 users / week
496 users / month
782 users / 6 months
1 local subscriber
2.9K subscribers
307 Posts
1.17K Comments
Modlog