
## Introduction

Large language models have two main sources of information.

* Parametric knowledge: This is all the information that the model absorbed during training. LLMs are generally good at using this knowledge to synthesise language, but bad at retrieving specific facts from within it.

* Context window: This is where you write your prompt. In the case of chat-based LLMs like ChatGPT, your previous conversation is also part of the context window. LLMs are generally much better at retrieving facts that are given in the context window.

```mermaid
graph LR;
A[Query / Context Window] --> H{LLM Generation};
AA[(Parametric Knowledge)] --> H
H --> I[Answer];
```



## Basic RAG

RAG (retrieval-augmented generation) introduces a third source of information: a database.
A RAG pipeline retrieves the information most relevant to the query from the database, then puts it into the context window.

There are advantages to storing and retrieving knowledge in this way.

* Easy to keep up to date (parametric knowledge is only as new as the most recent training run)
* Easy to delete (forgetting parametric knowledge is hard)
* LLMs can quote sources for their generated content
* LLMs can say "I don't know" when they can't find a relevant fact to support an answer.

```mermaid
graph LR;
AA[(Vectorised Database)] --> B;
A[Query] --> B{Retrieval <br> with semantic search};
B -->C[Retrieved Content]--> E;
A-->E
E{Augmentation <br> Prompt Stuffing} -->F[Augmented Prompt / Context Window]--> H{LLM Generation};
AB[(Parametric Knowledge)] --> H
H --> I[Answer];
```
In basic RAG, retrieval is generally done using semantic search:
the documents in the database and the query are vectorised using a semantic embedder, which encodes the meaning of the text rather than its exact wording.

Because semantic search matches on meaning rather than on specific words, it can retrieve relevant content even when that content is phrased differently from the query, making retrieval more robust than plain keyword matching.
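As a minimal sketch of this pipeline, assuming the open-source `sentence-transformers` library (the model name is just one common choice) and leaving the final LLM call as a hypothetical `generate` function:

```python
# pip install sentence-transformers numpy
import numpy as np
from sentence_transformers import SentenceTransformer

# A toy "database" of documents to search over.
documents = [
    "The Eiffel Tower was completed in 1889.",
    "Python was created by Guido van Rossum.",
    "New Zealand produces Pinot Noir and Sauvignon Blanc.",
]

embedder = SentenceTransformer("all-MiniLM-L6-v2")
doc_vectors = embedder.encode(documents, normalize_embeddings=True)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Semantic search: rank documents by cosine similarity to the query."""
    query_vector = embedder.encode([query], normalize_embeddings=True)[0]
    scores = doc_vectors @ query_vector  # cosine similarity (vectors are normalised)
    top_k = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top_k]

def augment(query: str, retrieved: list[str]) -> str:
    """Prompt stuffing: put the retrieved content into the context window."""
    context = "\n".join(f"- {doc}" for doc in retrieved)
    return (
        "Answer using only the context below. If the context is not "
        'relevant, say "I don\'t know".\n\n'
        f"Context:\n{context}\n\nQuestion: {query}"
    )

query = "Who invented Python?"
prompt = augment(query, retrieve(query))
# answer = generate(prompt)  # hypothetical call to an LLM
```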




## Advanced RAG

The basic RAG recipe has been extended with numerous techniques that improve the quality of the generated output.

Many of these techniques target the retrieval step specifically, and many of the underlying ideas are not new: optimising search results is a well-established and well-documented problem.

### Retrieval

There are situations where keyword or numerical filters are more effective than semantic search.
When you use an online shop, you'll often describe the item you want in natural language, but then use filters to narrow down the search space.
Self-querying retrieval uses an LLM to extract keywords and filters from the query itself, which are then used to narrow down the database (see the sketch after the diagram below).

Self-querying retrieval:


```mermaid
graph TD;
A[Query <br> Find me a wine from New Zealand that goes well with red meats. It should be from before 1970] --> B[New Zealand]
A--> C[<1970]
A--> D[goes well with red meats]
C-->E[SELECT * FROM products
WHERE location = 'New Zealand'
AND year < 1970]
B-->E
E--Filter-->F[(Database)]
D--Semantic Search-->F
F--> G[Retrieved content]
```
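A minimal sketch of the idea, assuming a hypothetical `llm_complete` function that returns the model's response as text; the filter-extraction prompt and schema are illustrative, not a specific library's API:

```python
import json

def llm_complete(prompt: str) -> str:
    """Hypothetical LLM call; replace with your provider's API."""
    raise NotImplementedError

FILTER_PROMPT = """Extract structured filters from the user's query.
Respond with JSON: {{"filters": {{...}}, "semantic_query": "..."}}.
Allowed filter fields: location (string), max_year (integer).

Query: {query}"""

def self_query_retrieve(query: str, database: list[dict]) -> list[dict]:
    # 1. Ask the LLM to split the query into filters + a semantic part.
    parsed = json.loads(llm_complete(FILTER_PROMPT.format(query=query)))
    filters = parsed["filters"]                # e.g. {"location": "New Zealand", "max_year": 1970}
    semantic_query = parsed["semantic_query"]  # e.g. "goes well with red meats"

    # 2. Apply the structured filters first (SQL-style narrowing).
    candidates = [
        row for row in database
        if row["location"] == filters.get("location", row["location"])
        and row["year"] < filters.get("max_year", float("inf"))
    ]

    # 3. Run semantic search over the remaining candidates only
    #    (hypothetical helper: embedding search as in the Basic RAG sketch).
    return semantic_search(semantic_query, candidates)
```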

HyDE (Hypothetical Document Embeddings) approaches retrieval from the other direction: instead of embedding the query directly, an LLM first writes a hypothetical answer to the query, and that hypothetical document is embedded and used for the semantic search. The hypothetical answer is usually closer in wording and style to the real documents than the short query is, which can improve retrieval.

```mermaid
graph LR;
A[Query] --> B{LLM};
B --> C[Hypothetical Answer];
C --Semantic Search--> D[(Vectorised Database)];
D --> E[Retrieved Content];
```
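A minimal sketch, reusing `embedder`, `doc_vectors`, `documents`, and `np` from the Basic RAG example and the same hypothetical `llm_complete` function:

```python
def hyde_retrieve(query: str, k: int = 2) -> list[str]:
    """HyDE: search with an LLM-written hypothetical answer instead of the raw query."""
    hypothetical = llm_complete(
        f"Write a short passage that answers the question:\n{query}"
    )
    # Embed the hypothetical answer and search exactly as in basic RAG.
    vector = embedder.encode([hypothetical], normalize_embeddings=True)[0]
    scores = doc_vectors @ vector
    top_k = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top_k]
```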

### Augmentation

Re-ranking: semantic search is fast but approximate, so a common pattern is to over-retrieve (say, the top 25 candidates) and then re-score each (query, document) pair with a slower, more accurate model such as a cross-encoder, keeping only the best few for the augmented prompt.
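A minimal sketch, assuming the `sentence-transformers` cross-encoder class (the model name is one common public checkpoint):

```python
from sentence_transformers import CrossEncoder

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank(query: str, retrieved: list[str], keep: int = 3) -> list[str]:
    """Re-score each (query, document) pair with a cross-encoder and keep the best."""
    scores = reranker.predict([(query, doc) for doc in retrieved])
    ranked = sorted(zip(retrieved, scores), key=lambda pair: pair[1], reverse=True)
    return [doc for doc, _ in ranked[:keep]]
```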

### Generation

Fine-tuning: the generator LLM can be fine-tuned on examples of answering questions from retrieved context, training it to ground its answers in the provided documents, cite them, and decline to answer when the context does not contain a relevant fact.
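A training record for this kind of fine-tuning might pair an augmented prompt with a grounded answer; the record format below is illustrative, not a specific framework's:

```python
# Hypothetical fine-tuning record: the prompt includes retrieved context,
# and the completion is an answer grounded in that context.
training_example = {
    "prompt": (
        "Answer using only the context below.\n\n"
        "Context:\n- Python was created by Guido van Rossum.\n\n"
        "Question: Who invented Python?"
    ),
    "completion": "Python was created by Guido van Rossum.",
}
```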

## Modular RAG

## Agents
