Advanced RAG, Video by Donato Capitella on Self Query, Parent Document Retrieval and HyDE #10

benWallace57 · 2024-02-07T11:05:55Z

benWallace57
Feb 7, 2024
Collaborator

https://www.youtube.com/watch?v=dCEMod64dko&ab_channel=DonatoCapitella

Often technologies are strong in some contexts and weak in others.
I like to frame these techniques by the strength that they are utilising, and also by the weakness they are mitigating.

RAG -> LLMs are strong when using the context window
Example: ChatGPT is stronger when you ask it follow-up questions.
The weakness of the context is its window limit.
Solution -> put the best things in the context.
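As a rough illustration of "put the best things in the context", here is a minimal sketch that greedily packs the highest-scoring chunks into a fixed budget. The function name `pack_context`, the scores, and the use of a word count in place of a real token count are all assumptions made for the example, not something from the video.

```python
def pack_context(scored_chunks, budget_words):
    """Greedily keep the highest-scoring chunks that still fit the budget."""
    chosen, used = [], 0
    for score, text in sorted(scored_chunks, key=lambda pair: pair[0], reverse=True):
        cost = len(text.split())  # assumption: words stand in for tokens
        if used + cost <= budget_words:
            chosen.append(text)
            used += cost
    return chosen


# Made-up relevance scores, as a retriever might return them.
chunks = [
    (0.91, "Barolo is made from the Nebbiolo grape."),
    (0.15, "Our shop opens at nine on weekdays."),
    (0.78, "Nebbiolo wines are high in tannin and acidity."),
]
context = pack_context(chunks, budget_words=16)
```

With a budget of 16 words the two wine chunks fit and the low-scoring shop-hours chunk is left out.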

Self Query -> Semantic search is strong for natural language
Semantic search is weak for discrete or structured data
Keyword search is strong in those contexts
Example: "Wine from 1980" is semantically similar to "Wine from 1940". But that's no good!
Solution -> Get an LLM to scan the query and determine whether keyword filters or limits should be used.
The LLM then builds a structured query that includes a filter wherever one would be useful.
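A minimal sketch of the self-query idea, under some loud assumptions: `extract_filter` is a hypothetical stand-in for the LLM step (here just a regular expression that looks for a year), and `embed` is a toy bag-of-words vector rather than a real embedding model. The point is only the shape of the flow: split the query into a semantic part and a structured filter, apply the filter exactly, then rank what remains by similarity.

```python
import re
from collections import Counter
from math import sqrt


def embed(text):
    # Toy bag-of-words "embedding"; a real system would use a learned model.
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def extract_filter(query):
    # Hypothetical stand-in for the LLM: pull a four-digit year out of the query.
    match = re.search(r"\b(1[89]\d\d|20\d\d)\b", query)
    if match is None:
        return query, {}
    remainder = " ".join(query.replace(match.group(0), " ").split())
    return remainder, {"year": int(match.group(0))}


def self_query_search(query, docs, k=2):
    text, filters = extract_filter(query)
    # Exact-match filter on metadata first, semantic ranking second.
    candidates = [d for d in docs
                  if all(d["meta"].get(field) == value for field, value in filters.items())]
    q = embed(text)
    return sorted(candidates, key=lambda d: cosine(q, embed(d["text"])), reverse=True)[:k]


docs = [
    {"text": "Red wine from Bordeaux", "meta": {"year": 1980}},
    {"text": "Red wine from Bordeaux", "meta": {"year": 1940}},
    {"text": "White wine from Alsace", "meta": {"year": 1980}},
]
results = self_query_search("red wine from 1980", docs)
```

The 1940 bottle has exactly the same text as the 1980 one, so similarity alone could not separate them; the metadata filter is what removes it.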

Parent document retrieval -> Semantic search and vectorisation is strong when done on small chunks
Context gets weak when a document is cut into small chunks.
Example: A small chunk may contain a counterexample from a document that, as a whole, argues the opposite point. Read on its own, the chunk misrepresents the document.
Solution -> Have parent and child chunks. Search on the children, retrieve the parents.
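A minimal sketch of "search on the children, retrieve the parents", assuming a plain dictionary of parent texts and a toy word-overlap score in place of vector similarity. The names `build_index` and `retrieve_parents` are made up for the example and are not LangChain's API.

```python
import re


def words(text):
    return set(re.findall(r"[a-z]+", text.lower()))


def build_index(parents, child_size=8):
    """Split each parent into small child chunks that remember their parent id."""
    index = []
    for parent_id, text in parents.items():
        tokens = text.split()
        for start in range(0, len(tokens), child_size):
            index.append((parent_id, " ".join(tokens[start:start + child_size])))
    return index


def retrieve_parents(query, index, parents, k=1):
    # Toy relevance score: number of words shared by the query and the child chunk.
    q = words(query)
    ranked = sorted(index, key=lambda item: len(q & words(item[1])), reverse=True)
    parent_ids = []
    for parent_id, _child in ranked:
        if parent_id not in parent_ids:
            parent_ids.append(parent_id)
        if len(parent_ids) == k:
            break
    return [parents[pid] for pid in parent_ids]


parents = {
    "essay": ("Some critics claim oak ageing always improves wine. This essay argues "
              "the opposite: heavy oak often masks the fruit and shortens a wine's life."),
    "hours": "The shop opens at nine on weekdays and closes at six.",
}
index = build_index(parents)
result = retrieve_parents("does oak ageing improve wine", index, parents)
```

The best-matching child here is the opening sentence, which on its own states the view the essay goes on to reject. Returning the parent gives the LLM the whole argument.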

HyDE -> Semantic search is weak when the question is not in the same form as the documents in the database
Example: Query is in the form of questions but documents are in the form of reviews.
Solution -> Have an LLM generate hypothetical documents from the query, then run the vectorised semantic search with these hypothetical documents to retrieve real ones.
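A minimal sketch of the HyDE flow, with the same caveats as before: `fake_llm_hypothetical_answer` is a hypothetical stand-in that returns canned text instead of calling a model, and `embed` is a toy bag-of-words vector. What it shows is that the search is run with the generated review-shaped text, not with the question itself.

```python
import re
from collections import Counter
from math import sqrt


def embed(text):
    # Toy bag-of-words "embedding"; a real system would use a learned model.
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def fake_llm_hypothetical_answer(question):
    # Hypothetical stand-in for an LLM call; the canned text imitates a review.
    return "The battery lasted two full days on a single charge and I loved it."


def hyde_search(question, documents, generate=fake_llm_hypothetical_answer):
    hypothetical = embed(generate(question))  # embed the made-up document
    return max(documents, key=lambda d: cosine(hypothetical, embed(d)))


reviews = [
    "Battery easily lasted two days on one charge, great phone.",
    "The screen cracked after a week, is that normal?",
]
question = "Is the battery life good?"
best = hyde_search(question, reviews)
# For comparison: searching with the raw question instead of the hypothetical review.
direct = max(reviews, key=lambda d: cosine(embed(question), embed(d)))
```

With this toy embedding, `best` is the battery review, while `direct` lands on the screen review because it shares more question-like words with the query. A real embedding model would behave less crudely, but the mismatch in form is the same problem.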

SamHollings · 2024-02-07T11:25:22Z

SamHollings
Feb 7, 2024
Maintainer

Ah yes, on parent document retrieval: if you save the records in the database "hierarchically", then when a search finds a chunk it can return a larger section of the document, e.g. the whole section, or even the whole document. There is also the concept of chunk overlap, where neighbouring chunks share some text so that each one carries more context.

Here is the LangChain documentation on "parent document retrieval": https://python.langchain.com/docs/modules/data_connection/retrievers/parent_document_retriever
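To make the chunk overlap idea concrete, here is a minimal sketch that splits a list of tokens into fixed-size chunks where each chunk repeats the last few tokens of the previous one. The function name and parameters are assumptions for the example, not LangChain's text splitter API.

```python
def chunk_with_overlap(tokens, size, overlap):
    """Split tokens into chunks of `size`, each sharing `overlap` tokens with the previous one."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be non-negative and smaller than size")
    step = size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # the last chunk already reaches the end of the text
    return chunks


tokens = "one two three four five six seven eight".split()
chunks = chunk_with_overlap(tokens, size=4, overlap=2)
```

Here each chunk starts two tokens after the previous one, so "three four" appears at the end of the first chunk and at the start of the second.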

1 reply

benWallace57 Feb 7, 2024
Collaborator Author

Or I suppose level 2 parent document retrieval would be some kind of graph-based document retrieval.
Given a relevant small chunk, return all the chunks connected to it in the graph.

A quick Google search found this: https://www.nebula-graph.io/posts/graph-RAG
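A rough sketch of what "given a relevant small chunk, return the connected chunks" could look like, assuming the links between chunks are already stored as an adjacency dictionary. How those links get built is not covered here, and this is not the method from the linked article; the chunk ids and the `max_hops` limit are made up for the example.

```python
from collections import deque


def expand_from_chunk(graph, start, max_hops=1):
    """Breadth-first walk: the start chunk plus every chunk within max_hops links."""
    depth = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if depth[node] == max_hops:
            continue  # do not walk further out than max_hops
        for neighbour in graph.get(node, ()):
            if neighbour not in depth:
                depth[neighbour] = depth[node] + 1
                queue.append(neighbour)
    return list(depth)


# Hypothetical chunk graph: keys are chunk ids, values are linked chunk ids.
graph = {
    "c1": ["c2", "c3"],
    "c2": ["c1", "c4"],
    "c3": ["c1"],
    "c4": ["c2"],
    "c5": [],
}
one_hop = expand_from_chunk(graph, "c2", max_hops=1)
two_hops = expand_from_chunk(graph, "c2", max_hops=2)
```

Limiting the number of hops matters for the same reason as in plain RAG: everything returned still has to fit in the context window.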

SamHollings · 2024-02-07T12:04:52Z

SamHollings
Feb 7, 2024
Maintainer

This video is really nice - the earlier part about the limitations of semantic search made me think about some stuff I'd not considered before!
