Corrective RAG ✅

Corrective-RAG (CRAG) is a strategy for Retrieval-Augmented Generation (RAG) that includes self-reflection and self-grading of retrieved documents. Here’s a simplified breakdown of the steps involved:

Relevance Check: If at least one document meets the relevance threshold, the process moves forward to the generation phase.
Knowledge Refinement: Before generating an answer, the process refines the knowledge by dividing the document into smaller segments called "knowledge strips".
Grading and Filtering: Each "knowledge strip" is graded, and irrelevant ones are filtered out.
Additional Data Source: If all documents are below the relevance threshold, or if the system is unsure about their relevance, it will seek additional information by performing a web search to supplement the retrieved data.

Above steps are mentioned inOfficial Paper

Corrective Retrieval-Augmented Generation (CRAG) is a method that works like abuilt-in fact-checker.

Official Implementation

Here’s a code snippet for defining a table with theEmbedding API, and retrieves the relevant documents:

importpandasaspdimportlancedbfromlancedb.pydanticimportLanceModel,Vectorfromlancedb.embeddingsimportget_registrydb=lancedb.connect("/tmp/db")model=get_registry().get("sentence-transformers").create(name="BAAI/bge-small-en-v1.5",device="cpu")classDocs(LanceModel):text:str=model.SourceField()vector:Vector(model.ndims())=model.VectorField()table=db.create_table("docs",schema=Docs)# considering chunks are in list formatdf=pd.DataFrame({'text':chunks})table.add(data=df)# as per document feededquery="How Transformers work?"actual=table.search(query).limit(1).to_list()[0]print(actual.text)

Code snippet for grading retrieved documents, filtering out irrelevant ones, and performing a web search if necessary:

defgrade_documents(state):"""        Determines whether the retrieved documents are relevant to the question        Args:            state (dict): The current graph state        Returns:            state (dict): Updates documents key with relevant documents        """state_dict=state["keys"]question=state_dict["question"]documents=state_dict["documents"]classgrade(BaseModel):"""            Binary score for relevance check        """binary_score:str=Field(description="Relevance score 'yes' or 'no'")model=ChatOpenAI(temperature=0,model="gpt-4-0125-preview",streaming=True)# grading using openaigrade_tool_oai=convert_to_openai_tool(grade)llm_with_tool=model.bind(tools=[convert_to_openai_tool(grade_tool_oai)],tool_choice={"type":"function","function":{"name":"grade"}},)parser_tool=PydanticToolsParser(tools=[grade])prompt=PromptTemplate(template="""You are a grader assessing relevance of a retrieved document to a user question.\n        Here is the retrieved document:\n\n{context}\n\n        Here is the user question:{question}\n        If the document contains keyword(s) or semantic meaning related to the user question, grade it as relevant.\n        Give a binary score 'yes' or 'no' score to indicate whether the document is relevant to the question.""",input_variables=["context","question"],)chain=prompt|llm_with_tool|parser_toolfiltered_docs=[]search="No"fordindocuments:score=chain.invoke({"question":question,"context":d.page_content})grade=score[0].binary_scoreifgrade=="yes":filtered_docs.append(d)else:search="Yes"continuereturn{"keys":{"documents":filtered_docs,"question":question,"run_web_search":search,}}

Check Colab for the Implementation of CRAG with Langgraph:

Movatterモバイル変換

Corrective RAG ✅