main code:

def ollama_llm(question, context): formatted_prompt = f""" Context: {context}

Convert units for consistency. 
Extract and format information about all enzyme-substrate pair mentioned in the context, following this structure:

Enzyme name：[Enzyme name]
EC Number: [EC Number] OR N/A
Organism: [Organism Name] OR N/A
Substrate: [Substrate Name] OR N/A
Type: [Wild-type OR Mutant (Specify Mutation)]
Protein Identifier: [UniProt ID OR NCBI ID]
Specific Activity: [Value] OR N/A
KM Value: [Value in mM] OR N/A
Kcat Value: [Value per second] OR N/A
kcat/KM: [Value in mM^-1s^-1] OR N/A
pI Value: [Value]
pH Optimum: [Value]
Temperature Optimum: [Value in Celsius]
Molecular Weight: [Value in kDa]
Reaction pH: [Value] OR N/A
Reaction Temperature: [Value in Celsius] OR N/A
Buffer Solution: [Buffer used in the assay] 

"""
response = llm.invoke(formatted_prompt)
return response

prompt_template = """ The following is an HTML table: {table_html} Please rebuild and fix this table . """ prompt = PromptTemplate(template=prompt_template, input_variables=["table_html"]) llm_chain = LLMChain(llm=llm, prompt=prompt)

def rag_chain(question, pdf_path):

Load PDF and split into pages

loader = PyPDFLoader(pdf_path)
pages = loader.load_and_split()
raw_pdf_elements = partition_pdf(
    filename=pdf_path,
    infer_table_structure=True,
    strategy='hi_res',
)
tables = [el for el in raw_pdf_elements if el.category == "Table"]
table_htmls = [el.metadata.text_as_html for el in tables]
table_texts = [llm_chain.run(table_html) for table_html in table_htmls]
embeddings = OllamaEmbeddings(model='snowflake-arctic-embed:latest')
text_splitter = SemanticChunker( embeddings)
texts = text_splitter.split_documents(pages)
text_docs = texts
documents = text_docs 
vectorstore = FAISS.from_documents(documents, embeddings)
text_docs = vectorstore.similarity_search(question, k=5)
text_context = "\n\n".join([doc.page_content for doc in text_docs])
context = text_context + "\n\n" + "\n\n".join(table_texts)

return ollama_llm(question, context)

output :

You're welcome! I'm here to help.

If you'd like me to assist with anything else, such as formatting the data for a specific purpose (e.g., creating a table), please let me know!

That's great to hear! I'm always here to help.

If you have another text that needs processing, go ahead and paste it in the chat window, and I'll do my best to extract and format the relevant information for you.

You're right, I didn't receive any new text. It was a pleasure assisting you with extracting and formatting enzyme information from the previous text. If you have any other texts that need processing or any questions in the future, feel free to reach out. Have a great day!<|eot_id|><|start_header_id|>assistant<|end_header_id|>

meta-llama / llama3

Why llama3 generate something strange ,when i build an rag use ollama with llama3 #144

Load PDF and split into pages