Curated Resource ( ? )

Create your Document ChatBot with GPT-3 and Langchain

my notes ( ? )

"build a chatbot capable of extracting any kind of information from a set of documents ... your own Document Assistant from scratch, using GPT-3 and Langchain"

Good explainer of using embeddings to get around token limits "using LangChain, which is an open-source library designed to simplify the utilization of LLMs with Chain of Thoughts". Detailed instructions are provided for the following steps:

  • "Documents content is extracted using the proper Python library ...
  • The text is divided into chunks... Each chunk is embedded into a vector, the same for the user’s question;
  • The most similar chunks to the question are picked;
  • The question and extracted text chunks are fed as input prompts into GPT-3 (fine-tuning with a custom prompt);
  • An answer is provided."

Because the prompt specifies that the question should only be based on the provided chunks, hallucinations are avoided. For the same reason the temperature is set to zero, making GPT choose "the next token with the highest probability based on the previous ones... reduces the likelihood of less plausible tokens being generated."

Read the Full Post

The above notes were curated from the full post

Related reading

More Stuff I Like

More Stuff tagged ai , guide , gpt-x , llm , langchain , embedding

See also: Digital Transformation , Innovation Strategy , Science&Technology , Large language models

Cookies disclaimer saves very few cookies onto your device: we need some to monitor site traffic using Google Analytics, while another protects you from a cross-site request forgeries. Nevertheless, you can disable the usage of cookies by changing the settings of your browser. By browsing our website without changing the browser settings, you grant us permission to store that information on your device. More details in our Privacy Policy.