Content
summary Summary

Large language models in the form of chatbots are essentially a new computer interface. PrivateGPT shows how this can be applied to your private data.

Large language models from companies like Microsoft or OpenAI can capture content in documents and make it "chattable". That is, they can have a conversation about the content, explain details or interpret statements, generate summaries, infer new content, and so on. This can help with research and understanding, and when it works reliably, it is a revolutionary new way to interact with computers and content.

But there's a catch: the chatbots from the big tech companies need to read your documents. For privacy reasons, you may want to avoid that. One possible alternative comes from the open-source movement: PrivateGPT, a local document chatbot.

PrivateGPT makes local files chattable

The open-source project enables chatbot conversations about your local files. You can add files to the system and have conversations about their contents without an internet connection. For example, you can analyze the content in a chatbot dialog while all the data is being processed locally. The software currently supports twelve file formats via LangChain, including PowerPoint, Word, PDF and HTML.

Ad
Ad

PrivateGPT uses GPT4ALL, a local chatbot trained on the Alpaca formula, which in turn is based on an LLaMA variant fine-tuned with 430,000 GPT 3.5 turbo outputs. Alternatively, other locally executable open-source language models such as Camel can be integrated.

Companies could use an application like PrivateGPT for internal knowledge management, customer service, or even to create communication templates from their data without giving third parties access to it.

All necessary files and installation instructions for PrivateGPT are available on Github. A video tutorial on the installation is available from Matthew Berman. In addition, PrivateGPT-App is a web application that serves as a visual interface to PrivateGPT.

Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Summary
  • Chatbots based on large language models can infer content in conversation, including content from their own documents.
  • This enables new forms of research and learning. For those who don't want to share their private documents with large corporations, PrivateGPT is a local open source alternative.
  • PrivateGPT uses a locally running open-source chatbot that can read documents and make them chat-ready - it doesn't even need an Internet connection.
Sources
Online journalist Matthias is the co-founder and publisher of THE DECODER. He believes that artificial intelligence will fundamentally change the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.