This repository is a Matatika Example Workspace that uses an OpenAI Generative Pre-trained Transformer (GPT) model to answer questions.
Matatika loads the document vectors for the published datasets into Elasticsearch and uses the Matatika questions
API to generate answers.
Get started using either Matatika Cloud or locally with Matatika Community Edition.
- Create a new workspace from an existing repository, using this workspace as the template
- Run the "Load local documents" pipeline to load all documents from sample_documents
- Prerequisite - you will need a folder in Google Drive for this pipeline
- Prerequisite - with Matatika Community Edition you will need to configure your Google OAuth credentials
- Create a new workspace from an existing repository, using this workspace as the template
- Provide configuration for the "Load remote documents" pipeline
- Under the gdrive section, log in with your Google account and authorize Matatika to access Google Drive
- Select the folder with your documents to index
- Save the pipeline
- Run the pipeline
Once the pipeline run completes, a dataset will have been created for each loaded document, with a description containing the parsed document content and tags representing the source file path.
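To make the document-to-dataset mapping concrete, here is a minimal sketch of the shape such a record might take. It is only an illustration: the `DatasetSketch` class, the `document_to_dataset` function, the file name `guide.txt`, and the choice to turn each path component into a tag are all assumptions made for this example, not Matatika's actual schema or pipeline code.

```python
from dataclasses import dataclass, field
from pathlib import PurePosixPath
from typing import List


@dataclass
class DatasetSketch:
    """Hypothetical stand-in for a published dataset (not Matatika's schema)."""
    title: str
    description: str
    tags: List[str] = field(default_factory=list)


def document_to_dataset(path: str, content: str) -> DatasetSketch:
    """Build one dataset-like record from one parsed document.

    Assumption: each component of the source path becomes a tag.
    """
    source = PurePosixPath(path)
    return DatasetSketch(
        title=source.name,         # file name used as the title
        description=content,       # parsed document content
        tags=list(source.parts),   # path components used as tags
    )


# "guide.txt" is a made-up file name used only for illustration.
example = document_to_dataset("sample_documents/guide.txt", "Parsed text of the guide.")
print(example.tags)  # ['sample_documents', 'guide.txt']
```

The real datasets are created by the pipeline itself; this sketch is only meant to show what to expect when browsing them after a run.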
In addition to indexing the published documents for search, this workspace has the questions feature enabled (see workspace.yml -> FEATURE_QUESTIONS).
Once in the Matatika App or Lab, you can simply research topics and generate answers.
Look out for the big '+' button on any page.