Semantic Search Engine for DocuSign Hackathon - Agreement Trap

A Streamlit-based application that lets users import documents from DocuSign or from other sources such as local files, build their own vector database from them, and run semantic search across the uploaded documents using a Hugging Face sentence-embedding model and the Gemini model.

YouTube Demo

Setup and Installation

Clone the repository

Create a virtual environment:

```
python -m venv venv
source venv/bin/activate  # or venv\Scripts\activate on Windows
```

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables in a .env file (the repository includes an example.env)
Run the application:
```
streamlit run app.py
```
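This README does not list the variables the app expects, so the following is only a minimal sketch of how a .env file could be read at startup with the standard library. The key names used in the usage note (PINECONE_API_KEY, GEMINI_API_KEY) are hypothetical placeholders; check example.env in the repository for the real ones.

```python
import os
from typing import Dict


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines, # comments, and malformed lines."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Remove one layer of matching single or double quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def load_env(path: str = ".env") -> Dict[str, str]:
    """Read the file and copy its values into os.environ without overwriting."""
    with open(path, encoding="utf-8") as handle:
        values = parse_env(handle.read())
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values
```

Calling `load_env()` once before the services are created would make a line such as `PINECONE_API_KEY=...` or `GEMINI_API_KEY=...` available through `os.environ`. Values already set in the shell take precedence because of `setdefault`.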

Service Architecture

┌─────────────────┐     ┌──────────────┐     ┌────────────────┐
│  Streamlit UI   │────▶│  DocuSign    │────▶│  Document      │
└─────────────────┘     │  /local files│     │  Processing    │
         │              └──────────────┘     └────────────────┘
         │                                           │
         ▼                                           ▼
┌─────────────────┐     ┌──────────────┐     ┌────────────────┐
│  Vector Store   │◀────│  Embedding   │◀────│  Text          │
│  (Pinecone)     │     │  Service     │     │  Extraction    │
└─────────────────┘     └──────────────┘     └────────────────┘
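The indexing path in the diagram (text extraction, then embedding, then the vector store) could be wired together roughly as follows. This is a sketch under assumptions, not the repository's code: `index_document`, the paragraph-based splitting, and the dictionary standing in for the vector store are all hypothetical, and each stage is passed in as a plain callable so it can be swapped out.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Vector = List[float]


def index_document(
    doc_id: str,
    source: str,
    extract: Callable[[str], str],
    embed: Callable[[Sequence[str]], List[Vector]],
    store: Dict[str, Tuple[Vector, str]],
) -> int:
    """Run one document through extract -> embed -> store.

    Returns the number of records written to the store.
    """
    text = extract(source)
    # Assumption: one record per non-empty paragraph of the extracted text.
    passages = [p.strip() for p in text.split("\n\n") if p.strip()]
    if not passages:
        return 0
    vectors = embed(passages)
    for i, (vector, passage) in enumerate(zip(vectors, passages)):
        # Keep the passage text next to its vector so results can be displayed.
        store[f"{doc_id}-{i}"] = (vector, passage)
    return len(passages)
```

Keeping the stages as injected callables means the same function works whether the text came from DocuSign or from a local file, and the stages can be replaced with fakes in tests.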

Technology Stack

Machine Learning Models

Sentence Transformers: Using all-MiniLM-L6-v2 for generating document embeddings
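As an illustration, embeddings from all-MiniLM-L6-v2 can be generated locally with the sentence-transformers package. This is a sketch, assuming local inference; the app may instead call the model through the HuggingFace Inference API listed under Integration. The helper names `embed_texts` and `has_expected_shape` are hypothetical.

```python
from typing import List, Sequence

MODEL_NAME = "sentence-transformers/all-MiniLM-L6-v2"
EMBEDDING_DIM = 384  # vector size produced by all-MiniLM-L6-v2


def embed_texts(texts: Sequence[str]) -> List[List[float]]:
    """Encode texts locally; requires `pip install sentence-transformers`."""
    # Imported lazily so this module still loads where the package is absent.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(MODEL_NAME)
    # normalize_embeddings=True returns unit-length vectors, so the dot product
    # of two embeddings equals their cosine similarity.
    vectors = model.encode(list(texts), normalize_embeddings=True)
    return vectors.tolist()


def has_expected_shape(vectors: Sequence[Sequence[float]]) -> bool:
    """Guard to run before writing vectors to an index of fixed dimension."""
    return all(len(v) == EMBEDDING_DIM for v in vectors)
```

The 384-dimension figure matters when the vector index is created, because the index dimension has to match the model output. In a real app the model would be loaded once and reused rather than constructed on every call.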

Vector Database

Pinecone: Vector similarity search and storage
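To make "vector similarity search" concrete, here is a small in-memory stand-in for what a vector store does at query time. It is not the Pinecone client API, and cosine similarity is an assumption, since this README does not state which metric the index uses. The names `cosine` and `top_k` are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(
    query: Sequence[float],
    index: Dict[str, Sequence[float]],
    k: int = 3,
) -> List[Tuple[str, float]]:
    """Return the k (id, score) pairs most similar to the query, best first."""
    scored = [(doc_id, cosine(query, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]
```

A hosted vector database performs the same ranking with approximate nearest-neighbor indexes, so that it does not have to compare the query against every stored vector.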

AI Model

Gemini: AI model that generates readable, natural-language responses to user queries
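One plausible way to use a generative model here is to hand it the passages returned by the semantic search along with the user's question. The sketch below only builds such a prompt and makes no API call. Whether the app actually grounds Gemini in retrieved passages is an assumption, and `build_prompt` and its wording are hypothetical.

```python
from typing import List, Sequence


def build_prompt(question: str, passages: Sequence[str], max_chars: int = 4000) -> str:
    """Combine retrieved passages and the question into a single prompt string."""
    context_lines: List[str] = []
    used = 0
    for number, passage in enumerate(passages, start=1):
        entry = f"[{number}] {passage.strip()}"
        # Stop adding passages once the context budget would be exceeded.
        if used + len(entry) > max_chars:
            break
        context_lines.append(entry)
        used += len(entry)
    context = "\n".join(context_lines) if context_lines else "(no passages found)"
    return (
        "Answer the question using only the passages below. "
        "Cite passage numbers, and say so if the answer is not present.\n\n"
        f"Passages:\n{context}\n\n"
        f"Question: {question.strip()}"
    )
```

The returned string would then be sent to the model. Numbering the passages lets the answer point back to the documents it drew from.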

Document Processing

PyMuPDF (fitz)
PyPDF2
python-docx
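A sketch of how these three libraries might be combined to extract text by file type. The dispatch table, the function names, and the use of PyPDF2 as a fallback PDF reader are assumptions for illustration; this README does not say how the app divides the work between them.

```python
from pathlib import Path
from typing import Callable, Dict


def extract_pdf(path: str) -> str:
    """Extract text from every page with PyMuPDF."""
    import fitz  # PyMuPDF

    with fitz.open(path) as doc:
        return "\n".join(page.get_text() for page in doc)


def extract_pdf_fallback(path: str) -> str:
    """Alternative PDF reader using PyPDF2, for files PyMuPDF cannot handle."""
    from PyPDF2 import PdfReader

    reader = PdfReader(path)
    # extract_text() can return None for pages without a text layer.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def extract_docx(path: str) -> str:
    """Join the paragraph texts of a Word document with python-docx."""
    import docx  # python-docx

    return "\n".join(p.text for p in docx.Document(path).paragraphs)


EXTRACTORS: Dict[str, Callable[[str], str]] = {
    ".pdf": extract_pdf,
    ".docx": extract_docx,
}


def pick_extractor(path: str) -> Callable[[str], str]:
    """Choose an extractor from the file extension (case-insensitive)."""
    suffix = Path(path).suffix.lower()
    if suffix not in EXTRACTORS:
        raise ValueError(f"Unsupported file type: {suffix or '(none)'}")
    return EXTRACTORS[suffix]
```

Typical use would be `text = pick_extractor(path)(path)`. The library imports sit inside the functions so that a missing optional dependency only fails when that file type is actually processed.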

Integration

DocuSign eSignature API
Streamlit for the user interface
HuggingFace Inference API

I wrote all the code from scratch for this hackathon, with the help of AI tools.

For more information, drop me a message on LinkedIn.

#Docusign #huggingFace #gemini #semantic_search #streamlit
