Medical Bill Information Extractor

Project Overview

The Medical Bill Information Extractor is a project designed to automatically extract relevant information from medical bills using Optical Character Recognition (OCR) technology. This project leverages Google Tesseract, OpenCV2, and regular expressions to process and extract crucial data such as patient names, provider names, service codes, and payment amounts from input images of medical bills.

The extracted information is stored in a structured format (e.g., JSON) for further analysis and processing. This project aims to streamline the extraction process, significantly reducing the manual effort involved in handling medical bills.

Features

OCR Technology: Utilizes Google Tesseract for text recognition.
Image Processing: Employs OpenCV2 for image preprocessing to enhance OCR accuracy.
Data Extraction: Uses regular expressions to extract specific information from recognized text.
Structured Output: Stores extracted data in structured formats such as JSON.
Open Source: Available on GitHub for anyone to contribute and use.

Requirements

To run this project, you need the following libraries and tools installed:

opencv-python
Pillow
numpy
pdf2image
pytesseract
matplotlib

Installation

Clone the Repository:

git clone https://github.com/RepZ97/Medical-Bill-Information-Extraction.git
cd Medical-Bill-Information-Extraction

Set Up a Virtual Environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Required Libraries:
```
pip install -r requirements.txt
```
Install pdf2image:
```
pip install pdf2image
```
Install and Configure Tesseract OCR:
- Download and install Tesseract OCR from the following link: pytesseract.
- Make sure Tesseract is added to your system's PATH.

Contributing

Contributions are welcome! Please fork the repository and submit pull requests with your improvements.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
images		images
README.md		README.md
Sample_For_Assignment.pdf		Sample_For_Assignment.pdf
medical_bill_extract.ipynb		medical_bill_extract.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Medical Bill Information Extractor

Project Overview

Features

Requirements

Installation

Contributing

License

About

Releases

Packages

Languages

RepZ97/Medical-Bill-Information-Extraction

Folders and files

Latest commit

History

Repository files navigation

Medical Bill Information Extractor

Project Overview

Features

Requirements

Installation

Contributing

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages