Skip to content

JustJordanT/OCR-Document-Analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

TypeScript AI Document Analyzer and Organizer

Project Overview

A TypeScript application that uses Azure OCR to scan, analyze, and organize documents based on their content.

Key Features

  1. Document upload
  2. OCR processing using Azure Computer Vision API
  3. Content analysis and categorization
  4. Metadata extraction
  5. Automated document organization
  6. Search functionality
  7. User-friendly interface

Technical Components

  • C# programming
  • Azure Cognitive Services integration
  • Text processing and analysis
  • File management system
  • User interface development

Benefits

  • Efficient document management
  • Quick information retrieval
  • Practical experience with Azure services and C# development

Potential Extensions

  • Multi-language support
  • Integration with cloud storage services
  • Machine learning for improved categorization

Application Flow

graph TD
    A[Start] --> B[User Uploads Document]
    B --> C[Azure OCR Processing]
    C --> D[Text Extraction]
    D --> E{Document Type Analysis}
    E -->|Invoice| F1[Extract Invoice Data]
    E -->|Receipt| F2[Extract Receipt Data]
    E -->|Contract| F3[Extract Contract Data]
    E -->|Other| F4[Extract General Metadata]
    F1 --> G[Categorize and Tag Document]
    F2 --> G
    F3 --> G
    F4 --> G
    G --> H[Store Document and Metadata]
    H --> I[Update Search Index]
    I --> J[Display in User Interface]
    J --> K[End]
    
    L[User Initiates Search] --> M[Query Search Index]
    M --> N[Retrieve Matching Documents]
    N --> O[Display Search Results]
    O --> K
Loading

About

TypeScript Azure OCR Document Analyzer

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published