Skip to content
@huridocs

HURIDOCS

HURIDOCS equips human rights defenders with tools to mobilise information for justice and accountability.

Popular repositories Loading

  1. uwazi uwazi Public

    Uwazi is a web-based, open-source solution for building and sharing document collections

    TypeScript 254 81

  2. pdf-document-layout-analysis pdf-document-layout-analysis Public

    A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

    Python 244 31

  3. casebox casebox Public archive

    Forked from KETSE/casebox

    Casebox: Secure all your information and team communication in one place

    JavaScript 49 31

  4. pdf_paragraphs_extraction pdf_paragraphs_extraction Public

    Python 49 7

  5. OpenEvSys OpenEvSys Public archive

    OpenEvSys is free open source software designed for use by organisations who need a software tool to manage information on human rights violations

    PHP 30 20

  6. pdf-text-extraction pdf-text-extraction Public

    This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the under…

    Makefile 27 1

Repositories

Showing 10 of 34 repositories
  • trainable-entity-extractor Public

    Trainable Entity Extractor

    huridocs/trainable-entity-extractor’s past year of commit activity
    Python 0 Apache-2.0 0 0 7 Updated Feb 7, 2025
  • ML-Benchmarks Public

    Repository to store all the ML benchmarks

    huridocs/ML-Benchmarks’s past year of commit activity
    0 0 0 0 Updated Feb 7, 2025
  • uwazi Public

    Uwazi is a web-based, open-source solution for building and sharing document collections

    huridocs/uwazi’s past year of commit activity
    TypeScript 254 MIT 81 460 10 Updated Feb 7, 2025
  • pdf_metadata_extraction Public

    pdf_information_extraction

    huridocs/pdf_metadata_extraction’s past year of commit activity
    Python 4 0 0 8 Updated Feb 6, 2025
  • pdf-document-layout-analysis Public

    A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.

    huridocs/pdf-document-layout-analysis’s past year of commit activity
    Python 244 Apache-2.0 31 2 6 Updated Feb 4, 2025
  • queue-processor Public

    queue-processor

    huridocs/queue-processor’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Feb 4, 2025
  • NER-in-docker Public

    NER-in-docker

    huridocs/NER-in-docker’s past year of commit activity
    Python 0 0 0 7 Updated Feb 3, 2025
  • huridocs/dummy_extractor_services’s past year of commit activity
    Python 0 0 0 0 Updated Feb 3, 2025
  • pdf-text-extraction Public

    This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of text extraction from PDF files.

    huridocs/pdf-text-extraction’s past year of commit activity
    Makefile 27 Apache-2.0 1 2 0 Updated Feb 3, 2025
  • pdf-table-of-contents-extractor Public

    This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of identifying and structuring the document's TOC.

    huridocs/pdf-table-of-contents-extractor’s past year of commit activity
    Makefile 11 Apache-2.0 3 1 0 Updated Feb 3, 2025

Top languages

Loading…

Most used topics

Loading…